Skip to content

Difference in performance between mikropml and caret #331

Answered by kelly-sovacool
victor5lm asked this question in Q&A
Discussion options

You must be logged in to vote

There are a few reasons I would expect different performance values and different run times with these two code samples:

  1. You set find_feature_importance = TRUE, which takes a considerable amount of run time. Caret's train() doesn't do permutation feature importance.
  2. You set calculate_performance = TRUE, which calculates the model performance on the test set, although it shouldn't be too slow. Caret's train() doesn't do this step.
  3. You let mikropml randomly split the dataset into a training and testing set, rather than specify the exact training set as you did for caret. You can give training_frac a vector of indices if you want to specify the exact training set. (Note: since you set the s…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by victor5lm
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants