Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for CrossValidation: Enhancement Request #42

Open
Alan-Penkar opened this issue Feb 6, 2023 · 2 comments
Open

Support for CrossValidation: Enhancement Request #42

Alan-Penkar opened this issue Feb 6, 2023 · 2 comments

Comments

@Alan-Penkar
Copy link

I am using RayDP with Spark and am using this package with Ray Tune for HyperParameter Optimization with the lightGBM regressor. Unless there is something I'm missing, there's no way to use lgbm's native cross validation as in Ray's examples, this would be a huge help to model accuracy when training large models.

@Alan-Penkar Alan-Penkar changed the title Support for CrossValidation Support for CrossValidation: Enhancement Request Feb 6, 2023
@Yard1
Copy link
Member

Yard1 commented Feb 6, 2023

I am not sure whether there's a built-in way of doing that in other distributed lightgbm implementations. I think the cv function calls train underneath - as a workaround, you should be able to replace that with lightgbm-ray's train function.

@Alan-Penkar
Copy link
Author

Thanks for the quick response - I don't actually see lightgbm.train being called in the .cv function, but I will continue looking through the nested calls. The bigger question is how to make the _make_n_folds function within lightgbm.cv compatible with the RayDMatrix objects that are used by lightgbm_ray? I haven't seen any methods on a RayDMatrix that would lead me to believe data manipulation would be straightforward, but if I've missed it I'd certainly appreciate you pointing me in the right direction.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants