
Feature Request: add timeout parameter to the .fit() method #10684

Open
fingoldo opened this issue Aug 8, 2024 · 6 comments

Comments

@fingoldo

fingoldo commented Aug 8, 2024

Adding a timeout parameter to the .fit() method, which would force the library to return the best solution found so far once the given number of seconds since the start of training has elapsed, would let users satisfy training SLAs when they have only a limited time budget to finish a model's training. It would also make fair comparison of different hyperparameter configurations possible.

Reaching the timeout should have the same effect as reaching the maximum number of iterations, perhaps with an additional warning and/or an attribute set so that the reason the training job finished is clear to the end user.

@RAMitchell
Member

Can you achieve this with a custom callback?

@fingoldo
Author

fingoldo commented Aug 8, 2024

I did not realize it could be used to solve this problem. If, while using early stopping, I return True from my custom callback to stop training, will the best iteration be set correctly by xgboost, or will there be some loss of training progress?

@jameslamb
Contributor

> Can you achieve this with a custom callback?

Just to connect these 2 conversations... that is what I suggested in the feature request opened in LightGBM at the same time: microsoft/LightGBM#6596 (comment)

@fingoldo
Author

Right, it seemed very natural to me to use a direct timeout instead of (or along with) n_estimators, and ideally I would like a universal parameter for this (similar to n_estimators) across the major gradient boosting libraries. In most cases, I'd say the exact maximum number of trees is not important to the user; it's the maximum time spent that actually matters. And some hyperparameter combinations can lead to vastly different runtimes even with the same n_estimators. A timeout parameter would solve this problem.

@fingoldo
Author

Last but not least, imagine that aliens have attacked the Earth and we only have one minute to compute the trajectories of their missiles with ML. If this feature request is approved, the responsible person just sets timeout=60, we intervene, and we survive.

