-
Notifications
You must be signed in to change notification settings - Fork 505
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to decide value of time_budget? #155
Comments
Hi @dsbyprateekg, Thanks for your interest and your question. There is in general no absolute answer to this question, but here are some tips that might be useful:
The time budget used in the AutoML benchmark are as follows: ‘small’: 1h-4h, According to the results in the FLAML paper, the time needed to reach or surpass the best performance reported in the AutoML benchmark can be greatly reduced if FLAML is used. The time needed for different dataset categories are as follows, ‘small’: 1m-10m, You can use the AutoML benchmark as a reference to decide which category your dataset belongs to. And use the suggested time budget (especially in terms of order of magnitude) mentioned above accordingly.
In addition, it will be great if you could share more information about your use cases. We might be able to provide more accurate answers/suggestions accordingly. Thanks! |
Thanks a lot, @qingyun-wu and your entire team for this amazing work. I am testing FLAML for the first time for a challenge and the datasets have the following sizes: I have two target columns to predict and the scoring I am using is as below: Target1score1 = max(0, 100*metrics.f1_score(actual["Target1"], predicted["Target1"], average="macro")) Target2score2 = max(0, 100*metrics.f1_score(actual["Target2"], predicted["Target2"], average="macro")) Final scorescore = (score1/2)+(score2/2)` It seems my dataset is small. I have tried with |
Thanks @sonichi , with this new version 0.5.12, I am able to see a message in the console suggesting to increase the time budget. |
Hi,
Is there any way we can find the best value for the time_budget as per our dataset?
Please share some tips.
The text was updated successfully, but these errors were encountered: