-
Notifications
You must be signed in to change notification settings - Fork 660
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Early stopping configuration #38
Comments
This is pretty important. I'd like to see three criteria:
|
Here's a proposal for what I'd like to see: Parameters for flexible stopping criteria:
With these parameters, then you can implement it like this in EasyTrain.fit():
|
@gforman44 That looks pretty good. One thing I was thinking was that we could implement the early stopping with a Anyway, it sounds like you are really interested in this issue @gforman44. Do you want to implement it and submit a PR? |
Description
Early stopping configuration: Specifies the various configuration options for running training with early stopping.
1. Iteration termination conditions: how many epoch till termination.
2. score improvement termination condition - terminate training if best model score does not improve for N epochs
3. best expected score - terminate training once we achieved an expected score.
4. termination condition after certain time - terminate training after certain time
5. other termination conditions, if they are logical
Will this change the current api? How?
We can configure when model training will stop, when one of condition above is met.
Training should be implemented as listener, early stop configuration will listen for any conditions above and terminate training.
Who will benefit from this feature?
Everybody, we can easily configure when learning will end.
References
Reference implementation:
https://github.com/eclipse/deeplearning4j/blob/b5f0ec072f3fd0da566e32f82c0e43ca36553f39/deeplearning4j/deeplearning4j-nn/src/main/java/org/deeplearning4j/earlystopping/EarlyStoppingConfiguration.java
There are other implementation in different NN framework.
The text was updated successfully, but these errors were encountered: