New performance threshold for nightly tests #330
Unanswered
djdameln
asked this question in
Feature Requests
Replies: 1 comment
-
Another idea (or in addition) can be to run benchmarking script in the nightly and collect the csv file. We can then track these metrics each day to see if there is any constant decrease in performance for a model. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Currently, the nightly tests use a model- and category-specific performance score threshold, obtained by running a number of repeated training runs, and collecting the lowest observed scores per model and category. This is problematic, because it's always possible to observe an even lower score without anything being wrong with the model.
I propose the following alternative solution:
Beta Was this translation helpful? Give feedback.
All reactions