metadata_update()
function inserts verified=True
in self-reported evaluations
#1185
Labels
bug
Something isn't working
Describe the bug
With the refactor of evaluation metadata in #940, it seems that
metadata_update()
can no longer differentiate between self-reported evaluations and those from Hugging Face's evaluation service.As a result, updating a model which has both types of evaluation metrics will result in a
verified=True
entry getting inserted into the self-reported metrics. This is undesirable, because theverified
field is used to distinguish both types of evaluations on leaderboards etc.Another effect from #940 is that evaluations with the same task type / name are merged together, which isn't ideal because logically a self-reported evaluation and an automated one should each have their own entry in the
model-index.results
array.I'll take a stab at fixing this if @Wauplin doesn't beat me to it :)
cc @abhishekkrthakur
Reproduction
Run the following snippet to open a Hub PR and see the insertion of the
verified=True
field:Logs
No response
System info
The text was updated successfully, but these errors were encountered: