You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I would like to start documenting good practices of model repos to add to our documentation.
Some come to mind rather quickly
One model per repo (avoid having multiple models in the same repo)
Add metadata to the model card
Add metrics to the metadata of the model card
How do we want to encourage users to have multiple checkpoints in a single repo? There was a related discussion in GPT-J and for other contributions
One branch per checkpoint?
One commit per checkpoint?
My suggestion
When using checkpoints for version control, use a commit per checkpoint
For example, Mistral has 600 checkpoints per model. Each checkpoint correspond to a different step. In that sense, I think it makes sense to have a commit/tag per checkpoint
When using checkpoints of a model with slightly different characteristics, use a branch per checkpoint
For example, GPT-J 6B has a half precision checkpoint and a single precision checkpoint.
In #13022, @xloem raises an important point about git-lfs:
Just a note that for organizing models outside the hugging face cache, it is more convenient to have subfolders or separate repos for different content, because git-lfs can be very slow filtering many gigabytes when switching branches. Not planning on arguing the point, just making sure the use-case is shared.
I would like to start documenting good practices of model repos to add to our documentation.
Some come to mind rather quickly
How do we want to encourage users to have multiple checkpoints in a single repo? There was a related discussion in GPT-J and for other contributions
My suggestion
I'm just gathering ideas so any are welcome!
cc @patrickvonplaten @julien-c @LysandreJik @lewtun @NielsRogge I hope I did not forget anyone
The text was updated successfully, but these errors were encountered: