
Make token optional and private an argument, add template #39

Merged
merged 4 commits into main on Sep 27, 2024

Conversation

stephantul
Collaborator

No description provided.

@stephantul stephantul requested a review from Pringled September 26, 2024 09:14
Comment on lines 31 to 44
Alternatively, you can distill your own model using the `distill` method:
```python
from model2vec.distill import distill

# Choose a Sentence Transformer model
model_name = "BAAI/bge-base-en-v1.5"

# Distill the model
m2v_model = distill(model_name=model_name, pca_dims=256)

# Save the model
m2v_model.save_pretrained("m2v_model")
```

Contributor

@tomaarsen commented on Sep 26, 2024


I actually like this bit; it helps push people interested in the work to explore it themselves. My comment that "distillation should be secondary in the model card" was referring to:

Model2Vec distills a Sentence Transformer into a small, static model.
This model is ideal for applications requiring fast, lightweight embeddings.

This first sentence isn't really useful for someone just looking at the model card. Perhaps a better intro is:

This Model2Vec model is a distilled version of a Sentence Transformer that uses static embeddings, allowing text embeddings to be computed orders of magnitude faster on both GPU and CPU. It is designed for applications where computational resources are limited or where real-time performance is critical.

Or you can even use the Jinja template:

```jinja
This Model2Vec model is a distilled version of {% if base_model %}the [{{ base_model }}](https://huggingface.co/{{ base_model }}){% else %}a{% endif %} Sentence Transformer that uses static embeddings, allowing text embeddings to be computed orders of magnitude faster on both GPU and CPU. It is designed for applications where computational resources are limited or where real-time performance is critical.
```

And then you'll get something like:

This Model2Vec model is a distilled version of the BAAI/bge-large-en-v1.5 Sentence Transformer that uses static embeddings, allowing text embeddings to be computed orders of magnitude faster on both GPU and CPU. It is designed for applications where computational resources are limited or where real-time performance is critical.

But you then do have to make sure that `base_model` actually exists on the Hub.
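For reference, here is a minimal sketch of how that conditional intro could be rendered with the Jinja2 library (the template text is the one suggested above; the model name is just an illustrative value):

```python
from jinja2 import Template

# The conditional intro template suggested above: link the base model
# if it is known, otherwise fall back to generic phrasing.
template = Template(
    "This Model2Vec model is a distilled version of "
    "{% if base_model %}the [{{ base_model }}](https://huggingface.co/{{ base_model }})"
    "{% else %}a{% endif %} Sentence Transformer that uses static embeddings."
)

# With a base model set, the rendered text links to it on the Hub.
print(template.render(base_model="BAAI/bge-base-en-v1.5"))

# Without a base model, Jinja's undefined variable is falsy,
# so the {% else %} branch is taken.
print(template.render())
```

Note that an undefined `base_model` is falsy in Jinja, so the template degrades gracefully when the metadata is missing.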

Member


This is a nice suggestion! I updated the model card (added back the distillation part, and implemented your suggestion).

@Pringled Pringled merged commit 2f09539 into main Sep 27, 2024
@Pringled Pringled deleted the make_token_optional_and_private_an_argument branch September 27, 2024 08:03
@stephantul stephantul mentioned this pull request Sep 28, 2024