Add Word Embedding (with Pooling) to benchmark #950

Open
dcecchini opened this issue Jan 2, 2024 · 0 comments
Labels
⏭️ Next Release Issues or Request for the next release

Comments

dcecchini commented Jan 2, 2024

Add word embedding models to the embedding benchmark as well. Because these models produce one vector per token, we need a pooling strategy to turn the word embeddings into a single sentence embedding. Common pooling strategies are:

  • Averaging the word embeddings of the sentence (mean pooling)
  • Using the embedding of the special token [CLS] or <s> (depending on the model) as the representation of the sentence (CLS pooling)
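
A minimal sketch of the two strategies, written with plain Python lists so it has no dependencies. The function names `mean_pool` and `cls_pool`, the toy vectors, and the use of an attention mask to skip padding are assumptions for illustration, not part of the benchmark. The sketch also assumes the special token sits at position 0, as it does for [CLS] in BERT-style and <s> in RoBERTa-style tokenizers.

```python
from typing import List, Sequence

Vector = List[float]


def mean_pool(token_embeddings: Sequence[Sequence[float]],
              attention_mask: Sequence[int]) -> Vector:
    """Average the embeddings of non-padding tokens (mask value 1)."""
    kept = [vec for vec, keep in zip(token_embeddings, attention_mask) if keep]
    if not kept:
        raise ValueError("attention_mask selects no tokens")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


def cls_pool(token_embeddings: Sequence[Sequence[float]]) -> Vector:
    """Use the first token's embedding ([CLS] or <s>) as the sentence vector.

    Assumes the special token is at position 0.
    """
    if not token_embeddings:
        raise ValueError("empty token sequence")
    return list(token_embeddings[0])


# Toy input (hypothetical): 4 tokens with 2-dimensional embeddings;
# the last token is padding and is excluded by the mask.
tokens = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [9.0, 9.0]]
mask = [1, 1, 1, 0]

mean_vec = mean_pool(tokens, mask)  # [3.0, 4.0]
cls_vec = cls_pool(tokens)          # [1.0, 2.0]
print(mean_vec)
print(cls_vec)
```

In a real run the per-token vectors would come from the model's last hidden layer, and both functions would be applied per sentence before the benchmark's similarity or retrieval step.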
@ArshaanNazir ArshaanNazir added the ⏭️ Next Release Issues or Request for the next release label Feb 2, 2024