Skip to content
This repository has been archived by the owner on Sep 18, 2024. It is now read-only.

chore: release 079 #758

Merged
merged 1 commit into from
Jul 11, 2023
Merged

chore: release 079 #758

merged 1 commit into from
Jul 11, 2023

Conversation

bwanglzu
Copy link
Member

@bwanglzu bwanglzu commented Jul 11, 2023

Draft release note:

Release Note Finetuner 0.7.9

This release covers Finetuner version 0.7.9, including dependencies finetuner-core 0.13.9.

This release contains 1 new features and 1 refactoring.

🆕 Features

Add jina embeddings suite. (#757)

We have made contributions to the open source community by releasing three pre-trained embedding models:

  1. jima-embedding-s-en-v1: 35 million parameter compat embedding model.
  2. jina-embedding-b-en-v1: 110 million parameter standard sized embedding model.
  3. jina-embedding-l-en-v1: 330 million parameter large embeddding model.

All three embedding models have been trained using Jina AI's Linnaeus-Clean dataset. This dataset consists of 380 million pairs of sentences, which include both query-document pairs. These pairs were obtained from various domains and were carefully selected through a thorough cleaning process. The Linnaeus-Full dataset, from which the Linnaeus-Clean dataset is derived, originally contained 1.6 billion sentence pairs.

If you wish to utilize these embeddings with Finetuner (Apache 2.0), please follow the instructions below:

!pip install finetuner
import finetuner

model = finetuner.build_model('jinaai/jina-embedding-s-en-v1')
embeddings = finetuner.encode(
    model=model,
    data=['how is the weather today', 'What is the current weather like today?']
)
print(finetuner.cos_sim(embeddings[0], embeddings[1]))

⚙ Refactoring

Change installation behavior. (#757)

With the launch of Finetuner 0.7.9, installing it using pip install finetuner will automatically include the necessary torch-related dependencies. This enables Finetuner to function as an optimal provider of embedding models. If you intend to fine-tune an embedding model, please ensure that you install Finetuner with the additional dependencies by using the command pip install "finetuner[full]".

🤟 Contributors

We would like to thank all contributors to this release:


---

- [ ] This PR references an open issue
- [ ] I have added a line about this change to CHANGELOG

@bwanglzu bwanglzu marked this pull request as ready for review July 11, 2023 15:14
Copy link
Member

@gmastrapas gmastrapas left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

version number in setup?

@bwanglzu
Copy link
Member Author

@gmastrapas it has been changed in the last PR ..

@bwanglzu
Copy link
Member Author

wil trigger the release once all model public

@bwanglzu bwanglzu merged commit 2af7a05 into main Jul 11, 2023
@bwanglzu bwanglzu deleted the chore-release-079 branch July 11, 2023 15:37
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants