Skip to content

Commit

Permalink
Update moshi.md
Browse files Browse the repository at this point in the history
  • Loading branch information
ylacombe authored Oct 2, 2024
1 parent 3eff211 commit 298c4a3
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion docs/source/en/model_doc/moshi.md
Original file line number Diff line number Diff line change
Expand Up @@ -72,7 +72,7 @@ This implementation has two main aims:

It is designed for intermediate use. We strongly recommend using the original [implementation](https://github.com/kyutai-labs/moshi) to infer the model in real-time streaming.

<Tip>
</Tip>

**1. Model generation**

Expand Down Expand Up @@ -101,6 +101,8 @@ To follow the example of the following image, "Hello, I'm Moshi" could be transf
<img src="https://huggingface.co/datasets/ylacombe/benchmark-comparison/resolve/main/moshi_text_sync.png">
</div>

</Tip>


[`MoshiForConditionalGeneration.generate`] then auto-regressively feeds to itself its own audio stream, but since it doesn't have access to the user input stream while using `transformers`, it will thus **assume that the user is producing blank audio**.

Expand Down

0 comments on commit 298c4a3

Please sign in to comment.