ENH: Support fish speech reference audio #2542

codingl2k1 · 2024-11-11T20:36:55Z

Fixes: Enable reference-audio for Fish-Speech #2532

qinxuye · 2024-11-15T05:20:00Z

Is reference_audio in fish equivalent to prompt_speech in cosyvoice? I wonder if it's possible to reuse the option?

codingl2k1 · 2024-11-15T08:54:22Z

We can reuse the option; users would then need to pass prompt_speech instead of reference_audio. I am not sure which name is better.

qinxuye · 2024-11-15T09:34:20Z

Yeah, we can unify the APIs since prompt_speech has already been added. We can note in the docs that it serves as reference_audio for fish speech.
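For illustration, the unification discussed here could look roughly like the sketch below: callers always pass prompt_speech, and a thin layer translates it into whatever name each backend expects. The mapping table, the model family keys, and the function `to_backend_kwargs` are hypothetical names made up for this example; they are not taken from the PR's code.

```python
from typing import Dict, Optional

# Hypothetical mapping from model family to the backend's own keyword for
# the reference clip. Keys and values are illustrative assumptions.
_BACKEND_OPTION_NAME: Dict[str, str] = {
    "cosyvoice": "prompt_speech",
    "fish_speech": "reference_audio",
}


def to_backend_kwargs(
    model_family: str, prompt_speech: Optional[bytes]
) -> Dict[str, bytes]:
    """Translate the unified prompt_speech option into a backend keyword."""
    if prompt_speech is None:
        # No reference clip supplied, so nothing is forwarded.
        return {}
    return {_BACKEND_OPTION_NAME[model_family]: prompt_speech}
```

With a layer like this, users only need to learn one option name, and the docs can state that prompt_speech plays the role of reference_audio for fish speech.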

codingl2k1 · 2024-11-15T09:35:29Z

I will fix it.

qinxuye · 2024-11-18T10:35:42Z

I have some comments:

We could set enable_reference_audio to True automatically when prompt_speech is specified.
We could support passing compile=True at model loading, which provides a significant performance improvement.
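A minimal sketch of the first comment, assuming the flag can be derived purely from whether a reference clip was supplied. `FishSpeechRequest` and `make_request` are hypothetical names used only for this example and do not reflect the PR's actual implementation; the compile=True suggestion is not covered here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FishSpeechRequest:
    """Hypothetical container for the options discussed in this thread."""

    text: str
    prompt_speech: Optional[bytes] = None
    enable_reference_audio: bool = False


def make_request(
    text: str, prompt_speech: Optional[bytes] = None
) -> FishSpeechRequest:
    # Derive the flag from the presence of a reference clip so callers never
    # set it by hand. Empty bytes are treated the same as no clip at all.
    return FishSpeechRequest(
        text=text,
        prompt_speech=prompt_speech,
        enable_reference_audio=bool(prompt_speech),
    )
```

Deriving the flag keeps the two options from drifting apart: a caller cannot pass a clip and forget to enable it.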

qinxuye · 2024-11-19T02:55:25Z

Looks like our CI cannot run with compile=True.

qinxuye

LGTM

XprobeBot added the enhancement label (New feature or request) Nov 11, 2024

XprobeBot added this to the v0.16 milestone Nov 11, 2024

codingl2k1 added 4 commits November 19, 2024 12:42

Support fish speech reference audio

338ac46

Reuse prompt speech

c615c8b

Fix

657d8b3

Fix CI

66f77ca

qinxuye force-pushed the enh/fish_speech_reference_audio branch from 6c6bda2 to 66f77ca November 19, 2024 12:45

qinxuye approved these changes Nov 19, 2024

qinxuye marked this pull request as ready for review November 19, 2024 13:24

qinxuye merged commit 0cdfb43 into xorbitsai:main Nov 19, 2024
12 of 13 checks passed
