Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The Japanese sample audio for “Zero-Shot In-Context Generation” does not match the context. #1043

Open
1m-N00b opened this issue Mar 6, 2025 · 1 comment

Comments

@1m-N00b
Copy link

1m-N00b commented Mar 6, 2025

Describe the bug
The Japanese sample audio for “Zero-Shot In-Context Generation” does not match the context.
Everything up to 14 seconds of audio is incoherent Japanese and does not exist in the context; after 14 seconds, the content is consistent with the context.
Similar Issues:#303

To Reproduce
Steps to reproduce the behavior:

  1. Go to Zero-shot In-context Generation.
  2. Select the sample in the lower right corner as shown in the screenshot.

Expected behavior
Audio files should be placed after 14 seconds or the context should be added correctly.
(However, there seems to be a problem on the audio generation side.)

Screenshots
Image

Desktop (please complete the following information):

  • Browser: firefox,chromium
@1m-N00b 1m-N00b changed the title The Japanese sample audio for “Zero-Shot In-Context Generation” does not fit the context. The Japanese sample audio for “Zero-Shot In-Context Generation” does not match the context. Mar 6, 2025
@aluminumbox
Copy link
Collaborator

well this should be modified if we have more Japanese data, or try change Chinese character to Japanese character

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants