Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature request: speech-to-text & text-to-speech #113

Open
zenflow opened this issue Dec 31, 2022 · 11 comments
Open

Feature request: speech-to-text & text-to-speech #113

zenflow opened this issue Dec 31, 2022 · 11 comments

Comments

@zenflow
Copy link

zenflow commented Dec 31, 2022

It would be really cool if this app supported voice conversations with ChatGPT.
Here's a chrome extension that enhances the official site with that feature: https://github.com/C-Nedelcu/talk-to-chatgpt

@lencx
Copy link
Owner

lencx commented Dec 31, 2022

Its voice seems to be provided by chrome, so it may not be possible to do it without chrome. I can look into it.

@mefengl
Copy link

mefengl commented Feb 12, 2023

expose an API that can be called from Shortcuts will be great, so that can be integrated with Siri and other automation

@Ethkuil
Copy link

Ethkuil commented Feb 13, 2023

Its voice seems to be provided by chrome, so it may not be possible to do it without chrome. I can look into it.

It's all right to use other APIs. It would be much more appealing for people who want to practice listening and speaking of a foreign language. You know, for example to prepare for TOEFL and IELTS.

@cyhhao
Copy link
Contributor

cyhhao commented Feb 14, 2023

Great idea, I'm trying to practice my English with ChatGPT.

Maybe I can take some time to help this author with this feature.

@melbarra88
Copy link

melbarra88 commented Mar 6, 2023

As an improvement to this feature, I suggest allowing the user to change the voice in the chat window for each message (overriding the default voice configured globally).

I usually use different languages when using Chat GPT, and it would be great if I could change the synthesis voice...

It would be even better if the language of the message was automatically detected and a voice corresponding to that language was automatically selected for the speech synthesis.

@code-whale
Copy link

Can users use their own API key for speech synthesis in future versions, such as using Azure?

@zenflow
Copy link
Author

zenflow commented Mar 12, 2023

Can users use their own API key for speech synthesis in future versions, such as using Azure?

Ideally it wouldn't use a paid service for this and you wouldn't need any API key..
Speech synthesis and speech recognition can be done on local machine.

@code-whale
Copy link

@zenflow Thank you for your reply, but please forgive me, for only representing my personal opinion that the current speech synthesis is not perfect and sounds too 'mechanical' in some pronunciations. Therefore, I hope to use my own API for speech synthesis. I think Azure's speech synthesis is very suitable for people like me who want to practice English speaking and listening skills:)

@zenflow
Copy link
Author

zenflow commented Mar 14, 2023

that the current speech synthesis is not perfect and sounds too 'mechanical' in some pronunciations

I had not considered a reason like this for using a paid service for speech synthesis!
It makes sense!

That said, I hope that an API key is not required for those of us who are fine with the mechanical speech synthesis our local machines can do by themselves.

@Gourdbaby
Copy link

have this feature solved? I am also a person who want to practice English speaking skills. If this app can support the feature, I would very appreciate it.

@shawn-ann
Copy link

I also want this feature

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

9 participants