-
Notifications
You must be signed in to change notification settings - Fork 11.7k
Open
Labels
area/coreIssues related to User Interface, OS Support, Core FunctionalityIssues related to User Interface, OS Support, Core Functionalitypriority/p2Important but can be addressed in a future release.Important but can be addressed in a future release.status/need-triageIssues that need to be triaged by the triage automation.Issues that need to be triaged by the triage automation.
Description
What would you like to be added?
Suggestion: Allow for the integration of Speech-to-Text (STT) and Text-to-Speech (TTS) CLI tools to enable voice-controlled interaction with the Gemini CLI agent.
Mechanism: Users could build a wrapper script that:
1. Captures spoken input via an STT CLI tool, converting it to text.
2. Pipes this text as a prompt to the Gemini CLI.
3. Captures the Gemini CLI's text response.
4. Pipes the text response to a TTS CLI tool for spoken output.
Why is this needed?
This would create a hands-free, more natural, and voice-controlled user experience, similar to interacting with an AI assistant like JARVIS.
Additional context
No response
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
area/coreIssues related to User Interface, OS Support, Core FunctionalityIssues related to User Interface, OS Support, Core Functionalitypriority/p2Important but can be addressed in a future release.Important but can be addressed in a future release.status/need-triageIssues that need to be triaged by the triage automation.Issues that need to be triaged by the triage automation.