diff --git a/docs/guides/capabilities/audio/transcriptionGoogle.mdx b/docs/guides/capabilities/audio/transcriptionGoogle.mdx index e7ed15f86..3724deece 100644 --- a/docs/guides/capabilities/audio/transcriptionGoogle.mdx +++ b/docs/guides/capabilities/audio/transcriptionGoogle.mdx @@ -4,7 +4,7 @@ sidebar_position: 4 sidebar_slug: transcriptions-google --- -To add the realtime audio transcriptions in a Dyte meeting you can use Google Cloud Speech-to-Text +To add the realtime audio transcriptions in a Dyte meeting you can use Google Cloud Speech-to-Text and Cloud translation APIs. These Google services are paid, a Google Cloud account is required to proceed. @@ -18,7 +18,7 @@ This integration is Web only at the moment ## 1. Setup Google Cloud Credentials -You must have a service account with GCP (Google Cloud Platform) to use Google transcriptions. Please create a project in that account that allows Google Media Translations and Google Translations API. +You must have a project & a service account with GCP (Google Cloud Platform) to use Google transcriptions. Make sure that service account allows Google Speech-to-Text and Google Translation API. Once done, download the keys for the service account. @@ -26,9 +26,9 @@ Once done, download the keys for the service account. Setup a server to forward the Audio Data from client to Google Cloud. You don't want to put your GCP credentials on client side and therefore need a server which forwards audio data to Google Cloud -For this, we have provided a sample in NodeJS for you to checkout (dyte-io/google-transcription)[https://github.com/dyte-io/google-transcription/tree/main/server]. Please find it here. Currently, we only have NodeJS samples; if you're working on a different backend, feel free to port this code or connect with us to help you port it. +For this, we have provided a sample in NodeJS for you to checkout (dyte-io/google-transcription)[https://github.com/dyte-io/google-transcription/tree/main/server]. Please find it here. Currently, we only have NodeJS samples; if you're working on a different backend, feel free to port this code. -To use this sample, please clone this using the following command. +To use this sample, please clone the repository using the following command. ``` git clone git@github.com:dyte-io/google-transcription.git @@ -50,7 +50,7 @@ Note: PRIVATE_KEY should be in a single line. Try picking the value from the ser npm install ``` -This would automatically install @google-cloud/media-translation, @google-cloud/speech, and @google-cloud/translate. +This would automatically install @google-cloud/speech, and @google-cloud/translate. ``` npm run dev @@ -85,7 +85,7 @@ const speech = new DyteGoogleSpeechRecognition({ meeting, // Dyte meeting object from DyteClient.init target: 'hi', // Language that the current user wants to see source: 'en-US', // Language that the current user would speak in - baseUrl: , // Backend URL from step 2.4 + baseUrl: , // Backend URL from step 2.2 }); speech.on('transcription', async (data) => {