I am passing a .opus audio file to Google's Speech-to-Text API for transcription, using the following configuration:
- encoding = enums.RecognitionConfig.AudioEncoding.OGG_OPUS
- language_code = "en-US"
- sample_rate_hertz = 16000
I am getting the following error:
google.api_core.exceptions.GoogleAPICallError: None Unable to recognize speech, possible error in encoding or channel config. Please correct the config and retry the request.
I've also tried other encodings such as FLAC and LINEAR16, but with those I get None as the output.
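Since the error points at the encoding or channel config, one way to rule out a mismatch is to read the channel count and sample rate straight from the file's header. Below is a minimal sketch, assuming the file is a standard Ogg Opus stream whose first page carries the `OpusHead` identification header (RFC 7845). `read_opus_head` is a hypothetical helper name, not part of the Google client library, and the sample rate stored in the header is the original input rate, which is informational only.

```python
import struct


def read_opus_head(data: bytes) -> dict:
    """Parse the OpusHead identification header from the start of an Ogg Opus stream.

    `data` should hold the first bytes of the file; a few hundred bytes are
    normally enough because OpusHead sits in the first Ogg page.
    """
    # Every Ogg page starts with the 4-byte capture pattern "OggS".
    if data[:4] != b"OggS":
        raise ValueError("not an Ogg container (missing 'OggS' capture pattern)")

    # The first packet of an Ogg Opus stream begins with the magic "OpusHead".
    start = data.find(b"OpusHead")
    if start < 0:
        raise ValueError("no OpusHead packet found; stream may not be Opus")

    # Fields after the 8-byte magic, all little-endian:
    # version (u8), channel count (u8), pre-skip (u16),
    # input sample rate (u32), output gain (i16), channel mapping family (u8).
    version, channels, pre_skip, input_rate, gain, family = struct.unpack_from(
        "<BBHIhB", data, start + 8
    )
    return {
        "version": version,
        "channels": channels,
        "pre_skip": pre_skip,
        "input_sample_rate_hz": input_rate,
        "output_gain": gain,
        "channel_mapping_family": family,
    }
```

Calling `read_opus_head(open(path, "rb").read(4096))` on the file (with `path` pointing to the .opus file) returns the header fields, which can then be compared against `sample_rate_hertz` and the channel settings in the request config. A `ValueError` here would suggest the file is not actually Ogg Opus despite its extension.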
Do Opus audio files require an additional configuration field, and what should the configuration look like?