Google Cloud Speech: word start time

Question

I'm looking at using Google Cloud Speech to convert long-form narrated audio files and I need to know the start time of each phrase in the audio file. Is there a way to do this with Google Cloud Speech? I'm currently working with the transcribe_async.py. Thanks.

blambert blambert · Accepted Answer · 2017-02-11T06:19:12

This is not possible with Google Cloud Speech. If that information is important to you, you may need to look at other ASR systems. I know that offline, non-hosted ASR systems like Kaldi and CMU Sphinx will give you this information. I don't know if or which hosted ASR systems can provide that information.

Google Cloud Speech: word start time

2 Answers