Watson Wake Word for voice commands

Question

I'm looking at using Watson's Speech to Text software to help drive voice commands for our product.

All the examples I've seen require the user to press a button before giving a command. Rather than having the user push a button, I'd like a "wake word" or keyword to signal the beginning of a command to our product. That is, I don't want to continuously stream sound to Watson's Speech to Text software. Instead, I'm looking for a way to have the user say a keyword or wake word that starts sending sound, and then let Watson's Speech to Text return the text of the command it heard.

For example, "OK, Google" starts sending sound to Google for speech to text.

Does IBM provide a way to create my own "OK, Google" keyword without having to send everything my application may hear to Watson's Speech to Text?

Stephen, where is this app going to run? Is it on some dedicated embedded product, or is this running on some existing platform (like a phone, tablet, or something similar)? — Daniel Toczala
Existing platform. Likely either a tablet (as a wall mounted kiosk) or on a general purpose Windows PC (as part of our Java application). — Stephen M -on strike-

Daniel Toczala · Accepted Answer · 2018-01-31T16:23:05

Right now the Watson Speech to Text service does not support a separate "wake word" detection module. Our current customers handle this with some edge device or service that runs on the client, such as Snowboy (https://snowboy.kitt.ai/) or something similar, and only start sending audio to Speech to Text once the wake word has been detected.
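
As a rough illustration of that split, here is a minimal sketch of the gating logic in Python. It is not Watson or Snowboy code: `detect_wake_word` and `send_frame` are hypothetical callables standing in for whatever local detector and Speech to Text streaming client you end up using, and the fixed-length command window (`max_command_frames`) is an assumption made to keep the example short. A real application would more likely end the window on silence or on a final transcript.

```python
from typing import Callable, List


class WakeWordGate:
    """Forward audio frames to a recognizer only after a local wake-word hit.

    Both callables are hypothetical placeholders:
      detect_wake_word(frame) -> bool  runs on the device (e.g. a local detector)
      send_frame(frame)                streams one frame to the cloud recognizer
    """

    def __init__(
        self,
        detect_wake_word: Callable[[bytes], bool],
        send_frame: Callable[[bytes], None],
        max_command_frames: int = 50,  # assumed fixed window, for illustration
    ) -> None:
        self._detect = detect_wake_word
        self._send = send_frame
        self._max = max_command_frames
        self._remaining = 0  # frames left in the current command window

    @property
    def listening(self) -> bool:
        """True while frames are being forwarded to the recognizer."""
        return self._remaining > 0

    def feed(self, frame: bytes) -> None:
        """Process one audio frame from the microphone."""
        if self._remaining > 0:
            # Inside a command window: forward the frame off the device.
            self._send(frame)
            self._remaining -= 1
        elif self._detect(frame):
            # Wake word heard locally: open a window. The wake-word frame
            # itself is not forwarded.
            self._remaining = self._max
        # Otherwise the frame is dropped and never leaves the device.


# Toy run with fake "audio": the detector fires on the literal frame b"WAKE".
sent: List[bytes] = []
gate = WakeWordGate(lambda f: f == b"WAKE", sent.append, max_command_frames=3)
for frame in [b"noise", b"WAKE", b"turn", b"on", b"lights", b"chatter"]:
    gate.feed(frame)
print(sent)  # [b'turn', b'on', b'lights']
```

The point of the design is that everything before the wake word is handled and discarded on the tablet or PC, so only the command audio is ever sent to the service.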

