Speech to text with Amazon Transcribe
In the previous section, we learned about text to speech. In this section, we will learn about speech to text and the service that provides this: Amazon Transcribe. It is an automatic speech recognition service that uses pre-trained deep learning models, which means that we don't have to train on petabytes of data to produce a model; Amazon does this for us. We just have to use the APIs that are available to transcribe audio files or video files; it supports a number of different languages and custom vocabulary too. Accuracy is the key and through custom vocabulary, you can enhance it based on the desired domain or industry:
Some common uses of Amazon Transcribe include the following:
- Real-time audio streaming and transcription.
- Transcripting pre-recorded audio files.
- Enable text searching from a media file by combining...