Speech to Text from Own Sound File

The API does not support transcribing your own sound file directly, but see this blog post and its comments for a potential workaround. For a better transcription, also make sure that your file contains high-quality audio (at least 16-bit samples at a 16 kHz sample rate).
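
Before trying any workaround, it can help to confirm that a file actually meets the 16-bit / 16 kHz minimum mentioned above. The following is a minimal sketch using Python's standard `wave` module. It assumes the file is an uncompressed PCM WAV file; the function name `meets_minimum_quality` and the constant names are made up for this illustration.

```python
import wave

# Thresholds taken from the recommendation above (assumed to be minimums).
MIN_SAMPLE_WIDTH_BITS = 16
MIN_SAMPLE_RATE_HZ = 16000


def meets_minimum_quality(path):
    """Return (ok, bits_per_sample, sample_rate_hz) for a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        bits = wav.getsampwidth() * 8   # getsampwidth() is in bytes
        rate = wav.getframerate()       # samples per second
    ok = bits >= MIN_SAMPLE_WIDTH_BITS and rate >= MIN_SAMPLE_RATE_HZ
    return ok, bits, rate
```

Calling `meets_minimum_quality("recording.wav")` (a hypothetical file name) returns a tuple whose first element says whether the file passes both checks. Files in other containers, such as MP3, would need to be converted to WAV first, since `wave` only reads WAV.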

Speech to Text (Voice Recognition) Directly from Audio / Transcription

There is now a relatively new service that offers automatic speech-to-text transcription, along with a great web interface for human editing of the results:

https://trint.com/

We've used it and been pleased with the results. The transcription is certainly not perfect, but it's a great start, and the results are easy to edit by hand.

There is also now a new API and service available from IBM Bluemix/Watson. You can try the free demo here:

https://speech-to-text-demo.mybluemix.net/

This service does a pretty decent job of converting audio (from the microphone or from an audio file) into text. Currently, at least in the demo, it doesn't appear to accept MP3, but it does accept WAV and other formats. The service has a full API and is primarily designed to be built into applications.
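
Since the demo appears to be picky about file formats, a quick check of what a file really contains (regardless of its extension) can save a failed upload. This is a rough sketch that inspects the first bytes of a file. It does not call the IBM service at all; the function names are invented for this example, and the MP3 check is a heuristic rather than a full parser.

```python
def looks_like_wav(header):
    """WAV files start with 'RIFF', a 4-byte size, then 'WAVE'."""
    return len(header) >= 12 and header[0:4] == b"RIFF" and header[8:12] == b"WAVE"


def looks_like_mp3(header):
    """Heuristic: an ID3v2 tag, or an MPEG frame sync (11 set bits)."""
    if header[0:3] == b"ID3":
        return True
    return len(header) >= 2 and header[0] == 0xFF and (header[1] & 0xE0) == 0xE0


def sniff_audio_format(path):
    """Return 'wav', 'mp3', or 'unknown' based on the file's leading bytes."""
    with open(path, "rb") as f:
        header = f.read(12)
    if looks_like_wav(header):
        return "wav"
    if looks_like_mp3(header):
        return "mp3"
    return "unknown"
```

If `sniff_audio_format` reports `"mp3"`, converting the file to WAV before uploading is probably the safer route, at least for the demo.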
