Voice Gateway

Speech-to-Text


Voice API: Speech-to-Text

The Speech-to-Text (STT) service lets an application submit a long or short audio stream and have the speech in it transcribed as text. The service can be used in interactive systems (e.g. voice-controlled systems) or for offline transcription of speech.


  • Upload an audio file (MP3)
  • Storage of text from STT-converted speech
  • Retrieve text of speech once STT conversion is complete
  • Real-time transcription of speech (coming soon)

The Speech-to-Text service is one of many building blocks that are being released as part of the Voice API over the coming months.


Note that the Speech-to-Text service may not convert speech to the correct text in all cases. Care should therefore be taken when using the Voice API.




Convert speech to text using the Voice Gateway Speech-to-Text service with the RESTful Voice API
Example using cURL

Submit conversion request

Request:

curl --request POST "https://api.melroselabs.com/voice/speechtotext/" --header 'Content-Type: audio/mp3' --header 'x-api-key: [API_KEY]' --data-binary @file.mp3

Response:

{"transactionID": "1ccead78-6550-4aac-a6b4-a4942b908659"}
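
The same submission can be sketched in Python using only the standard library. This is a minimal sketch rather than an official client: the function names (build_submit_request, parse_transaction_id, submit) are hypothetical, and the URL, headers and "transactionID" field are taken from the cURL example above.

```python
import json
import urllib.request

# Endpoint taken from the cURL example; everything else here is illustrative.
API_URL = "https://api.melroselabs.com/voice/speechtotext/"


def build_submit_request(audio_bytes, api_key):
    """Build the POST request that uploads MP3 audio for conversion."""
    return urllib.request.Request(
        API_URL,
        data=audio_bytes,
        method="POST",
        headers={"Content-Type": "audio/mp3", "x-api-key": api_key},
    )


def parse_transaction_id(response_body):
    """Extract the transaction ID from the JSON response body."""
    return json.loads(response_body)["transactionID"]


def submit(path, api_key):
    """Upload the file at `path` and return its transaction ID.

    Performs a real network call; HTTP errors surface as urllib exceptions.
    """
    with open(path, "rb") as f:
        request = build_submit_request(f.read(), api_key)
    with urllib.request.urlopen(request) as response:
        return parse_transaction_id(response.read())
```

Calling submit("file.mp3", "[API_KEY]") with a valid key would be expected to return a transaction ID string like the one in the response above. Request construction is kept separate from the network call so it can be checked without contacting the service.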

Retrieve text of speech

Request:

curl --request GET "https://api.melroselabs.com/voice/speechtotext/1ccead78-6550-4aac-a6b4-a4942b908659" --header 'x-api-key: [API_KEY]'

Response:

{"text": "Alice was beginning to get very tired of sitting by her sister on the bank and of having nothing to do."}
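
Because the text is retrieved in a separate request, a client may need to wait until conversion has finished. The following Python sketch polls the retrieval endpoint. It rests on an assumption that this page does not confirm: that a response for an unfinished conversion is valid JSON without a "text" field. The function names and the retry parameters (attempts, delay_seconds) are hypothetical and should be adjusted to the service's actual behaviour.

```python
import json
import time
import urllib.request

# Endpoint taken from the cURL example; everything else here is illustrative.
API_URL = "https://api.melroselabs.com/voice/speechtotext/"


def fetch_result(transaction_id, api_key):
    """GET the result for one transaction and return the decoded JSON object."""
    request = urllib.request.Request(
        API_URL + transaction_id,
        headers={"x-api-key": api_key},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())


def wait_for_text(transaction_id, api_key, fetch=fetch_result,
                  attempts=10, delay_seconds=2.0):
    """Poll until the response contains "text", or give up after `attempts` tries.

    ASSUMPTION: an unfinished conversion returns JSON without a "text" key.
    If the service signals this differently (for example with an HTTP error
    status), the check below must be changed accordingly.
    """
    for attempt in range(attempts):
        result = fetch(transaction_id, api_key)
        if "text" in result:
            return result["text"]
        # Sleep between attempts, but not after the final one.
        if attempt < attempts - 1:
            time.sleep(delay_seconds)
    raise TimeoutError("no text after %d attempts" % attempts)
```

The fetch parameter exists so the polling logic can be exercised with a stand-in function instead of the live service.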

Service snapshot

  • Convert MP3 containing speech to text
  • Synchronous and asynchronous RESTful API
  • Text storage
