Supported audio formats

SpeechKit can recognize and synthesize audio in the following formats:

Audio features in this format:

Sampling frequency:

For OggOpus, data is encoded using the Opus audio codec and packaged in an Ogg container.

SpeechKit recognizes and synthesizes OggOpus with no restrictions on audio quality or file headers.

For MP3, data is encoded using the MPEG-1/2/2.5 Layer III audio codec and packaged in an MP3 container.

SpeechKit recognizes MP3 with no restrictions on audio quality or file headers.

Warning

The MP3 format is not supported by API v1 synchronous recognition or by API v2 streaming recognition.
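
Before sending a file for recognition, it can help to check locally which container it uses and whether the chosen recognition mode accepts it. The following is a minimal sketch, not part of SpeechKit: the function names `sniff_audio_format` and `mp3_allowed`, and the `api_version` and `mode` labels, are hypothetical. It relies only on publicly documented file signatures: Ogg pages start with `OggS`, an Opus stream carries `OpusHead` in its first page, and MP3 files usually start with an `ID3` tag or an MPEG Layer III frame header.

```python
# Combinations from the warning above where MP3 input is rejected.
# The string labels are illustrative assumptions, not SpeechKit parameters.
MP3_UNSUPPORTED = {("v1", "synchronous"), ("v2", "streaming")}


def sniff_audio_format(data: bytes) -> str:
    """Guess the audio container from the first bytes of a file."""
    # Every Ogg page begins with the capture pattern "OggS"; an Opus
    # stream places the "OpusHead" magic in its first page.
    if data[:4] == b"OggS":
        return "OggOpus" if b"OpusHead" in data[:64] else "Ogg (other codec)"
    # Many MP3 files begin with an ID3v2 metadata tag.
    if data[:3] == b"ID3":
        return "MP3"
    # Otherwise look for an MPEG frame header: 11 sync bits set,
    # followed by layer bits 01, which denote Layer III.
    if len(data) >= 2 and data[0] == 0xFF and (data[1] & 0xE0) == 0xE0:
        if (data[1] >> 1) & 0b11 == 0b01:
            return "MP3"
    return "unknown"


def mp3_allowed(api_version: str, mode: str) -> bool:
    """Return False for the API/mode pairs that do not accept MP3."""
    return (api_version, mode) not in MP3_UNSUPPORTED
```

A caller could read the first 64 bytes of a file, pass them to `sniff_audio_format`, and, if the result is `"MP3"`, consult `mp3_allowed` before choosing a recognition mode. The check is a heuristic: it inspects only the file signature and does not validate the rest of the stream.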