Supported audio formats

SpeechKit allows you to recognize and synthesize the following audio formats:

  • LPCM
  • OggOpus
  • MP3

LPCM

Linear pulse-code modulation without a WAV header.

Audio features in this format:

OggOpus

For OggOpus, data is encoded using the OPUS audio codec and compressed using the OGG container format.

SpeechKit recognizes and synthesizes OggOpus without audio file quality and header restrictions.

MP3

For MP3, data is encoded using the MPEG-1/2/2.5 Layer III audio codec and packaged in an MP3 container.

SpeechKit recognizes MP3 without audio file quality and header restrictions.

Warning

The MP3 format is not supported in the API v1 for synchronous recognition and API v2 for streaming recognition.