Skip to content

Latest commit

 

History

History
33 lines (25 loc) · 1.16 KB

RecognitionAudio.md

File metadata and controls

33 lines (25 loc) · 1.16 KB

RecognitionAudio

Contains audio data in the encoding specified in the RecognitionConfig. Either content or uri must be supplied. Supplying both or neither returns error.

{

  // Union field audio_source can be only one of the following:
  "content": string,
  "uri": string
}
Field Description
content string (bytes format)
The audio data bytes encoded as specified in RecognitionConfig. Note: as with all bytes fields, proto buffers use a pure binary representation, whereas JSON representations use base64.
uri string
URI that points to a file that contains audio data bytes as specified in RecognitionConfig. The file must not be compressed (for example, gzip). Url must be publicly accessible.

Encoding to base64

Use base64 command to convert

base64 source_audio_file -w 0 > dest_audio_file

Content Limits

The API contains the following limits on the size of audio in content field:

Content Limit Audio Length Audio Size
Synchronous Requests ~1 Minute max 4Mb
Streaming Requests ~5 Minutes -
Asynchronous Requests ~480 Minutes -