Requests with audio files to compatible models are available via the /v1/chat/completions API with the input_audio content type. Audio files must be base64-encoded and include the format specification. Note that only models with audio processing capabilities will handle these requests.You can search for models that support audio by filtering to audio input modality on our Models page.
The audio format is typically determined from the file extension. Extract the format by removing the leading dot from the file extension (e.g., .mp3 → mp3, .wav → wav).
Supported audio formats vary by provider. Common formats include:
wav - WAV audio
mp3 - MP3 audio
aiff - AIFF audio
aac - AAC audio
ogg - OGG Vorbis audio
flac - FLAC audio
m4a - M4A audio
pcm16 - PCM16 audio
pcm24 - PCM24 audio
Check Format Support
Note: Check your model’s documentation to confirm which audio formats it supports. Not all models support all formats. Visit anannas.ai/models to see capabilities by model.