/chat/completions
endpoint. This enables rich, multimodal interactions with minimal integration overhead.
Supported Modalities
-
Images - Send images to vision-enabled models for tasks like analysis, captioning, OCR, and visual Q&A. Supports: URL-based images, base64-encoded images
Read More about Image Inputs -> -
PDFs - Process PDF documents seamlessly. Anannas extracts text and handles both text-based PDFs and scanned files. Accepted formats: URL or base64-encoded.
Read More about PDF processing -> -
Audio (Not Yet Supported) - You can technically send audio in requests, but Anthropic models do not process it. No error will be thrown, it will simply be ignored. We recommend avoiding audio inputs for now until support is expanded.
Read More about Audio Input ->
Getting Started
All multimodal inputs use/chat/completions
with the messages
array. Specify the content type for each input:
- Images ->
image_url
- PDFs ->
file
- Audio ->
audio
Model Compatibility
- Images -> Supported (URL + base64)
- PDFs -> Supported (URL + base64)
- Audio -> Ignored (no error, no output)
Input Format Support
URLs (Recommended for public content)- Images:
https://example.com/image.jpg
- PDFs:
https://example.com/document.pdf
- Audio: β Ignored
- Images:
data:image/jpeg;base64,{base64_data}
- PDFs:
data:application/pdf;base64,{base64_data}
- Audio: Ignored
URLs are preferred for large files since they avoid payload bloat.Use base64 encoding when working with local or private files.
FAQs
Can I send both images and PDFs in one request?
Can I send both images and PDFs in one request?
Yes. You can mix text, images, and PDFs in the same request. The model will process them together.
What happens if I send audio?
What happens if I send audio?
Audio inputs are accepted but not processed. You wonβt see an error, but the model wonβt respond to the audio content.
Will audio support be added later?
Will audio support be added later?
Yes, audio support is on the roadmap. Stay tuned for updates.
Was this page helpful?