Overview

Anannas provides access to models from multiple providers through a unified API. Each model is identified by an ID in the provider/model-name format.

Model Endpoint

Query available models:
GET https://api.anannas.ai/v1/models

Response Format

{
  "data": [
    {
      "id": "openai/gpt-5-mini",
      "name": "GPT-5 Mini",
      "provider": "openai",
      "description": "Most capable GPT-5 Mini model",
      "context_length": 128000,
      "pricing": {
        "prompt": "0.00003",
        "completion": "0.00006",
        "request": "0"
      },
      "capabilities": [
        "text",
        "tool_calling",
        "json_mode"
      ]
    }
  ]
}
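
As an illustration, the endpoint and response shape above could be consumed from TypeScript roughly as follows. This is a minimal sketch, not an official client: the `Model` interface mirrors the example response, the helper names (`buildModelsRequest`, `parseModels`, `listModels`) are hypothetical, and Bearer-token authentication is an assumption that should be checked against the authentication docs.

```typescript
// Shape of one entry in the /v1/models response, following the example above.
interface Model {
  id: string;
  name: string;
  provider: string;
  description: string;
  context_length: number;
  pricing: { prompt: string; completion: string; request?: string };
  capabilities: string[];
}

// Build the URL and headers for the models endpoint.
// ASSUMPTION: the API key is sent as a Bearer token.
function buildModelsRequest(apiKey: string): { url: string; headers: Record<string, string> } {
  return {
    url: "https://api.anannas.ai/v1/models",
    headers: { Authorization: `Bearer ${apiKey}` },
  };
}

// Pull the model array out of a parsed response body, failing loudly on an unexpected shape.
function parseModels(body: unknown): Model[] {
  const data = (body as { data?: unknown } | null)?.data;
  if (!Array.isArray(data)) {
    throw new Error("Unexpected response: missing 'data' array");
  }
  return data as Model[];
}

// Fetch and parse the model list (relies on the global fetch in Node 18+ and browsers).
async function listModels(apiKey: string): Promise<Model[]> {
  const { url, headers } = buildModelsRequest(apiKey);
  const res = await fetch(url, { headers });
  if (!res.ok) {
    throw new Error(`GET /v1/models failed with status ${res.status}`);
  }
  return parseModels(await res.json());
}
```

Keeping the parsing step separate from the network call makes it possible to validate a saved response without hitting the API.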

Model Object Schema

Field            Type       Description
id               string     Model identifier (e.g., openai/gpt-5-mini)
name             string     Human-readable name
provider         string     Provider identifier (openai, anthropic, etc.)
description      string     Model description
context_length   number     Maximum context window in tokens
pricing          object     Cost structure (see below)
capabilities     string[]   Supported features

Pricing Object

type Pricing = {
  prompt: string;      // Cost per input token (USD)
  completion: string; // Cost per output token (USD)
  request?: string;   // Fixed cost per request (USD)
};
Prices are returned as strings to avoid floating-point precision issues.
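
Because prices arrive as decimal strings, a cost estimate can stay in integer arithmetic instead of converting to floats. The sketch below is one possible approach, not part of the API: it scales each price to an integer number of 10^-12 USD, multiplies by token counts, and formats the result back to a string. The helper names and the 12-digit scale are assumptions chosen for illustration.

```typescript
type Pricing = { prompt: string; completion: string; request?: string };

// Decimal places kept; amounts are held as integer multiples of 10^-12 USD.
const SCALE_DIGITS = 12;

// Convert a non-negative decimal string such as "0.00003" into a scaled integer.
function toScaledInt(price: string): number {
  if (!/^\d+(\.\d+)?$/.test(price)) {
    throw new Error(`Not a plain decimal string: ${price}`);
  }
  const [whole, frac = ""] = price.split(".");
  if (frac.length > SCALE_DIGITS) {
    throw new Error(`More than ${SCALE_DIGITS} decimal places: ${price}`);
  }
  return Number(whole + frac.padEnd(SCALE_DIGITS, "0"));
}

// Format a scaled integer back into a decimal USD string, trimming trailing zeros.
function fromScaledInt(value: number): string {
  const digits = String(value).padStart(SCALE_DIGITS + 1, "0");
  const whole = digits.slice(0, -SCALE_DIGITS);
  const frac = digits.slice(-SCALE_DIGITS).replace(/0+$/, "");
  return frac ? `${whole}.${frac}` : whole;
}

// Per-token prices times token counts, plus the optional fixed per-request fee.
function estimateCost(pricing: Pricing, promptTokens: number, completionTokens: number): string {
  const total =
    toScaledInt(pricing.prompt) * promptTokens +
    toScaledInt(pricing.completion) * completionTokens +
    toScaledInt(pricing.request ?? "0");
  if (!Number.isSafeInteger(total)) {
    throw new Error("Cost is not a safely representable integer; check the token counts");
  }
  return fromScaledInt(total);
}

// 1,000 prompt tokens and 500 completion tokens at the example rates:
console.log(estimateCost({ prompt: "0.00003", completion: "0.00006", request: "0" }, 1000, 500)); // 0.06
```

Plain numbers are sufficient here only while the scaled total stays below 2^53; for larger amounts or more decimal places, a decimal or big-integer library would be a safer choice.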

Model Format

Models use the format provider/model-name:
  • openai/gpt-5-mini
  • openai/gpt-3.5-turbo
  • anthropic/claude-3-sonnet
  • anthropic/claude-3-opus
  • x-ai/grok-beta
  • groq/llama-3-70b
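
For client code that needs the two parts separately, an ID in this format can be split at the first slash. The following is a small sketch with a hypothetical helper name; splitting on the first slash only is an assumption that allows for model names that themselves contain a slash.

```typescript
// Split "provider/model-name" into its two parts at the first "/".
function parseModelId(id: string): { provider: string; modelName: string } {
  const slash = id.indexOf("/");
  // Reject IDs with no slash, an empty provider, or an empty model name.
  if (slash <= 0 || slash === id.length - 1) {
    throw new Error(`Expected "provider/model-name", got "${id}"`);
  }
  return { provider: id.slice(0, slash), modelName: id.slice(slash + 1) };
}

const parsed = parseModelId("x-ai/grok-beta");
console.log(parsed.provider, parsed.modelName); // x-ai grok-beta
```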

Capabilities

View Model Capabilities

For the complete list of features each model supports, visit anannas.ai/models.
Models may support:
  • text: Text generation
  • tool_calling: Function/tool execution
  • json_mode: Structured JSON output
  • streaming: Server-Sent Events
  • multimodal: Image/audio inputs
  • reasoning: Extended reasoning (o1, o3, etc.)
  • audio_output: Audio generation
  • image_output: Image generation
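
Since each model object carries a capabilities array, a model list can be narrowed to the entries that support every feature a request needs. This is a minimal sketch; the function names are hypothetical and the second sample entry (`example/text-only`) is made up for illustration.

```typescript
// Only the fields this filter needs from a model object.
interface ModelSummary {
  id: string;
  capabilities: string[];
}

// True when the model lists every required capability.
function supportsAll(model: ModelSummary, required: string[]): boolean {
  return required.every((cap) => model.capabilities.includes(cap));
}

// Keep only the models that support all required capabilities.
function filterByCapabilities<T extends ModelSummary>(models: T[], required: string[]): T[] {
  return models.filter((m) => supportsAll(m, required));
}

const sample: ModelSummary[] = [
  { id: "openai/gpt-5-mini", capabilities: ["text", "tool_calling", "json_mode"] },
  { id: "example/text-only", capabilities: ["text"] }, // hypothetical entry
];
console.log(filterByCapabilities(sample, ["tool_calling", "json_mode"]).map((m) => m.id));
// [ 'openai/gpt-5-mini' ]
```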

Provider-Specific Models

View All Models

For complete, up-to-date model listings, pricing, capabilities, and parameter support, visit anannas.ai/models.

OpenAI

  • openai/gpt-5-mini
  • openai/gpt-4-turbo
  • openai/gpt-3.5-turbo
  • openai/o1-preview
  • openai/o1-mini
  • openai/o3-mini

Anthropic

  • anthropic/claude-3-opus
  • anthropic/claude-3-sonnet
  • anthropic/claude-3-haiku
  • anthropic/claude-sonnet-4-5

xAI (Grok)

  • x-ai/grok-beta
  • x-ai/grok-2

Other Providers

Models from Groq, TogetherAI, DeepInfra, Fireworks, Nebius, and others are available. Query /v1/models for the complete list.

Model Selection

Default Model

If model is omitted, Anannas uses your account’s default model. Set defaults in the dashboard.

Model Routing

Use provider preferences to control model selection:
{
  "model": "openai/gpt-5-mini",
  "provider": {
    "order": ["openai", "anthropic"],
    "sort": "price"
  }
}

Fallbacks

Specify fallback models:
{
  "model": "openai/gpt-5-mini",
  "fallbacks": [
    "anthropic/claude-3-sonnet",
    "openai/gpt-3.5-turbo"
  ]
}
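
The routing and fallback fields shown in this section can be assembled with a small typed helper. The sketch below only builds the request body; the helper and type names are hypothetical, combining `provider` and `fallbacks` in one request is an assumption (the examples above show them separately), and the de-duplication step is client-side tidying rather than an API requirement.

```typescript
// Field names follow the JSON examples in this section.
interface ProviderPreferences {
  order?: string[];
  sort?: string; // "price" is the only value shown in the examples
}

interface RoutedRequest {
  model: string;
  provider?: ProviderPreferences;
  fallbacks?: string[];
}

// Build a request body with optional provider preferences and fallback models.
function buildRoutedRequest(
  model: string,
  options: { provider?: ProviderPreferences; fallbacks?: string[] } = {}
): RoutedRequest {
  const body: RoutedRequest = { model };
  if (options.provider) {
    body.provider = options.provider;
  }
  // Drop duplicates and the primary model itself, preserving the given order.
  const fallbacks = (options.fallbacks ?? []).filter(
    (id, i, all) => id !== model && all.indexOf(id) === i
  );
  if (fallbacks.length > 0) {
    body.fallbacks = fallbacks;
  }
  return body;
}

console.log(
  JSON.stringify(
    buildRoutedRequest("openai/gpt-5-mini", {
      fallbacks: ["anthropic/claude-3-sonnet", "openai/gpt-3.5-turbo"],
    })
  )
);
// {"model":"openai/gpt-5-mini","fallbacks":["anthropic/claude-3-sonnet","openai/gpt-3.5-turbo"]}
```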

Parameter Support

Check Parameter Support

For detailed parameter support by model, visit anannas.ai/models.
Not all models support all parameters. Common support:
  • Temperature: Most models
  • Top P: Most models
  • Max Tokens: All models
  • Tool Calling: OpenAI, Anthropic (Claude 3+)
  • JSON Mode: OpenAI GPT-5 Mini, Claude 3+
  • Streaming: All models
  • Reasoning: o1, o3, Claude Sonnet 4.5
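
One way to avoid sending unsupported parameters is to compare a request against the model's capabilities array before sending it. The sketch below is illustrative only: the mapping from request fields to capabilities is hypothetical, and the field names `tools` and `response_format` are assumptions borrowed from OpenAI-style request bodies rather than confirmed Anannas parameters.

```typescript
// HYPOTHETICAL mapping from a request field to the capability it requires.
const REQUIRED_CAPABILITY = new Map<string, string>([
  ["tools", "tool_calling"],
  ["response_format", "json_mode"],
]);

// Return the request fields that need a capability the model does not list.
function unsupportedFields(request: Record<string, unknown>, capabilities: string[]): string[] {
  return Object.keys(request).filter((field) => {
    const needed = REQUIRED_CAPABILITY.get(field);
    return needed !== undefined && !capabilities.includes(needed);
  });
}

// A text-only model cannot honor a request that carries tool definitions.
console.log(unsupportedFields({ model: "example/text-only", tools: [] }, ["text"])); // [ 'tools' ]
```

A caller could then either drop the flagged fields or pick a different model before sending the request.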

Context Windows

View Context Windows

For accurate context window sizes for all models, check anannas.ai/models.
Context window sizes vary by model:
  • GPT-5 Mini: 128,000 tokens
  • GPT-4 Turbo: 128,000 tokens
  • Claude 3 Opus: 200,000 tokens
  • Claude 3 Sonnet: 200,000 tokens
  • o1: 128,000 tokens
Exceeding context limits returns an error.
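
A request can be checked against a model's context_length before it is sent. The following sketch assumes that the context window covers both the prompt and the generated output, which is common but not stated on this page, and that token counts come from a tokenizer not shown here. The function names are hypothetical.

```typescript
// True when the prompt plus the requested output fits in the context window.
// ASSUMPTION: input and output tokens share the same window.
function fitsContext(promptTokens: number, maxOutputTokens: number, contextLength: number): boolean {
  return promptTokens + maxOutputTokens <= contextLength;
}

// Tokens left for output after the prompt, never negative.
function maxOutputBudget(promptTokens: number, contextLength: number): number {
  return Math.max(0, contextLength - promptTokens);
}

console.log(fitsContext(120000, 8000, 128000)); // true
console.log(fitsContext(120001, 8000, 128000)); // false
console.log(maxOutputBudget(120000, 128000)); // 8000
```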

Rate Limits

Rate limits are tier-based and vary by model. Check your tier in the dashboard. Limits apply per API key.

Pricing

View Current Pricing

For up-to-date pricing for all models, visit anannas.ai/models. Pricing is updated in real time.
Pricing is per token, in USD, and varies by model. Query the /v1/models endpoint for current rates. Example pricing (approximate):
  • GPT-5 Mini: $0.0015/1K input, $0.002/1K output
  • GPT-3.5 Turbo: $0.0015/1K input, $0.002/1K output
  • Claude 3 Sonnet: $0.003/1K input, $0.015/1K output

Model Updates

Models are updated automatically. Model identifiers remain stable, but underlying model versions may change. Check model descriptions for version information.

Deprecated Models

Deprecated models continue to work but may be removed in future versions. Migrate to supported alternatives.
