Catalog
20 models, one endpoint
From frontier flagship models to fine-tuned open weights. Sortable by latency, price, or context length; pick what fits your workload.
| Model | ID | Provider | Context | Input / 1M tokens | Output / 1M tokens | P50 latency | Modalities |
|---|---|---|---|---|---|---|---|
| GPT-5 (Featured) | openai/gpt-5 | OpenAI | 400K | $5.00 | $15.00 | 480ms | text, vision, code |
| GPT-5 Mini (Featured) | openai/gpt-5-mini | OpenAI | 200K | $0.60 | $2.40 | 220ms | text, vision |
| Claude Opus 4.7 (Featured) | anthropic/claude-opus-4.7 | Anthropic | 1M | $15.00 | $75.00 | 920ms | text, vision, code |
| Claude Sonnet 4.6 (Featured) | anthropic/claude-sonnet-4.6 | Anthropic | 500K | $3.00 | $15.00 | 410ms | text, vision, code |
| Gemini 2.5 Pro (Featured) | google/gemini-2.5-pro | Google | 2M | $1.25 | $10.00 | 520ms | text, vision, audio, code |
| DeepSeek V4 (Featured) | deepseek/deepseek-v4 | DeepSeek | 128K | $0.14 | $0.28 | 290ms | text, code |
| GPT-4o | openai/gpt-4o | OpenAI | 128K | $2.50 | $10.00 | 380ms | text, vision, audio |
| Claude Haiku 4.5 | anthropic/claude-haiku-4.5 | Anthropic | 200K | $0.80 | $4.00 | 180ms | text, vision |
| Gemini 2.5 Flash | google/gemini-2.5-flash | Google | 1M | $0.08 | $0.30 | 160ms | text, vision |
| Llama 4 405B | meta/llama-4-405b | Meta | 256K | $2.70 | $2.70 | 680ms | text, code |
| Llama 4 70B Instruct | meta/llama-4-70b | Meta | 128K | $0.55 | $0.65 | 240ms | text, code |
| DeepSeek R2 | deepseek/deepseek-r2 | DeepSeek | 128K | $0.55 | $2.19 | 1450ms | text, code |
| Qwen3 Max | alibaba/qwen3-max | Alibaba | 1M | $1.60 | $6.40 | 540ms | text, vision, code |
| Qwen3 72B Instruct | alibaba/qwen3-72b | Alibaba | 256K | $0.40 | $1.20 | 270ms | text, code |
| Mistral Large 2 | mistral/mistral-large-2 | Mistral | 128K | $2.00 | $6.00 | 350ms | text, code |
| Mistral Small 3.1 | mistral/mistral-small-3.1 | Mistral | 128K | $0.20 | $0.60 | 200ms | text, vision |
| Grok 4 | xai/grok-4 | xAI | 256K | $5.00 | $15.00 | 520ms | text, vision |
| Grok 4 Mini | xai/grok-4-mini | xAI | 128K | $0.30 | $0.50 | 190ms | text |
| Command R+ v2 | cohere/command-r-plus-2 | Cohere | 256K | $2.50 | $10.00 | 410ms | text |
| Yi Large 2 | 01-ai/yi-large-2 | 01.AI | 200K | $0.70 | $2.00 | 320ms | text |
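
To make the per-1M pricing concrete, here is a minimal Python sketch that estimates the cost of a single request and picks the cheapest model that satisfies a context and latency budget. It is illustrative only: the `Model` class and the `request_cost` and `cheapest` helpers are hypothetical names, not part of any catalog API. It also assumes that prices are per 1M tokens, that "K" and "M" in the Context column mean 1,000 and 1,000,000 tokens, and that input and output tokens both count against the context window.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Model:
    model_id: str
    context_tokens: int   # assumed: 400K == 400_000 tokens
    input_per_1m: float   # USD per 1M input tokens
    output_per_1m: float  # USD per 1M output tokens
    p50_ms: int           # P50 latency in milliseconds


# A few rows copied from the catalog table, not the full list.
CATALOG = [
    Model("openai/gpt-5", 400_000, 5.00, 15.00, 480),
    Model("anthropic/claude-haiku-4.5", 200_000, 0.80, 4.00, 180),
    Model("google/gemini-2.5-flash", 1_000_000, 0.08, 0.30, 160),
    Model("deepseek/deepseek-v4", 128_000, 0.14, 0.28, 290),
]


def request_cost(model: Model, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed per-1M prices."""
    total = input_tokens * model.input_per_1m + output_tokens * model.output_per_1m
    return total / 1_000_000


def cheapest(catalog, input_tokens: int, output_tokens: int,
             max_p50_ms: int) -> Optional[Model]:
    """Cheapest model whose context fits the request and whose P50 is within budget."""
    fits = [
        m for m in catalog
        # Assumption: input and output tokens share the context window.
        if m.context_tokens >= input_tokens + output_tokens
        and m.p50_ms <= max_p50_ms
    ]
    return min(fits, key=lambda m: request_cost(m, input_tokens, output_tokens),
               default=None)
```

For example, `request_cost(CATALOG[0], 10_000, 2_000)` works out to (10,000 × $5.00 + 2,000 × $15.00) / 1,000,000 = $0.08 for GPT-5. P50 latency is a median, so a real budget check would likely need tail latencies as well.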