Flagship Frontier Table

OpenAI / Anthropic / Google / Grok / DeepSeek / Qwen / Kimi / MiniMax / Z.ai
Source: OpenRouter
Updated: 2026-06-19 05:00:53 UTC
Tracked Companies: 9
Tracked Variants: 20
Available Variants: 17
Lowest Input: $0.0650 per 1M tokens
Largest Context: 1.1M tokens
Selection rule: each company is pinned to its latest model family, and this page lists every maintained core variant in that family, including lighter and fuller tiers when OpenRouter exposes them. Prices, context length, max output, and supported parameters come directly from the matched OpenRouter payload. The cheapest input and output prices and the largest context and max-output values are highlighted.
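The highlighting rule can be sketched in a few lines. This is only an illustration, not the page's actual code: the `Row` type and the `highlights` helper are hypothetical names, and the sample rows are a handful of entries copied from the table, with context and max output written out as approximate token counts.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Row:
    """One available variant; prices are USD per 1M tokens."""
    model: str
    input_price: float
    output_price: float
    context: int                # tokens
    max_output: Optional[int]   # tokens; None when the table lists no value


def highlights(rows: list[Row]) -> dict[str, str]:
    """Return the model id that wins each highlighted column."""
    # Rows without a listed max output cannot win that column.
    with_max = [r for r in rows if r.max_output is not None]
    return {
        "cheapest_input": min(rows, key=lambda r: r.input_price).model,
        "cheapest_output": min(rows, key=lambda r: r.output_price).model,
        "largest_context": max(rows, key=lambda r: r.context).model,
        "largest_max_output": max(with_max, key=lambda r: r.max_output).model,
    }


# Sample rows (a subset of the table, values as listed there).
rows = [
    Row("openai/gpt-5.4", 2.50, 15.00, 1_100_000, 128_000),
    Row("deepseek/deepseek-v3.2", 0.2288, 0.3432, 131_100, 64_000),
    Row("qwen/qwen3.5-9b", 0.10, 0.15, 262_100, 262_100),
    Row("qwen/qwen3.5-flash-02-23", 0.065, 0.26, 1_000_000, 65_500),
    Row("moonshotai/kimi-k2.5", 0.375, 2.025, 262_100, None),
]

result = highlights(rows)
```

Note that `min` and `max` return the first matching row on a tie, so a page that highlights every tied cell (several Qwen variants share the 262.1K max output) would need to collect all rows equal to the winning value instead.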
Company | Family | Variant | Model | Input $/1M | Output $/1M | Reasoning $/1M | Context | Max Output | Modalities | Supported Params
OpenAI GPT-5.4 Standard
OpenAI: GPT-5.4
openai/gpt-5.4
GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for...
Input $2.500 | Output $15.00 | Reasoning FREE | Context 1.1M | Max Output 128K
IN: TEXT, IMAGE, FILE
OUT: TEXT
Params: include_reasoning, max_completion_tokens, max_tokens, reasoning, +5 more
OpenAI GPT-5.4 Pro
OpenAI: GPT-5.4 Pro
openai/gpt-5.4-pro
GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922
Input $30.00 | Output $180.00 | Reasoning FREE | Context 1.1M | Max Output 128K
IN: TEXT, IMAGE, FILE
OUT: TEXT
Params: include_reasoning, max_completion_tokens, max_tokens, reasoning, +5 more
Anthropic Claude 4.6 Sonnet
Anthropic: Claude Sonnet 4.6
anthropic/claude-sonnet-4.6
Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation,
Input $3.000 | Output $15.00 | Reasoning FREE | Context 1M | Max Output 128K
IN: TEXT, IMAGE, FILE
OUT: TEXT
Params: include_reasoning, max_completion_tokens, max_tokens, reasoning, +9 more
Anthropic Claude 4.6 Opus
Anthropic: Claude Opus 4.6
anthropic/claude-opus-4.6
Opus 4.6 is Anthropic’s strongest model for coding and long-running professional tasks. It is built for agents that operate across entire workflows rather than single prompts, making it especially eff
Input $5.000 | Output $25.00 | Reasoning FREE | Context 1M | Max Output 128K
IN: TEXT, IMAGE, FILE
OUT: TEXT
Params: include_reasoning, max_completion_tokens, max_tokens, reasoning, +9 more
Google Gemini 3.1 Pro Preview
Google: Gemini 3.1 Pro Preview
google/gemini-3.1-pro-preview
Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows
Input $2.000 | Output $12.00 | Reasoning $12.00 | Context 1M | Max Output 65.5K
IN: AUDIO, FILE, IMAGE, TEXT, VIDEO
OUT: TEXT
Params: include_reasoning, max_tokens, reasoning, response_format, +7 more
Grok Grok 4.20 Beta
Unavailable
Grok Grok 4.20 Multi-Agent Beta
Unavailable
DeepSeek DeepSeek V3.2 Base
DeepSeek: DeepSeek V3.2
deepseek/deepseek-v3.2
DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fin
Input $0.2288 | Output $0.3432 | Reasoning FREE | Context 131.1K | Max Output 64K
IN: TEXT
OUT: TEXT
Params: frequency_penalty, include_reasoning, logit_bias, logprobs, +15 more
DeepSeek DeepSeek V3.2 Exp
DeepSeek: DeepSeek V3.2 Exp
deepseek/deepseek-v3.2-exp
DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grai
Input $0.2700 | Output $0.4100 | Reasoning FREE | Context 163.8K | Max Output 65.5K
IN: TEXT
OUT: TEXT
Params: frequency_penalty, include_reasoning, logit_bias, logprobs, +15 more
DeepSeek DeepSeek V3.2 Speciale
Unavailable
Qwen Qwen 3.5 397B A17B
Qwen: Qwen3.5 397B A17B
qwen/qwen3.5-397b-a17b
The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher infere
Input $0.3850 | Output $2.450 | Reasoning FREE | Context 256K | Max Output not listed
IN: TEXT, IMAGE, VIDEO
OUT: TEXT
Params: frequency_penalty, include_reasoning, logit_bias, logprobs, +15 more
Qwen Qwen 3.5 122B A10B
Qwen: Qwen3.5-122B-A10B
qwen/qwen3.5-122b-a10b
The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference eff
Input $0.2600 | Output $2.080 | Reasoning FREE | Context 262.1K | Max Output 262.1K
IN: TEXT, IMAGE, VIDEO
OUT: TEXT
Params: frequency_penalty, include_reasoning, logit_bias, logprobs, +15 more
Qwen Qwen 3.5 35B A3B
Qwen: Qwen3.5-35B-A3B
qwen/qwen3.5-35b-a3b
The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inf
Input $0.1400 | Output $1.000 | Reasoning FREE | Context 262.1K | Max Output 262.1K
IN: TEXT, IMAGE, VIDEO
OUT: TEXT
Params: frequency_penalty, include_reasoning, logit_bias, logprobs, +15 more
Qwen Qwen 3.5 27B
Qwen: Qwen3.5-27B
qwen/qwen3.5-27b
The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities a
Input $0.1950 | Output $1.560 | Reasoning FREE | Context 262.1K | Max Output 65.5K
IN: TEXT, IMAGE, VIDEO
OUT: TEXT
Params: frequency_penalty, include_reasoning, logit_bias, logprobs, +15 more
Qwen Qwen 3.5 9B
Qwen: Qwen3.5-9B
qwen/qwen3.5-9b
Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified
Input $0.1000 | Output $0.1500 | Reasoning FREE | Context 262.1K | Max Output 262.1K
IN: TEXT, IMAGE, VIDEO
OUT: TEXT
Params: frequency_penalty, include_reasoning, logit_bias, logprobs, +15 more
Qwen Qwen 3.5 Plus
Qwen: Qwen3.5 Plus 2026-02-15
qwen/qwen3.5-plus-02-15
The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that integrates linear attention mechanisms with sparse mixture-of-experts models, achieving higher inference e
Input $0.2600 | Output $1.560 | Reasoning FREE | Context 1M | Max Output 65.5K
IN: TEXT, IMAGE, VIDEO
OUT: TEXT
Params: include_reasoning, logprobs, max_tokens, presence_penalty, +9 more
Qwen Qwen 3.5 Flash
Qwen: Qwen3.5-Flash
qwen/qwen3.5-flash-02-23
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference effic
Input $0.0650 | Output $0.2600 | Reasoning FREE | Context 1M | Max Output 65.5K
IN: TEXT, IMAGE, VIDEO
OUT: TEXT
Params: include_reasoning, max_tokens, presence_penalty, reasoning, +7 more
Kimi Kimi K2.5 Standard
MoonshotAI: Kimi K2.5
moonshotai/kimi-k2.5
Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm. Built on Kimi K2 with continued pretraining over appr
Input $0.3750 | Output $2.025 | Reasoning FREE | Context 262.1K | Max Output not listed
IN: TEXT, IMAGE
OUT: TEXT
Params: frequency_penalty, include_reasoning, logit_bias, logprobs, +15 more
MiniMax MiniMax M2.5 Standard
MiniMax: MiniMax M2.5
minimax/minimax-m2.5
MiniMax-M2.5 is a SOTA large language model designed for real-world productivity. Trained in a diverse range of complex real-world digital working environments, M2.5 builds upon the coding expertise o
Input $0.1500 | Output $0.9000 | Reasoning FREE | Context 204.8K | Max Output 196.6K
IN: TEXT
OUT: TEXT
Params: frequency_penalty, include_reasoning, logit_bias, logprobs, +17 more
Z.ai GLM-5 Standard
Z.ai: GLM 5
z-ai/glm-5
GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on l
Input $0.6000 | Output $1.920 | Reasoning FREE | Context 202.8K | Max Output not listed
IN: TEXT
OUT: TEXT
Params: frequency_penalty, include_reasoning, logit_bias, logprobs, +15 more
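
Because prices are quoted per 1M tokens, the cost of a single request follows from simple arithmetic. The sketch below uses a hypothetical `request_cost` helper with rates copied from the table above. It is a rough estimate only: it ignores the reasoning column and any caching or tiered pricing, none of which the table describes.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_per_m: float, output_per_m: float) -> float:
    """Estimated USD cost of one request, given prices in USD per 1M tokens."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


# 100K input tokens and 10K output tokens on two rows from the table.
gpt_cost = request_cost(100_000, 10_000, 2.50, 15.00)      # openai/gpt-5.4
flash_cost = request_cost(100_000, 10_000, 0.065, 0.26)    # qwen/qwen3.5-flash-02-23
```

For this workload the estimate comes to about $0.40 on `openai/gpt-5.4` and about $0.0091 on `qwen/qwen3.5-flash-02-23`.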