AI Model Comparison

Claude 3 Haiku

Anthropic

Anthropic's fastest and cheapest model, ideal for real-time conversations and high-frequency API calls with extremely low cost.

Context window

200K

Input price

$0.25 / 1M

Output price

$1.25 / 1M

Claude 3.5 Sonnet

Anthropic

Anthropic's strongest coding model, ranked first on SWE-bench, with top-tier code quality and instruction-following capabilities, excelling in agent tasks.

Context window

200K

Input price

$3 / 1M

Output price

$15 / 1M

Claude Haiku 4.5

Anthropic

Claude Haiku 4.5, the fastest Claude, with near-frontier intelligence and 200K context, ideal for high-concurrency and low-latency scenarios.

Context window

200K

Input price

$1 / 1M

Output price

$5 / 1M

Claude Opus 4.5

Anthropic

Claude Opus 4.5, 200K context, high-quality reasoning and coding, relatively better cost-effectiveness.

Context window

200K

Input price

$5 / 1M

Output price

$25 / 1M

Claude Opus 4.6

Anthropic

Claude Opus 4.6, 1M context, supports extended thinking, stable performance on complex tasks.

Context window

Input price

$5 / 1M

Output price

$25 / 1M

Claude Opus 4.7

Anthropic

Claude Opus, the previous generation flagship, offers 1M context, strong complex reasoning and agentic coding capabilities, and remains a top-tier choice.

Context window

Input price

$5 / 1M

Output price

$25 / 1M

Claude Opus 4.8

Anthropic

Anthropic's current strongest model, top-tier in complex reasoning, long-cycle agentic coding, and highly autonomous tasks, ranked first in the Intelligence Index.

Context window

Input price

$5 / 1M

Output price

$25 / 1M

Claude Sonnet 4.5

Anthropic

Claude Sonnet 4.5, with 200K context, balances speed and intelligence, excelling in coding and agent tasks.

Context window

200K

Input price

$3 / 1M

Output price

$15 / 1M

Claude Sonnet 4.6

Anthropic

The best balance of speed and intelligence, with 1M context, offering great value for daily development and agent tasks.

Context window

Input price

$3 / 1M

Output price

$15 / 1M

GPT / OpenAI

GPT-4o

OpenAI

OpenAI's flagship multimodal model, proficient in vision, speech, and text, with fast response speed and the most comprehensive ecosystem.

Context window

128K

Input price

$5 / 1M

Output price

$15 / 1M

GPT-4o mini

OpenAI

GPT-4o Lite, 3x faster than GPT-4o with 95% cost reduction, ideal for high-concurrency agent scenarios.

Context window

128K

Input price

$0.15 / 1M

Output price

$0.6 / 1M

GPT-5.5

OpenAI

OpenAI's current flagship, ranked second on the Intelligence Index (just behind Claude Opus 4.8), with top-tier performance in high/ultra-high reasoning settings.

Context window

—

Input price

—

Output price

—

o1-preview

OpenAI

OpenAI's specialized reasoning model, answers after deep thinking, performs best on complex math/science/code problems, but is slow.

Context window

128K

Input price

$15 / 1M

Output price

$60 / 1M

o3

OpenAI

OpenAI o-series reasoning model, answers after deep thinking, performs strongly on complex math/science/coding problems.

Context window

200K

Input price

—

Output price

—

Gemini

Gemini 1.5 Pro

Google

Google's ultra-long context specialized model with a 2 million token window, capable of analyzing entire codebases or long videos.

Context window

Input price

$3.5 / 1M

Output price

$10.5 / 1M

Gemini 2.0 Flash

Google

Google's latest agentic model with million-token ultra-long context, native tool calling support, and extremely low price.

Context window

Input price

$0.1 / 1M

Output price

$0.4 / 1M

Gemini 3.1 Pro

Google

Google's flagship, with ultra-long context, native multimodal capabilities, and tool calling, its reasoning ability ranks among the top tier.

Context window

Input price

$2 / 1M

Output price

$12 / 1M

Gemini 3.5 Flash

Google

Gemini Fast: excellent speed and cost, ideal for high concurrency and real-time scenarios.

Context window

Input price

$1.5 / 1M

Output price

$9 / 1M

DeepSeek

DeepSeek V4 Flash

DeepSeek

DeepSeek's fast open-source model with 1M context and extremely low price, suitable for large-scale batch processing.

Context window

Input price

$0.14 / 1M

Output price

$0.28 / 1M

DeepSeek V4 Pro

DeepSeek

China's open-source flagship, with 1M context and dual thinking/non-thinking modes, excels in coding and reasoning, priced at a fraction of closed-source flagships.

Context window

Input price

$0.44 / 1M

Output price

$0.87 / 1M

DeepSeek-R1

DeepSeek

Designed for complex reasoning, with math/logic/coding capabilities comparable to o1, but fully open-source with training costs at only 3%.

Context window

64K

Input price

$0.55 / 1M

Output price

$2.19 / 1M

DeepSeek-V3

DeepSeek

Domestic flagship model with code and math capabilities comparable to Claude, priced at only 5% of OpenAI, the king of cost-effectiveness.

Context window

64K

Input price

$0.27 / 1M

Output price

$1.1 / 1M

Llama

Llama 3.1 405B

Meta

Meta Llama 3.1 405B: A flagship open-source model with 405 billion parameters, 128K context, and performance approaching closed-source flagships.

Context window

128K

Input price

开源免费

Output price

开源免费

Llama 3.1 70B

Meta

Meta Llama 3.1 70B: A 70-billion-parameter open-source general-purpose model with 128K context, offering high deployment cost-effectiveness.

Context window

128K

Input price

开源免费

Output price

开源免费

Llama 3.1 8B

Meta

Meta Llama 3.1 8B: An 8-billion-parameter lightweight open-source model with 128K context, suitable for edge and low-cost deployment.

Context window

128K

Input price

开源免费

Output price

开源免费

Llama 3.3 70B

Meta

Meta's latest open-source flagship, 70 billion parameters, self-hostable, business-friendly license, performance approaching closed-source models.

Context window

128K

Input price

开源免费

Output price

开源免费

Llama 4 Maverick

Meta

Meta Llama 4 Maverick: 400B total params (17B activated, 128 experts) MoE, 1M context, open weights, commercially friendly.

Context window

Input price

开源免费

Output price

开源免费

Llama 4 Scout

Meta

Meta's latest open-source model features a native 10M ultra-long context, supports self-hosting, and is commercially friendly.

Context window

10M

Input price

开源免费

Output price

开源免费

Qwen

Qwen2.5-72B

Alibaba

Alibaba's Tongyi Qianwen latest flagship, with the strongest Chinese language capabilities domestically, fully open-source, and supports multimodal.

Context window

128K

Input price

开源免费

Output price

开源免费

Qwen2.5-Coder

Alibaba

Alibaba's specialized code model surpasses Claude 3.5 Sonnet in coding ability, achieving 98.5% on HumanEval, fully open-source.

Context window

128K

Input price

开源免费

Output price

开源免费

Qwen2.5-Max

Alibaba

Tongyi Qianwen 2.5 Max: A large-scale MoE flagship model with comprehensive capabilities comparable to mainstream closed-source models.

Context window

128K

Input price

—

Output price

—

Qwen3-Coder

Alibaba

Tongyi Qianwen 3 Code Special: 480B-A35B MoE open source (Apache 2.0), strong code capabilities.

Context window

—

Input price

—

Output price

—

Qwen3-Max

Alibaba

Tongyi Qianwen 3 Max: A closed-source flagship with over 1T parameters, representing the ceiling of the Tongyi series' capabilities.

Context window

—

Input price

—

Output price

—

Qwen3.5

Alibaba

Alibaba Tongyi Qianwen 3.5, open-source, fast, and extremely low-cost (starting from approximately $0.01/1M), available in multiple sizes.

Context window

—

Input price

$0.01 / 1M

Output price

—

Qwen3.6

Alibaba

Tongyi Qianwen 3.6: 35B-A3B MoE open-source model (Apache 2.0), the latest generation in 2026.

Context window

—

Input price

—

Output price

—

Qwen3.6-Plus

Alibaba

Tongyi Qianwen 3.6 Plus: Closed-source flagship version, released in 2026, with comprehensive capabilities comparable to mainstream closed-source models.

Context window

—

Input price

—

Output price

—

GLM (Zhipu)

GLM-4-Plus

Zhipu

Zhipu GLM-4-Plus: Closed-source flagship version (2024) with strong overall capabilities.

Context window

—

Input price

—

Output price

—

GLM-4.5

Zhipu

Zhipu GLM-4.5: 128K context, comprehensive capabilities comparable to mainstream models, one of the leading domestic open-source models.

Context window

128K

Input price

—

Output price

—

GLM-4.6

Zhipu

Zhipu GLM-4.6: 200K context (expanded from 128K in 4.5), token efficiency improved by approximately 30% compared to the previous generation.

Context window

200K

Input price

—

Output price

—

GLM-4.7

Zhipu

Zhipu GLM-4.7: Open source (MIT), an iterative upgrade of GLM-4.6.

Context window

—

Input price

—

Output price

—

GLM-5

Zhipu

Zhipu GLM-5: The latest open-source flagship in 2026 (MIT), with comprehensive capabilities comparable to mainstream closed-source models.

Context window

—

Input price

—

Output price

—

Kimi

Kimi K2

Moonshot

Kimi K2: 128K–256K context, 1T total params (32B active) MoE, trained on 15.5T tokens, open-source weights.

Context window

256K

Input price

—

Output price

—

Kimi K2 Thinking

Moonshot

Kimi K2 Thinking: 256K context, 1T parameters (32B activated) MoE, focused on deep reasoning, open-source weights.

Context window

256K

Input price

—

Output price

—

Kimi K2.5

Moonshot

Moonshot Kimi K2.5: 256K context, 1T total params (32B active) MoE + vision, open-source weights, multimodal upgrade of K2.

Context window

256K

Input price

—

Output price

—

Kimi K2.6

Moonshot

Moonshot Kimi series, one of the highest-ranked open-source weight models on the current Intelligence Index.

Context window

—

Input price

—

Output price

—

Step (StepFun)

Step-2

StepFun

Step-2 by StepStar: a trillion-parameter LLM (2024), one of the representative domestic large models.

Context window

—

Input price

—

Output price

—

Step-3

StepFun

Step-3 by StepFun: A large model released in 2025, focusing on multimodality and large-scale parameters.

Context window

—

Input price

—

Output price

—

Step-3.5-Flash

StepFun

Step-3.5-Flash by Jieyue Xingchen: 196B-A11B MoE, 256K context, supports tool calling, open-source (Apache 2.0).

Context window

256K

Input price

—

Output price

—

Step-3.7-Flash

StepFun

Step-3.7-Flash by Jieyue Xingchen: The latest open-source (Apache 2.0) fast model in 2026.

Context window

—

Input price

—

Output price

—

Mistral

Codestral

Mistral

Mistral code-specific model, supporting 80+ programming languages, code completion and generation.

Context window

32K

Input price

—

Output price

—

Mistral Large

Mistral

Europe's strongest AI model, with excellent multilingual capabilities and support for Function Calling, suitable for European compliance scenarios.

Context window

128K

Input price

$2 / 1M

Output price

$6 / 1M

Mistral Large 2

Mistral

Mistral flagship, 128K context, strong multilingual and code capabilities, representative of European open-weight models.

Context window

128K

Input price

—

Output price

—

Mixtral 8x22B

Mistral

Mistral's flagship MoE architecture open-source model, with 141 billion parameters and 39 billion activated, achieving the best balance between performance and cost.

Context window

64K

Input price

开源免费

Output price

开源免费

Mixtral 8x7B

Mistral

Mistral classic MoE model, 8×7B experts, open-source, lightweight and efficient, widely deployed.

Context window

32K

Input price

—

Output price

—

Other models

QwQ-32B

Alibaba

Tongyi Qianwen specialized open-source reasoning model with 32 billion parameters, excelling in mathematics and logical reasoning.

Context window

128K

Input price

—

Output price

—