ModelsJun 23, 2026
Doubao Model 2.1 Released: Coding and Agent Capabilities Reach Production-Grade Breakthrough
On June 23, Volcano Engine released the Doubao Model 2.1 series at the 2026 Summer FORCE Conference, including Doubao-Seed-2.1-Pro and Doubao-Seed-2.1-Turbo, with API services fully available on Volcano Ark. Volcano Engine President Tan Dai stated that the model capabilities have crossed the production-grade "qualitative change point," achieving leaps in three directions: Coding, Agent, and VLM.
Core Capabilities and Benchmark Performance
- Coding: Ranked in the first tier on Terminal Bench 2.1, SWE-Pro, SciCode, etc., matching or surpassing Claude Opus 4.7. In a chip design RTL test, Doubao 2.1 Pro ran continuously for nearly 18 hours, underwent 9 iterations, completed 6 core modules and 1303 lines of RTL code, and passed the complete engineering pipeline including simulation, testing, and synthesis checks.
- Agent: Ranked globally among the top on MCP-Atlas, GDPVal, etc. A 3D virtual city scene built on Doubao 2.1 Pro enables over 500 intelligent agents to collaborate synchronously, completing thousands of tool calls.
- VLM: Leading performance on OSWorld, MobileWorld, MMMU-Pro, etc., supporting long-video cross-temporal logical understanding and complex chart reasoning.
Pricing and Cost
- Doubao 2.1 Pro: 6 RMB per million input tokens, 30 RMB per million output tokens, and only 1.2 RMB for cache hits. Volcano Engine claims the overall usage cost is nearly 80% lower than Claude Opus 4.6.
- Doubao 2.1 Turbo: Priced at half of the Pro version, targeting high-frequency call scenarios.
- Additionally, Volcano Engine launched the Doubao-Seed-Evolving version, targeting Coding and Agent scenarios, with 2-4 updates per month.
Multimodal Models and Ecosystem
- Seedance 2.5: A video generation model supporting native 30-second single-segment video output and joint generation of up to 50 full-modal materials, expected to launch in July. Seedance 2.0 has already released native 4K 10-bit high-bit-depth capabilities.
- Seedream 5.0 Pro: An image creation model supporting interactive precise editing, multi-layer separation, and text generation in 14 languages.
- Seed-Audio 1.0: An audio generation model supporting zero-shot multimodal reference, generating multi-character dialogues, background music, and sound effects in one go.
- Volcano Engine upgraded its AI cloud-native architecture, releasing tools such as Ark CLI, AgentKit, and HiAgent 3.0, and launched the AI Trust product system.
Market Data
- As of June, the daily token call volume of the Doubao model exceeded 180 trillion, growing more than 10 times over the past year.
- IDC data shows Volcano Engine ranked first in China's public cloud MaaS market with a 49.5% share.
- Over 1.1 million enterprises and individuals use Volcano Ark, with 200 enterprises having an annual token call volume exceeding 1 trillion, doubling in six months.
Also available in 中文.