Zhipu GLM-5.2 Released and Open-Sourced: First in Code Arena Among Globally Available Models, with a 1M-Token Context and a Leap in Long-Horizon Programming
On June 17, 2025, Zhipu AI officially released its flagship model GLM-5.2 and open-sourced it under the MIT license, with support for a 1-million-token context window. In the Code Arena front-end development blind test it ranked first among globally available models, behind only the unreleased Claude Fable 5, and it ranked first globally in Design Arena.
Core Capabilities and Benchmark Performance
- Programming & Agents: On the FrontierSWE benchmark, GLM-5.2 trails Claude Opus 4.8 by only ~1% and leads GPT-5.5 by ~1%. On PostTrainBench, it surpasses Opus 4.7 and GPT-5.5 and is second only to Opus 4.8. On SWE-Marathon, a benchmark of ultra-long tasks, it ranks second, behind the Opus series. On the standard programming benchmarks, it scores 81.0 on Terminal-Bench 2.1 (GLM-5.1: 63.5, a gain of 17.5 points) and 62.1 on SWE-bench Pro (GLM-5.1: 58.4, a gain of 3.7 points).
- Long Context: Supports a 1-million-token context and performs strongly on long-horizon programming benchmarks. The team cut per-token FLOPs by a factor of roughly 2.9 with the IndexShare mechanism (one indexer shared across every 4 layers) and optimized the inference engine to improve long-context throughput.
- Thinking Effort Levels: Introduces effort level control, allowing users to balance capability and computational cost. The Max level further extends programming capabilities.
Architecture Innovations
- IndexShare Dynamic Sparse Attention: A lightweight indexer is shared across every 4 Transformer layers, so the indexer runs in only one of every four layers. This cuts indexer computation by three quarters and significantly lowers the cost of attention at 1M context.
- Improved MTP (Multi-Token Prediction): IndexShare is also applied in MTP layers, increasing speculative decoding acceptance length by 20%.
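The sharing pattern described above can be illustrated with a small sketch. This is not Zhipu's implementation: the function names, the toy relevance scores, and the top-k selection rule are all assumptions made for illustration. The only detail taken from the article is that one indexer result is shared across every 4 layers.

```python
from typing import List, Tuple

SHARE_EVERY = 4  # from the article: one indexer shared across every 4 layers


def top_k_indices(scores: List[float], k: int) -> List[int]:
    """Hypothetical indexer: return positions of the k highest scores, ascending."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


def run_layers(scores_per_layer: List[List[float]], k: int) -> Tuple[List[List[int]], int]:
    """Walk the layer stack, running the indexer only on the first layer of each group."""
    indexer_calls = 0
    selected: List[int] = []
    per_layer_selection: List[List[int]] = []
    for layer, scores in enumerate(scores_per_layer):
        if layer % SHARE_EVERY == 0:
            # Group leader: compute a fresh sparse token selection.
            selected = top_k_indices(scores, k)
            indexer_calls += 1
        # Every layer in the group attends to the leader's selection.
        per_layer_selection.append(selected)
    return per_layer_selection, indexer_calls


# Toy scores for 8 layers over 6 tokens; each layer sees a rotated copy
# so that a layer's own top-k would differ from its group leader's.
base = [0.1, 0.9, 0.3, 0.7, 0.2, 0.8]
scores = [base[l % 6:] + base[:l % 6] for l in range(8)]

selection, calls = run_layers(scores, k=3)
saved = 1 - calls / len(scores)  # fraction of indexer runs avoided
print(calls, saved)
```

With 8 layers the indexer runs twice instead of eight times, which matches the three-quarters reduction in indexer computation quoted above. Layer 1 reuses layer 0's selection even though its own scores would have picked different tokens; that reuse is the trade-off the sharing scheme accepts.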
Open Source and Ecosystem
- Model weights are publicly available on Hugging Face and ModelScope and are supported by inference frameworks such as Transformers, vLLM, and SGLang.
- On launch day, the model was fully adapted to Chinese computing platforms including Huawei Ascend, T-Head, Moore Threads, Cambricon, Kunlunxin, MetaX, Hygon, and Biren.
- For the first time, Hugging Face offered 6 hours of free compute worldwide for a Chinese model.
Reactions
- Elon Musk replied on X that China might reach the Fable level by Q1 2027; Zhipu founder Tang Jie responded, "It won't take that long," sparking heated discussion among users outside China.
- Marc Andreessen, co-founder of the Silicon Valley venture capital firm a16z, and Perplexity CEO Aravind Srinivas also took note of the release.
- US open-source model researcher Nathan Lambert commented that GLM-5.2 surpasses Google Gemini in agent capabilities, marking the first time a Chinese model has entered the global "Big Three" (Claude, OpenAI, Zhipu).
Impact and Significance
The release of GLM-5.2 marks the first time a Chinese open-source large model has reached the level of world-class closed-source models in agentic programming and long-horizon complex tasks, breaking into a competitive landscape previously dominated by models such as Claude and GPT. Its open-source strategy (MIT license, no regional restrictions) gives developers worldwide a frontier model they can use freely, and moves AI programming from single-shot output toward sustained, long-horizon engineering work.