Zhipu GLM-5.2 Released and Open-Sourced: First in Code Arena Among Globally Available Models, 1M Context Long-Form Programming Leap

On June 17, 2025, Zhipu AI officially released and open-sourced its flagship model GLM-5.2 under the MIT license, supporting 1 million token context. It achieved first place among globally available models in the Code Arena front-end development blind test (second only to the unreleased Claude Fable 5) and first place globally in Design Arena.

Core Capabilities and Benchmark Performance

Programming & Agents: On the FrontierSWE benchmark, GLM-5.2 trails Claude Opus 4.8 by only ~1% and leads GPT-5.5 by ~1%; on PostTrainBench, it surpasses Opus 4.7 and GPT-5.5, second only to Opus 4.8; on the SWE-Marathon ultra-long task benchmark, it ranks second, behind the Opus series. On the standard programming benchmark Terminal-Bench 2.1, it scores 81.0 (GLM-5.1: 63.5), and on SWE-bench Pro, it scores 62.1 (GLM-5.1: 58.4).
Long Context: Supports 1 million token context, excelling on long-form programming benchmarks. The team reduced single-token FLOPs by approximately 2.9x via the IndexShare mechanism (shared indexer every 4 layers) and optimized the inference engine to improve long-context throughput.
Thinking Effort Levels: Introduces effort level control, allowing users to balance capability and computational cost. The Max level further extends programming capabilities.

Architecture Innovations

IndexShare Dynamic Sparse Attention: A lightweight indexer is shared every 4 Transformer layers, reducing indexer computation overhead by 3/4, significantly lowering computation at 1M context.
Improved MTP (Multi-Token Prediction): IndexShare is also applied in MTP layers, increasing speculative decoding acceptance length by 20%.

Open Source and Ecosystem

Model weights are publicly available on HuggingFace and ModelScope, supporting inference frameworks such as transformers, vLLM, and SGLang.
On launch day, full adaptation was completed with domestic computing platforms including Huawei Ascend, T-Head, Moore Threads, Cambricon, Kunlunxin, Muxi, Haiguang, and Biren.
Hugging Face provided 6 hours of free global computing power for a domestic model for the first time.

Reactions

Elon Musk replied on X that China might reach the Fable level by Q1 2027; Zhipu founder Tang Jie responded, "It won't take that long," sparking heated discussion among overseas netizens.
Silicon Valley VC a16z co-founder Marc Andreessen and Perplexity CEO Aravind Srinivas also took note.
US open-source model researcher Nathan Lambert commented that GLM-5.2 surpasses Google Gemini in agent capabilities, marking the first time a Chinese model has entered the global "Big Three" (Claude, OpenAI, Zhipu).

Impact and Significance

The release of GLM-5.2 marks the first time a Chinese open-source large model has reached the level of world-class closed-source models in agent programming and long-form complex tasks, breaking the competitive landscape previously dominated by models like Claude and GPT. Its open-source strategy (MIT license, no regional restrictions) provides global developers with a cutting-edge model that can be freely used, advancing AI programming from single-shot output to long-term engineering capabilities.

Zhipu GLM-5.2 Released and Open-Sourced: First in Code Arena Among Globally Available Models, 1M Context Long-Form Programming Leap

Core Capabilities and Benchmark Performance

Architecture Innovations

Open Source and Ecosystem

Reactions

Impact and Significance

Documentation

Getting Started

Learn more