GLM-5.1 is a next-generation flagship chat model from Z.ai, built for agentic engineering and complex, long-horizon tasks. Its strongest area is coding: it achieves state-of-the-art performance on SWE-Bench Pro and leads on NL2Repo and Terminal-Bench 2.0. It also has a large context window of 202,752 tokens, which supports extended reasoning and iteration.
Input
Output
Context: 203K tokens
Max Output: 128K tokens
Parameters: 753.9B
Input Modalities
Output Modalities
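The context and max-output figures above interact when sizing a request. The following is a minimal sketch of that arithmetic, not provider code. It assumes that prompt and completion tokens share the 202,752-token window and that "128K" means 128,000 tokens; both are assumptions, and the function and constant names are hypothetical.

```python
# Context window taken from the description above.
CONTEXT_WINDOW_TOKENS = 202_752
# Assumption: the listed "128K" max output is read as 128,000 tokens.
ASSUMED_MAX_OUTPUT_TOKENS = 128_000


def completion_budget(
    prompt_tokens: int,
    max_output_tokens: int = ASSUMED_MAX_OUTPUT_TOKENS,
    context_window: int = CONTEXT_WINDOW_TOKENS,
) -> int:
    """Return the largest completion size that fits after the prompt.

    Assumes prompt and completion tokens count against the same window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0 or context_window < 0:
        raise ValueError("token counts must be non-negative")
    remaining = context_window - prompt_tokens
    # Never exceed the output cap, and never go below zero.
    return max(0, min(remaining, max_output_tokens))


if __name__ == "__main__":
    for prompt in (10_000, 150_000, 250_000):
        print(prompt, completion_budget(prompt))
```

With a short prompt the output cap is the limit; with a long prompt the remaining window is the limit; a prompt larger than the window leaves no budget at all.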
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026