GLM-4-9B is an open-source chat model from the GLM-4 series, built by Zhipu AI. It is designed for multi-round conversations and features advanced capabilities such as web browsing, code execution, custom tool calls, and long text reasoning. The model supports a context window of up to 128K tokens and multi-language support across 26 languages.
Input
Output
Context
8K
Max Output
-
Parameters
9.4B
Input Modalities
Output Modalities
Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.