GLM-4-9B is an open-source 9-billion parameter chat model from Z.ai, part of the GLM-4 series. It is designed for multi-turn dialogue and excels in semantic understanding, mathematics, reasoning, code, and knowledge tasks, outperforming Llama-3-8B on several benchmarks.
Input
Output
Context
8K
Max Output
-
Parameters
9.4B
Input Modalities
Output Modalities
Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.