Kimi VL A 3B Thinking 2506, developed by Moonshot AI, is a chat model built for multimodal reasoning and visual perception. It scores 84.4 on MMBench-EN-v1.1 and 78.4 on MMVet. The model accepts text and image inputs, including high-resolution images, within a context window of 131,072 tokens.
Context: 131K
Max Output: 32K
Parameters: 16.4B
Input Modalities: text, image
Output Modalities
Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.
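As a rough illustration of what an INT8-based estimate implies, the sketch below computes the memory needed for the weights alone from the listed parameter count. It assumes 1 byte per parameter for INT8 and 2 bytes for FP16, and it ignores the KV cache, activations, and framework overhead, so real requirements will be higher. The function name `weight_memory_gb` is hypothetical and not part of any provider API.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float = 1.0) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


PARAMS = 16.4e9  # parameter count listed on this page (16.4B)

# Assumption: INT8 stores 1 byte per parameter, FP16 stores 2 bytes.
int8_gb = weight_memory_gb(PARAMS, 1.0)
fp16_gb = weight_memory_gb(PARAMS, 2.0)

print(f"INT8 weights: ~{int8_gb:.1f} GB")
print(f"FP16 weights: ~{fp16_gb:.1f} GB")
```

Long contexts add to this figure because the KV cache grows with the number of tokens held in the window.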
Data sourced from official provider APIs and documentation
Last updated: Jun 24, 2026