DeepSeek develops the DeepSeek V 3.2 Speciale, a chat model exceling at code generation, function calling, and reasoning, with notable strengths in coding and mathematical tasks. Its technical architecture features DeepSeek Sparse Attention, an efficient attention mechanism optimized for long-context scenarios, supporting a context window of 163,840 tokens.
Input
Output
Context
164K
Max Output
64K
Parameters
685.4B
Input Modalities
Output Modalities
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.