DeepSeek builds the Deepseek Coder 1.3B Instruct, a chat model exceling at code completion, generation, and review tasks, thanks to its pre-training on a large project-level code corpus with a 16,384 token context window. This model is notable for its massive training data of 2T tokens, comprising 87% code and 13% natural language in both English and Chinese.
Input
Output
Context
16K
Max Output
-
Parameters
1.3B
Input Modalities
Output Modalities
Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.