DeepSeek builds the Deepseek Coder 6.7B Instruct, a chat model exceling at code completion, generation, and review tasks, thanks to its pre-training on a large project-level code corpus with a 16,384 token context window. This model is notably flexible, offered in various sizes to suit different user requirements, and achieves state-of-the-art performance among open-source code models on multiple benchmarks, including HumanEval and MultiPL-E. With a massive 2T token training dataset, it demonstrates superior capabilities in coding tasks.
Input
Output
Context
16K
Max Output
-
Parameters
6.7B
Input Modalities
Output Modalities
Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.
Data sourced from official provider APIs and documentation
Last updated: Jun 24, 2026
Automatically route workloads to the right model for every task, every time.