DeepSeek develops the Deepseek Coder 6.7B Base, a chat model exceling at code completion, generation, and review tasks, thanks to its training on a massive 2T token dataset comprising 87% code and 13% natural language. Notably, this model boasts a large context window of 16,384 tokens, enabling it to handle project-level code tasks.
Input
Output
Context
16K
Max Output
-
Parameters
6.7B
Input Modalities
Output Modalities
Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.
Data sourced from official provider APIs and documentation
Last updated: Jun 24, 2026
Automatically route workloads to the right model for every task, every time.