Our smallest model, made for building powerful applications on commodity GPUs and edge devices.
Input
Output
Context
128K
Max Output
4K
Parameters
7B
Input Modalities
Output Modalities
| Platform | Input | Output |
|---|---|---|
OpenRouter | $0.037 | $0.150 |
Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.
Data sourced from official provider APIs and documentation
Last updated: Mar 16, 2026
Explore models, compare pricing and benchmarks, and right-size your infrastructure — all in one place.