Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.
Input
Output
Context
33K
Max Output
33K
Parameters
72B
Input Modalities
Output Modalities
Input
$0.800
Output
$0.800
| Platform | Input | Output |
|---|---|---|
OpenRouter | $0.800 | $0.800 |
Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.
Data sourced from official provider APIs and documentation
Last updated: Mar 17, 2026
Explore models, compare pricing and benchmarks, and right-size your infrastructure — all in one place.