Input Modalities
Output Modalities
Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.
Data sourced from official provider APIs and documentation
Last updated: Apr 3, 2026
Pricing, benchmarks, and GPU requirements for any model. Free during beta.