Qwen develops the Qwen 3.5 35B A 3B FP 8 model, a chat model capable of handling text and image inputs, with strengths in areas like reasoning and visual understanding, as evidenced by its top 25% GPQA score. It features a context window of 262,144 tokens and utilizes an efficient hybrid architecture combining Gated Delta Networks with sparse Mixture-of-Experts.
Input
Output
Context
262K
Max Output
82K
Parameters
36B
Input Modalities
Output Modalities
Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.