Aya Expanse 32B is a chat model developed by Cohere, capable of handling text inputs and generating text outputs, with additional support for streaming. It is genuinely best at handling long-form conversations and generating extensive text due to its large context window of 128,000 tokens.
Input
Output
Context
128K
Max Output
-
Parameters
32B
Input Modalities
Output Modalities
Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.