A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural-sounding voices and maintains better voice consistency.
Context: 128K tokens
Max Output: 16K tokens
Parameters: —
Data sourced from official provider APIs and documentation
Last updated: May 5, 2026