GPT-4O-AUDIO Preview is a chat model developed by OpenAI, capable of processing both audio and text inputs, with additional features such as vision, function calling, and streaming. It is genuinely best at handling diverse input types, including audio input and output.
Input
Output
Context
128K
Max Output
16K
Parameters
-
Input Modalities
Output Modalities
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.