OpenAI's GPT-4O-AUDIO Preview-2024-12-17 is a chat model capable of processing both audio and text inputs, with notable capabilities in function calling, JSON mode, reasoning, streaming, and text generation. It is particularly suited for applications requiring long context understanding, thanks to its large context window of 128,000 tokens.
Input
Output
Context
128K
Max Output
16K
Parameters
-
Input Modalities
Output Modalities
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.