GPT-4O-REALTIME Preview is an audio model developed by OpenAI, capable of processing both audio and text inputs, with additional functionalities including vision, audio input and output, real-time processing, and function calling. It is genuinely best at handling a wide range of inputs and tasks, including audio and text-based interactions.
Input
Output
Context
128K
Max Output
4K
Parameters
-
Input Modalities
Output Modalities
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.