OpenAI's GPT-REALTIME-2025-08-28 is an audio model capable of various tasks, including audio processing, speech-to-text, streaming, text generation, and text-to-speech. It is genuinely best at handling multiple input formats, including audio, image, and text.
Input
Output
Context
128K
Max Output
4K
Parameters
-
Input Modalities
Output Modalities
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.