OpenAI's GPT-REALTIME Mini is an audio model capable of various tasks, including audio processing, speech-to-text, text-to-speech, and vision. It is genuinely best at handling multiple input types, including audio, images, and text.
Input
Output
Context
128K
Max Output
4K
Parameters
-
Input Modalities
Output Modalities
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.