OpenAI

GPT-REALTIME-1.5

Name: GPT-REALTIME-1.5
Author: OpenAI

OpenAI's GPT-REALTIME-1.5 is a multifaceted audio model capable of processing audio, converting speech to text and text to speech, as well as handling vision tasks. It is genuinely best at handling a wide range of inputs, including audio, images, and text.

Input

Output

Context

128K

Max Output

Parameters

Technical Specifications

Model TypeAudio

Context Window128,000 tokens

Max Output Tokens4,096 tokens

ParametersNot available

Release DateFeb 19, 2026

Training CutoffNot available

Licenseproprietary

Open SourceNo

Input Modalities

AudioImageText

Output Modalities

AudioText

Capabilities

Resources & Links

OpenAI Docs

Official model documentation

proprietary, Proprietary - API access only

Browse More Models

Related Tools

Compare This Model

Compare this model against top alternatives

Browse All Models

Explore other models in the catalog

Data sourced from official provider APIs and documentation

Last updated: Jun 23, 2026

Start building with the right model.

Automatically route workloads to the right model for every task, every time.

Start Building Read the docs

Inferbase

Back to Models