OpenAI

GPT-4O Mini

Name: GPT-4O Mini
Author: OpenAI

GPT-4O Mini is a chat model developed by OpenAI, capable of handling various tasks such as code generation, reasoning, and text generation, with support for multiple input formats including image, PDF, and text. It is notable for its large context window of 128,000 tokens, allowing it to process and understand extensive amounts of information.

Input

Output

Context

128K

Max Output

16K

Parameters

Technical Specifications

Model TypeChat

Context Window128,000 tokens

Max Output Tokens16,384 tokens

Parameters8B

Release DateJul 16, 2024

Training CutoffOct 1, 2023

Licenseproprietary

Open SourceNo

Input Modalities

ImageText

Output Modalities

Text

Capabilities

Benchmarks

Artificial Analysis

4.0%

HLE

11.7%

AIME

42.6%

GPQA

22.9%

SciCode

78.9%

MATH 500

64.8%

MMLU Pro

14.7

Math

23.4%

LiveCodeBench

6.9

Intelligence

79.2

Speed (tok/s)

0.551

TTFA (s)

0.551

TTFT (s)

Resources & Links

OpenAI Docs

Official model documentation

proprietary, Proprietary - API access only

Estimated GPU Requirements for GPT-4O Mini

Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.

NVIDIA T4

16 GB VRAM

73% used

NVIDIA A10

24 GB VRAM

49% used

NVIDIA L4

24 GB VRAM

49% used

NVIDIA V100 PCIe 32GB

32 GB VRAM

36% used

Use GPU Sizing Calculator for custom configurations

Browse More Models

Related Tools

Compare This Model

Compare this model against top alternatives

Browse All Models

Explore other models in the catalog

Data sourced from official provider APIs and documentation

Last updated: Jun 24, 2026

Start building with the right model.

Automatically route workloads to the right model for every task, every time.

Start Building Read the docs

Inferbase

Back to Models

OpenAI

GPT-4O Mini

Add to Compare

Input

Output

Context

128K

Max Output

16K

Parameters

Technical Specifications

Model TypeChat

Context Window128,000 tokens

Max Output Tokens16,384 tokens

Parameters8B

Release DateJul 16, 2024

Training CutoffOct 1, 2023

Licenseproprietary

Open SourceNo

Input Modalities

ImageText

Output Modalities

Text

Capabilities

Benchmarks

Artificial Analysis

4.0%

HLE

11.7%

AIME

42.6%

GPQA

22.9%

SciCode

78.9%

MATH 500

64.8%

MMLU Pro

14.7

Math

23.4%

LiveCodeBench

6.9

Intelligence

79.2

Speed (tok/s)

0.551

TTFA (s)

0.551

TTFT (s)

Resources & Links

OpenAI Docs

Official model documentation

proprietary, Proprietary - API access only

Estimated GPU Requirements for GPT-4O Mini

Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.

NVIDIA T4

16 GB VRAM

73% used

NVIDIA A10

24 GB VRAM

49% used

NVIDIA L4

24 GB VRAM

49% used

NVIDIA V100 PCIe 32GB

32 GB VRAM

36% used

Use GPU Sizing Calculator for custom configurations

Browse More Models

Related Tools

Compare This Model

Compare this model against top alternatives

Browse All Models

Explore other models in the catalog

Data sourced from official provider APIs and documentation

Last updated: Jun 24, 2026

Start building with the right model.

Automatically route workloads to the right model for every task, every time.

Start Building Read the docs