For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. Despite its small size, it offers a 1 million token context window and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding, all higher than GPT‑4o mini. It's ideal for tasks like classification or autocompletion.
| Spec | Value |
|---|---|
| Context | 1,048K tokens |
| Max Output | 33K tokens |
| Parameters | — |
| Platform | Input (per 1M tokens) | Output (per 1M tokens) |
|---|---|---|
| OpenRouter | $0.100 | $0.400 |
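Given the per-1M-token rates in the table above, the cost of a single request can be estimated from its token counts. A minimal sketch (the function name and example token counts are illustrative, not from the source):

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  input_rate: float = 0.100, output_rate: float = 0.400) -> float:
    """Estimate request cost in USD, with rates expressed per 1M tokens.

    Defaults use the OpenRouter rates listed above for GPT-4.1 nano.
    """
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000

# Example: a classification request with 10,000 input tokens
# and 1,000 output tokens.
cost = estimate_cost(10_000, 1_000)
print(f"${cost:.4f}")  # → $0.0014
```

At these rates, output tokens cost four times as much as input tokens, so short completions (as in classification or autocompletion) keep per-request cost dominated by the prompt.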
Data sourced from official provider APIs and documentation
Last updated: Mar 16, 2026