The H200 is NVIDIA's Hopper-architecture flagship AI accelerator, featuring 141 GB of HBM3e memory for handling the largest language models. It delivers strong performance for both training and inference workloads.
VRAM: 141 GB
Memory type: HBM3e
Memory bandwidth: 4,800 GB/s
TDP: 700 W
Large Language Models: training and inference for models like GPT-4 and Llama 70B+
Deep Learning Training: high-performance training for neural networks
Distributed Training: multi-node training with fast interconnects
High-Throughput Inference: optimized for batched inference workloads
Optimized for large language models with 141 GB of HBM3e
Estimates based on INT8 quantization. Actual fit depends on framework and batch size.
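The fit estimate above can be sketched as a quick back-of-the-envelope check. This is a minimal sketch, not the site's actual estimator: it assumes roughly 1 byte per parameter under INT8 plus a flat ~20% overhead for KV cache and activations, both of which vary by framework, context length, and batch size.

```python
def fits_in_vram(params_billions: float, vram_gb: float = 141.0,
                 bytes_per_param: float = 1.0, overhead: float = 0.20) -> bool:
    """Rough INT8 fit check: weights plus a flat overhead factor.

    Assumptions (illustrative, not exact): ~1 byte/param for INT8 weights,
    ~20% extra for KV cache and activations. Real usage depends on the
    framework, sequence length, and batch size.
    """
    needed_gb = params_billions * bytes_per_param * (1 + overhead)
    return needed_gb <= vram_gb

print(fits_in_vram(70))    # Llama 70B in INT8 -> True (~84 GB needed)
print(fits_in_vram(180))   # -> False (~216 GB needed)
```

Swapping `bytes_per_param` to 2.0 gives a comparable FP16/BF16 estimate.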
Added Jan 25, 2026
Last updated: Jan 25, 2026