Dual-slot PCIe form factor of Intel's third-generation Gaudi accelerator, with 128 GB HBM2e and 24 integrated 200 GbE ports for scale-out via standard Ethernet (no proprietary fabric needed). Targets cost-conscious LLM inference and fine-tuning.
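The scale-out claim above is easy to sanity-check with arithmetic: 24 integrated ports at 200 GbE each give the aggregate line rate per card (before Ethernet protocol overhead, which this sketch ignores).

```python
# Aggregate scale-out bandwidth per card from the figures above:
# 24 ports x 200 GbE (raw line rate; protocol overhead not modeled).

ports = 24
gbit_per_port = 200

total_gbps = ports * gbit_per_port   # aggregate line rate in Gb/s
total_gb_per_s = total_gbps / 8      # convert bits to bytes

print(total_gbps, "Gb/s =", total_gb_per_s, "GB/s")
```

At line rate this works out to 4,800 Gb/s (600 GB/s) of Ethernet bandwidth per card; deliverable throughput depends on the switch fabric and collective-communication library in use.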
VRAM: 128 GB
Memory type: HBM2e
Memory bandwidth: 3,700 GB/s
TDP: 600 W
Large Language Models: training and inference for models like GPT-4, Llama 70B+
Deep Learning Training: high-performance training for neural networks
Distributed Training: multi-node training with fast interconnects
High-Throughput Inference: optimized for batched inference workloads
compute_cores reflects 64 TPC (Tensor Processor Cores); ai_accelerators is the count of dedicated MME (Matrix Multiplication Engines). Compute figures are dense (no sparsity).
Estimates based on INT8 quantization. Actual fit depends on framework and batch size.
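A rough way to apply the note above: at INT8, weights take about 1 byte per parameter, so a quick fit check against the card's 128 GB HBM is simple arithmetic. The 20% headroom factor below is an assumption for activations, KV cache, and framework buffers, not a vendor figure.

```python
# Sketch of an INT8 fit estimate against 128 GB of HBM.
# The overhead factor is assumed, not measured; real fit depends
# on framework, sequence length, and batch size (as the note says).

HBM_GB = 128  # per-card HBM2e capacity from the spec above

def fits_int8(params_billion: float, overhead: float = 0.20) -> bool:
    """INT8 stores ~1 byte/parameter, so 1 GB per billion parameters;
    add assumed headroom for activations, KV cache, and buffers."""
    weight_gb = params_billion * 1.0
    return weight_gb * (1 + overhead) <= HBM_GB

print(fits_int8(70))    # a 70B model: ~84 GB with headroom
print(fits_int8(180))   # a 180B model: over budget on one card
```

By this estimate a 70B-parameter model fits comfortably on a single card at INT8, while models approaching ~105B+ would need quantization below 8 bits or multi-card sharding.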
Added: Apr 30, 2026
Last updated: Apr 30, 2026
From model selection to production: one platform, no fragmentation.