Evidence: 96% · Authoritative
Fact · Confirmed · Product · November 6, 2025
CS-3 vs. NVIDIA DGX B200 Blackwell Benchmarks Published
Cerebras published benchmarks showing the CS-3 is 21x faster at inference than NVIDIA's DGX B200 Blackwell on Llama 3 70B, at one-third the cost and one-third the power, and maintained a roughly 5x speed advantage on GPT-OSS-120B over Blackwell's best published GPU results.
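Taken at face value, the headline figures imply even larger efficiency multiples than the raw speedup. A minimal sketch of that arithmetic, using only the 21x, one-third-cost, and one-third-power numbers from the claim above (the derived per-dollar and per-watt multiples are illustrative and assume linear scaling, not figures Cerebras published):

```python
# Illustrative arithmetic on the claimed CS-3 vs. DGX B200 figures.
# Inputs are the published claims; derived values are assumptions.
speedup = 21.0           # claimed inference speedup on Llama 3 70B
cost_ratio = 1.0 / 3.0   # claimed CS-3 cost relative to DGX B200
power_ratio = 1.0 / 3.0  # claimed CS-3 power relative to DGX B200

# Throughput per unit cost and per unit power, relative to the GPU
# baseline, if both ratios hold for the same workload.
tokens_per_dollar_advantage = speedup / cost_ratio  # 21 / (1/3) = 63x
tokens_per_watt_advantage = speedup / power_ratio   # 21 / (1/3) = 63x

print(f"tokens/$ advantage: {tokens_per_dollar_advantage:.0f}x")
print(f"tokens/W advantage: {tokens_per_watt_advantage:.0f}x")
```

Under these assumptions a 21x speedup at one-third the cost compounds to roughly a 63x throughput-per-dollar advantage, which is why the cost and power ratios matter as much as the raw speed claim.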
Evidence Strength
Evidence: 96% · Authoritative
- Backed by official company documentation
- Single publisher source
- Includes an official or primary source
Key Development
- High-significance development (rated 7/10)
- Covered by 3 sources
- Confirmed — verified event
Insights
First tracked: September 19, 2025
Last updated: November 6, 2025
Sources: 3
Related Developments
- Oklahoma City AI Datacenter Ribbon-Cutting with 44+ Exaflops
- Cerebras Delivers 3,000 Tokens/Second Inference for OpenAI's gpt-oss-120B Open-Weight Model
- Jais 2 Arabic-Centric LLMs Trained and Deployed on Cerebras Wafer-Scale Clusters
- GLM-4.7 Available on Cerebras Inference Cloud at 1,000-1,700 Tokens/Second
- OpenAI Signs $10B+ Multiyear Compute Deal with Cerebras
Sources (3)
Source Timeline
- OpenAI GPT-OSS 120B Benchmarked – NVIDIA Blackwell vs. Cerebras (Cerebras, Nov 6, 2025)
- Cerebras CS-3 vs. Groq LPU (Cerebras, Sep 19, 2025)
- Cerebras CS-3 vs. Nvidia DGX B200 Blackwell (Cerebras, Sep 19, 2025)