CBRS · Private
Evidence: 96% · Authoritative
Fact · Confirmed · Product · June 2, 2025

Cerebras Sets Llama 4 Maverick Inference Speed Record at 2,500+ Tokens/Sec

Cerebras set a world record of more than 2,500 tokens/sec per user on Meta's 400B-parameter Llama 4 Maverick model, as measured by Artificial Analysis. That is more than double Nvidia's DGX B200 benchmark of 1,000 tokens/sec.

Evidence Strength

Backed by an official company document
Single publisher source
Includes official or primary source
Key Development
High-significance development (rated 8/10)
Confirmed — verified event

Insights

First tracked: June 2, 2025

Last updated: June 2, 2025

Sources

Cerebras Sets Llama 4 Maverick Inference Speed Record at 2,500+ Tokens/Sec — Cerebras Systems | OpenCall