CBRSPrivate
Evidence
100%Authoritative
FactConfirmedProduct·November 6, 2025

Cerebras Delivers 3,000 Tokens/Second Inference for OpenAI's gpt-oss-120B Open-Weight Model

Cerebras announced support for OpenAI's first open-weight reasoning model (gpt-oss-120B) on its inference service, running at ~3,000 tokens per second — claimed to be ~55x faster and ~60x cheaper than Anthropic's Claude 4 Opus.

Evidence Strength

Evidence
100%Authoritative
Backed by official company doc
Reported by 2 independent publishers
Includes official or primary source
Key Development
High-significance development (rated 8/10)
Covered by 4 sources
Confirmed — verified event

Insights

First tracked

August 5, 2025

Last updated

November 6, 2025

Sources

4 sources