← Back to Insights

Mira Murati's Thinking Machine Lab

Nils Liu
News Blog GenAI Architecture
Mira Murati's Thinking Machine Lab

Here’s a question most people assume is trivial: if you set an AI model’s temperature to 0, do you get perfectly consistent outputs every time? Intuition says yes. Reality says no — and the reason is more fundamental than most people realize.

Thinking Machine Lab (Mira Murati’s startup — which raised $2B and hit a $10B valuation before shipping a single product, purely on the team’s credibility) published a deep-dive on this on September 10th.

When you run 1,000 identical prompts against a “deterministic” model, you get dozens of different outputs. The source isn’t model randomness — it’s batch processing on the server side. How the server groups incoming requests changes the numerical outcomes of normalization, matrix multiplication, and attention operations in ways that cascade into different outputs.

The implications are serious:

  • Benchmark scores can vary by up to 5% depending on server load
  • Developers often can’t reproduce specific failure cases because batch configuration has changed
  • For regulated industries, you can’t guarantee consistent auditable behavior

The fix Thinking Machine Lab developed: batch-invariant versions of the core operations — ensuring identical inputs produce identical outputs regardless of how requests are grouped. In testing, it works. The tradeoff: roughly 60% slower.

For most use cases, the speed cost isn’t worth it. For regulated financial, healthcare, or legal applications where consistency is a compliance requirement, it might be.

A practical diagnostic: run the same prompt 100 times and count how many distinct outputs you get. If you’re building anything mission-critical with AI, you should know that number.

The encouraging part: this is a solvable technical problem, not an inherent property of AI systems. Expect the performance cost to come down as the approach matures.

💬 Read more: 2025 Year in Review (English)

Get the latest insights

Join the newsletter to receive my latest articles on GenAI, AI Agents, and architecture.

No spam. Unsubscribe anytime.