We built #AI to be confident. Turns out confidence and accuracy parted ways.
New research from OpenAI shows that training models to say “I don’t know” cuts hallucinations sharply. The problem was never capability; it was incentives.
AI models ‘learned’ that guessing gets rewarded more than abstaining, because most benchmarks grade on accuracy alone: a wrong answer and an honest “I don’t know” both score zero.
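To see the incentive in numbers, here’s a tiny sketch (the penalty values are my illustration, not from the paper). Under accuracy-only grading, guessing always beats abstaining in expectation; once wrong answers carry any penalty, an unsure model does better by abstaining:

```python
# Expected score of guessing vs. abstaining (abstaining scores 0 either way).
def expected_score(p_correct: float, wrong_penalty: float) -> float:
    """Expected score of guessing, correct with probability p_correct."""
    return p_correct * 1.0 + (1.0 - p_correct) * wrong_penalty

p = 0.3  # model is only 30% sure of the answer

# Accuracy-only grading: wrong answers cost nothing, so guessing wins.
print(expected_score(p, wrong_penalty=0.0))   # 0.3 > 0.0 -> always guess

# Grading that penalizes confident errors: abstaining now wins.
print(expected_score(p, wrong_penalty=-1.0))  # -0.4 < 0.0 -> say "I don't know"
```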
Meanwhile, Thinking Machines Lab documented how to make AI boringly predictable. Not just setting temperature to zero, but controlling every random seed, every batch, every library call. Same input, same output, always.
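For the determinism side, a minimal sketch of the kind of pinning involved, assuming a PyTorch stack (illustrative only; the Thinking Machines Lab write-up goes further, down to batch-invariant GPU kernels):

```python
import os
import random

import numpy as np
import torch

def make_deterministic(seed: int = 0) -> None:
    """Pin every source of randomness a typical PyTorch run touches."""
    random.seed(seed)        # Python's own RNG
    np.random.seed(seed)     # NumPy RNG
    torch.manual_seed(seed)  # CPU and all CUDA RNGs
    # Required by cuBLAS for deterministic matmuls on CUDA >= 10.2.
    os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"
    # Error out if any op falls back to a nondeterministic implementation.
    torch.use_deterministic_algorithms(True)
    # Disable cuDNN autotuning, which can pick different kernels run to run.
    torch.backends.cudnn.benchmark = False
```

Even with all of this pinned, identical outputs across serving batches still require batch-invariant kernels; that is the part the lab’s post actually digs into.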
Both reach the same conclusion from different angles: hallucinations aren’t a bug in the technology. They’re a feature of how we trained it.
The fix appears to be embarrassingly human: make “I don’t know” an acceptable output.