Techsoma Africa
Latest Startups AI FinTech Global Tech Apps Opinions Reports
Policy & Regulations Artificial Intelligence Reports About Contact Advertise African Startup Ecosystem Artificial Intelligence FinTech & Digital Money Global News Technology Apps, Gadgets, Tools & Softwares Opinions & Perspectives Reports
Techsoma Africa
No Result
View All Result
Techsoma Africa
No Result
View All Result
Techsoma Africa
No Result
View All Result
Home Artificial Intelligence

AI Hallucinations Are Getting Worse as Models Scale, and the Industry Has No Real Fix

by Kingsley Okeke
March 13, 2026
in Artificial Intelligence
Reading Time: 5 mins read
AI Hallucinations

The artificial intelligence industry has spent years promising that AI hallucinations (the confident fabrication of false information) would diminish as models grew more powerful. The data increasingly tells a different story.

OpenAI’s own internal testing revealed that its o3 and o4-mini reasoning models hallucinate at significantly higher rates than their predecessors. On the PersonQA benchmark, o3 hallucinated 33% of the time; more than double the 16% rate recorded by o1. The smaller o4-mini performed even worse at 48%. OpenAI’s own technical report admitted that “more research is needed” to understand why.

When More Thinking Produces More Errors

The pattern is counterintuitive but now well-documented. Models built for deeper, chain-of-thought reasoning tend to perform worse on factual accuracy benchmarks than simpler predecessors. The leading hypothesis is structural: reasoning models invest computational effort into working through answers, which can lead them to fill knowledge gaps with plausible-sounding guesses rather than acknowledging uncertainty. Independent research by Transluce, a nonprofit AI lab, found that o3 also fabricates actions it claims to have taken, including, in one documented case, running code on a physical laptop outside of ChatGPT.

An MIT study from early 2025 added a disturbing dimension. When AI models hallucinate, they tend to use more confident language than when they are factually correct. Some models were 34% more likely to use phrases like “definitely” and “certainly” when generating incorrect information. The more wrong the model is, the more certain it sounds.

The Benchmark Problem

Part of the confusion around hallucination trends is methodological. On tightly controlled tasks (such as summarising a provided document), some models have shown genuine improvement, with a handful now sitting below the 1% threshold on summarisation-specific benchmarks. But those tests measure a narrow slice of how AI is actually used.

In medical settings, hallucination rates in clinical scenarios ranged from 64% to over 80% for open-source models, even when mitigation prompts were applied. Legal queries fared no better: Stanford research found that models hallucinate between 69% and 88% of the time on specific legal questions.

Why No One Is Really Fixing It

The core problem is architectural. Large language models are prediction engines, not knowledge retrieval systems. They generate the statistically most likely next word based on training patterns, with no internal mechanism to distinguish known facts from plausible fictions. A 2025 paper from OpenAI and MIT researchers demonstrated mathematically why this tendency persists through training: the way models are currently evaluated rewards confident guessing over calibrated uncertainty, so models learn to bluff.

As OpenAI acknowledged in a September 2025 paper, standard benchmarks penalise uncertainty and reward accuracy scores, meaning a model that guesses will outperform one that says “I don’t know,” even if the guessing model produces far more incorrect answers.

Retrieval-Augmented Generation, which anchors model responses to external source documents, reduces hallucination rates by up to 42% when properly implemented, but it only works when there is a document to anchor to. For the open-ended questions that represent much of real-world AI usage, no reliable solution currently exists.

For users relying on AI tools in healthcare, law, journalism, or financial analysis, that gap remains very much open.

Kingsley Okeke

Kingsley Okeke

I'm a skilled content writer, anatomist, and researcher with a strong academic background in human anatomy. I hold a degree...

Recommended For You

African Startup Ecosystem

Zimbabwe Unveils National AI Strategy Focused on Local Innovation

by Faith Amonimo
June 8, 2026

Zimbabwe has launched a serious AI plan with clear goals for talent, data, startups, and public services. This article explains what the Zimbabwe National AI Strategy gets right and where...

Read moreDetails

Coursera now offers an AI learning feed that turns quick scrolls into study time

June 8, 2026

Côte d’Ivoire to Establish a University Dedicated to AI to Address Digital Skills Shortage

June 8, 2026

Meta rolls out Business Agent across WhatsApp, Instagram, and Messenger

June 4, 2026

Google AI Search Just Changed How You Find Anything Online

June 1, 2026
Next Post

How Founders Can Switch Off Pitch Mode and Build Better Personal Relationships

HOSTAFRICA

HOSTAFRICA Deploys Africa's First NVIDIA RTX PRO 6000 Blackwell GPU Servers in South Africa

Please login to join discussion

Browse by Category

  • African Startup Ecosystem
  • African Telecommunications
  • Apps, Gadgets, Tools & Softwares
  • Artificial Intelligence
  • Business & Markets
  • Creator Economy
  • Cybersecurity
  • Digital Work-Life Series
  • E-Commerce
  • Event Radar Africa
  • Exclusive Interviews
  • Explainers
  • Fabfilter Total Bundle
  • Features/Spotlights
  • FinTech & Digital Money
  • Funding news
  • GenZ Desk!
  • Global News
  • Logistics & Mobility Tech
  • Marvel Rivals Nude Mod
  • Media & Entertainment
  • News
  • Opinions & Perspectives
  • Opportunities, Careers & Learning
  • Partner
  • Policy & Regulations
  • Reports
  • Reviews
  • Tech Insights for Creators
  • Technology
  • Uncategorized
  • About Us
  • Advertise on Techsoma
  • Contact
  • Privacy Policy
  • Publish Your Articles
  • T & C
  • Techsoma Africa

Copyright 2026 Techsoma Africa. All rights reserved.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Techsoma Africa

© 2026 Techsoma Africa Media.

Company

Policy AI Reports About Contact Advertise

Legal

Terms Privacy RSS

Latest

Democracy Day: How Technology Is Changing Civic Engagement in Nigeria Every June 12, Nigeria marks a day that cost its people dearly. The date honours the annulled 1993... Airtel Nigeria Deploys 200 Solar Towers in 12 Months. Is It Enough to Challenge MTN?   Airtel Nigeria deployed 200 solar-powered telecom towers between April 2025 and March 2026 across rural and urban... NRS Launches Rev360 Digital Tax Platform to Replace TaxPro Max and Widen Nigeria’s Tax Net The Nigeria Revenue Service has formally unveiled Rev360, its next-generation digital tax administration platform, at a launch event...
No Result
View All Result
  • About Us
  • Advertise on Techsoma
  • Contact
  • Privacy Policy
  • Publish Your Articles
  • T & C
  • Techsoma Africa

Copyright 2026 Techsoma Africa. All rights reserved.