Close Menu
  • Home
  • AI
  • Education
  • Entertainment
  • Food Health
  • Health
  • Sports
  • Tech
  • Well Being

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

AI Law Firm Moritz Shares the Pitch Deck It Used to Raise $9 Million

May 26, 2026

Readers Weigh in: Anthropic’s Boris Cherny Tired of ‘Vibe Coding’

May 26, 2026

AI Is Making the Workplace More Productive — and Less Social

May 26, 2026
Facebook X (Twitter) Instagram
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram
IQ Times Media – Smart News for a Smarter YouIQ Times Media – Smart News for a Smarter You
  • Home
  • AI
  • Education
  • Entertainment
  • Food Health
  • Health
  • Sports
  • Tech
  • Well Being
IQ Times Media – Smart News for a Smarter YouIQ Times Media – Smart News for a Smarter You
Home » Are bad incentives to blame for AI hallucinations?
AI

Are bad incentives to blame for AI hallucinations?

IQ TIMES MEDIABy IQ TIMES MEDIASeptember 7, 2025No Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


A new research paper from OpenAI asks why large language models like GPT-5 and chatbots like ChatGPT still hallucinate and whether anything can be done to reduce those hallucinations.

In a blog post summarizing the paper, OpenAI defines hallucinations as “plausible but false statements generated by language models,” and it acknowledges that despite improvements, hallucinations “remain a fundamental challenge for all large language models” — one that will never be completely eliminated.

To illustrate the point, researchers say that when they asked “a widely used chatbot” about the title of Adam Tauman Kalai’s PhD dissertation, they got three different answers, all of them wrong. (Kalai is one of the paper’s authors.) They then asked about his birthday and received three different dates. Once again, all of them were wrong.

How can a chatbot be so wrong — and sound so confident in its wrongness? The researchers suggest that hallucinations arise, in part, because of a pretraining process that focuses on getting models to correctly predict the next word, without true or false labels attached to the training statements: “The model sees only positive examples of fluent language and must approximate the overall distribution.”

“Spelling and parentheses follow consistent patterns, so errors there disappear with scale,” they write. “But arbitrary low-frequency facts, like a pet’s birthday, cannot be predicted from patterns alone and hence lead to hallucinations.”

The paper’s proposed solution, however, focuses less on the initial pretraining process and more on how large language models are evaluated. It argues that the current evaluation models don’t cause hallucinations themselves, but they “set the wrong incentives.”

The researchers compare these evaluations to the kind of multiple-choice tests where random guessing makes sense, because “you might get lucky and be right,” while leaving the answer blank “guarantees a zero.” 

Techcrunch event

San Francisco
|
October 27-29, 2025

“In the same way, when models are graded only on accuracy, the percentage of questions they get exactly right, they are encouraged to guess rather than say ‘I don’t know,’” they say.

The proposed solution, then, is similar to tests (like the SAT) that include “negative [scoring] for wrong answers or partial credit for leaving questions blank to discourage blind guessing.” Similarly, OpenAI says model evaluations need to “penalize confident errors more than you penalize uncertainty, and give partial credit for appropriate expressions of uncertainty.”

And the researchers argue that it’s not enough to introduce “a few new uncertainty-aware tests on the side.” Instead, “the widely used, accuracy-based evals need to be updated so that their scoring discourages guessing.”

“If the main scoreboards keep rewarding lucky guesses, models will keep learning to guess,” the researchers say.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
IQ TIMES MEDIA
  • Website

Related Posts

What ClickUp’s mass layoff tells us about the future of work

May 25, 2026

The pope’s AI encyclical isn’t really about AI

May 25, 2026

Startup Battlefield 200 applications close before May 27  | TechCrunch

May 25, 2026
Add A Comment
Leave A Reply Cancel Reply

Editors Picks

America’s schools face a reckoning on digital devices

May 26, 2026

Guide to the Scripps National Spelling Bee: How to watch, rules, prizes

May 25, 2026

Scott Remer makes a good living as a National Spelling Bee coach

May 23, 2026

Ex-Columbia student Mahmoud Khalil asks Supreme Court to intervene in his deportation fight

May 22, 2026
Education

America’s schools face a reckoning on digital devices

By IQ TIMES MEDIAMay 26, 20260

Just a few years ago, America’s public schools were rushing to get every child a…

Guide to the Scripps National Spelling Bee: How to watch, rules, prizes

May 25, 2026

Scott Remer makes a good living as a National Spelling Bee coach

May 23, 2026

Ex-Columbia student Mahmoud Khalil asks Supreme Court to intervene in his deportation fight

May 22, 2026
IQ Times Media – Smart News for a Smarter You
Facebook X (Twitter) Instagram Pinterest Vimeo YouTube
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 iqtimes. Designed by iqtimes.

Type above and press Enter to search. Press Esc to cancel.