HK Businesswire

Teaching AI models to say “I’m not sure”

by David Lee
22 April 2026
in Science

Confidence is persuasive. In artificial intelligence systems, it is often misleading.

Today’s most capable reasoning models share a trait with the loudest voice in the room: They deliver every answer with the same unshakable certainty, whether they’re right or guessing. Researchers at MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) have now traced that overconfidence to a specific flaw in how these models are trained, and developed a method that fixes it without giving up any accuracy.

The technique, called RLCR (Reinforcement Learning with Calibration Rewards), trains language models to produce calibrated confidence estimates alongside their answers. In addition to coming up with an answer, the model thinks about its uncertainty in that answer, and outputs a confidence score. In experiments across multiple benchmarks, RLCR reduced calibration error by up to 90 percent while maintaining or improving accuracy, both on the tasks the model was trained on and on entirely new ones it had never seen. The work will be presented at the International Conference on Learning Representations later this month.

The problem traces to a surprisingly simple source. The reinforcement learning (RL) methods behind recent breakthroughs in AI reasoning, including the training approach used in systems like OpenAI’s o1, reward models for getting the right answer, and penalize them for getting it wrong. Nothing in between. A model that arrives at the correct answer through careful reasoning receives the same reward as one that guesses correctly by chance. Over time, this trains models to confidently answer every question they are asked, whether they have strong evidence or are effectively flipping a coin.

That overconfidence has consequences. When models are deployed in medicine, law, finance, or any setting where users make decisions based on AI outputs, a system that expresses high confidence regardless of its actual certainty becomes unreliable in ways that are difficult to detect from the outside. A model that says “I’m 95 percent sure” when it is right only half the time is more dangerous than one that simply gets the answer wrong, because users have no signal to seek a second opinion.

“The standard training approach is simple and powerful, but it gives the model no incentive to express uncertainty or say I don’t know,” says Mehul Damani, an MIT PhD student and co-lead author on the paper. “So the model naturally learns to guess when it is unsure.”

RLCR addresses this by adding a single term to the reward function: a Brier score, a well-established measure that penalizes the gap between a model’s stated confidence and its actual accuracy. During training, models learn to reason about both the problem and their own uncertainty, producing an answer and a confidence estimate together. Confidently wrong answers are penalized. So are unnecessarily uncertain correct ones.

The math backs it up: the team proved formally that this type of reward structure guarantees models that are both accurate and well-calibrated. They then tested the approach on a 7-billion-parameter model across a range of question-answering and math benchmarks, including six datasets the model had never been trained on.

The results showed a consistent pattern. Standard RL training actively degraded calibration compared to the base model, making models worse at estimating their own uncertainty. RLCR reversed that effect, substantially improving calibration with no loss in accuracy. The method also outperformed post-hoc approaches, in which a separate classifier is trained to assign confidence scores after the fact.

“What’s striking is that ordinary RL training doesn’t just fail to help calibration. It actively hurts it,” says Isha Puri, an MIT PhD student and co-lead author. “The models become more capable and more overconfident at the same time.”

The team also demonstrated that the confidence estimates produced by RLCR are practically useful at inference time. When models generate multiple candidate answers, selecting the one with the highest self-reported confidence, or weighting votes by confidence in a majority-voting scheme, improves both accuracy and calibration as compute scales.

An additional finding suggests that the act of reasoning about uncertainty itself has value. The researchers trained classifiers on model outputs and found that including the model’s explicit uncertainty reasoning in the input improved the classifier’s performance, particularly for smaller models. The model’s self-reflective reasoning about what it does and doesn’t know contains real information, not just decoration.

In addition to Damani and Puri, other authors on the paper are Stewart Slocum, Idan Shenfeld, Leshem Choshen, and senior authors Jacob Andreas and Yoon Kim.
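
For readers who want to see the idea in concrete terms, the short Python sketch below illustrates two of the mechanisms described above: a correctness reward with a Brier-style penalty, and confidence-weighted voting over several candidate answers. It is an illustration under stated assumptions, not the researchers' code. The article does not give the exact reward formula, so the form used here (correctness minus the squared gap between confidence and correctness) is an assumption, and the function names calibration_reward and confidence_weighted_vote are hypothetical.

```python
from collections import defaultdict


def calibration_reward(correct: bool, confidence: float) -> float:
    """Correctness reward minus a Brier-style penalty.

    Assumed form (not taken from the article): reward = outcome - (confidence - outcome)^2,
    where outcome is 1.0 for a correct answer and 0.0 for a wrong one.
    """
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must be between 0 and 1")
    outcome = 1.0 if correct else 0.0
    brier_penalty = (confidence - outcome) ** 2
    return outcome - brier_penalty


def confidence_weighted_vote(candidates):
    """Pick the answer whose candidates have the largest total confidence.

    `candidates` is a list of (answer, confidence) pairs, one per sampled response.
    """
    totals = defaultdict(float)
    for answer, confidence in candidates:
        totals[answer] += confidence
    return max(totals, key=totals.get)


if __name__ == "__main__":
    # A confident wrong answer is punished; an unsure correct one earns less than a sure one.
    for correct, confidence in [(True, 1.0), (True, 0.5), (False, 0.0), (False, 1.0)]:
        print(correct, confidence, calibration_reward(correct, confidence))

    # One high-confidence vote for "A" outweighs two low-confidence votes for "B".
    print(confidence_weighted_vote([("A", 0.9), ("B", 0.4), ("B", 0.4)]))
```

Under this assumed reward, a wrong answer stated with full confidence scores lowest, a wrong answer stated with zero confidence is not penalized further, and a correct answer earns the most only when the model is also sure of it. That matches the behavior the article describes: confidently wrong answers and needlessly uncertain correct ones both lose reward.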

Tags: Science
© 2025 by HKBusinesswire.com
