
Credal Transformer: A New Approach to Combat Hallucinations in Large Language Models

TLDR: The Credal Transformer is a novel AI architecture that addresses the problem of hallucinations in Large Language Models (LLMs) by replacing the standard Softmax attention mechanism with a Credal Attention Mechanism (CAM). This new mechanism quantifies the model’s uncertainty at each layer, allowing it to identify out-of-distribution inputs, quantify ambiguity, and abstain from making confident, incorrect predictions on unanswerable questions. The approach integrates uncertainty as a core component of the model with minimal computational overhead, aiming to build more reliable and trustworthy AI systems.

Large Language Models, or LLMs, have become incredibly powerful tools, capable of generating text that often sounds indistinguishable from human writing. However, a significant challenge persists: the phenomenon of “hallucination.” This is when an LLM confidently presents factually incorrect information, which can severely limit its reliability in critical applications.

A new research paper, Credal Transformer: A Principled Approach for Quantifying and Mitigating Hallucinations in Large Language Models, proposes that the root cause of these hallucinations might lie within the very architecture of the Transformer model itself. Specifically, the authors – Shihao Ji, Zihui Song, and Jiajie Huang – point to the Softmax function used in the attention mechanism. They argue that Softmax creates “Artificial Certainty” by forcing the model to pick a single, definitive probability distribution, even when the underlying information is ambiguous. This process, they suggest, discards crucial information about the model’s uncertainty at each layer, leading to overconfident predictions on fabricated content.

Introducing the Credal Transformer

To tackle this fundamental issue, the researchers introduce the Credal Transformer. This architecture replaces the standard attention mechanism with a novel Credal Attention Mechanism (CAM). Unlike traditional attention, CAM doesn’t produce a single attention vector. Instead, it generates a “credal set,” a convex set of plausible attention distributions. The size (volume) of this set directly quantifies the model’s epistemic uncertainty: essentially, what the model does not know.
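To make the idea concrete, here is a minimal toy sketch (not the paper’s construction) that represents a credal set by per-outcome lower/upper probability bounds and uses the total interval width as a crude proxy for the set’s size, i.e. the model’s epistemic uncertainty:

```python
import numpy as np

def credal_width(lower, upper):
    """Total width of per-outcome probability intervals.

    Toy proxy for the 'size' of a credal set: 0 means a single precise
    distribution; larger values mean a bigger set, i.e. more uncertainty.
    """
    lower, upper = np.asarray(lower, float), np.asarray(upper, float)
    assert np.all(lower <= upper)
    return float(np.sum(upper - lower))

# A precise belief: the set collapses to one distribution (width 0).
precise = credal_width([0.7, 0.2, 0.1], [0.7, 0.2, 0.1])
# An ambiguous belief: wide intervals, hence a large credal set.
ambiguous = credal_width([0.1, 0.1, 0.1], [0.8, 0.8, 0.8])
```

The interval representation here is only one convenient way to describe a convex set of distributions; the paper derives its credal sets from the attention mechanism itself, as described next.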

The Credal Attention Mechanism is grounded in evidential theory. It re-conceptualizes attention scores as “evidence masses” for a Dirichlet distribution. When there’s strong evidence, the distribution is sharp, much like standard attention. But when evidence is insufficient or conflicting, the distribution becomes diffuse, explicitly representing ambiguity or a lack of knowledge. This allows the model to inherently understand and express its own uncertainty.
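The evidential recipe above can be sketched in a few lines. This is a hedged illustration in the style of standard evidential deep learning, not the paper’s exact CAM formulation: raw attention scores are mapped to non-negative evidence (softplus is an assumed choice), evidence defines Dirichlet concentrations, and the total evidence yields both a point-estimate attention vector and a "vacuity" signal that is high when evidence is weak:

```python
import numpy as np

def credal_attention(scores):
    """Evidential attention sketch: scores -> Dirichlet over attention.

    Illustrative only; the exact mapping used by CAM may differ.
    """
    evidence = np.log1p(np.exp(scores))        # softplus: non-negative evidence mass
    alpha = evidence + 1.0                     # Dirichlet concentration parameters
    strength = alpha.sum(-1, keepdims=True)    # total evidence S
    expected_attn = alpha / strength           # Dirichlet mean: point-estimate attention
    vacuity = scores.shape[-1] / strength      # K/S: near 1 when evidence is scarce
    return expected_attn, vacuity

# Strong, unambiguous evidence -> sharp attention, low vacuity.
sharp, v_sharp = credal_attention(np.array([10.0, -10.0, -10.0]))
# Weak evidence everywhere -> diffuse attention, vacuity near 1.
diffuse, v_diffuse = credal_attention(np.array([-10.0, -10.0, -10.0]))
```

With strong evidence the Dirichlet mean behaves much like a standard softmax attention vector, while the vacuity term explicitly carries the "I don't know" signal that softmax normalization throws away.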

Key Capabilities and Benefits

The Credal Transformer has demonstrated several significant advantages:

  • Out-of-Distribution Detection: The model can effectively identify inputs that are outside its training data. It produces high-entropy outputs (indicating high uncertainty) for unfamiliar or nonsense data, unlike standard models that might confidently make incorrect predictions.
  • Ambiguity Quantification: For tasks with inherently ambiguous inputs, the Credal Transformer can quantify this ambiguity. Its larger credal sets reflect the model’s uncertainty about the correct interpretation, rather than forcing an arbitrary choice.
  • Reducing Confident Errors: In question-answering scenarios, especially with unanswerable questions, standard LLMs often generate confident, fabricated answers. The Credal Transformer can significantly reduce these errors by abstaining from prediction when it lacks sufficient evidence, a crucial feature for reliable AI systems.
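The abstention behaviour described above can be sketched as a simple threshold on the model’s uncertainty signal. This is an illustrative decision rule, not the paper’s procedure, and the threshold value is an assumption:

```python
import numpy as np

def answer_or_abstain(expected_probs, vacuity, threshold=0.5):
    """Return the most likely answer index, or None to abstain.

    Illustrative rule: abstain when uncertainty (vacuity) exceeds an
    assumed threshold; otherwise commit to the argmax answer.
    """
    if float(vacuity) > threshold:
        return None                       # insufficient evidence: abstain
    return int(np.argmax(expected_probs))

confident = answer_or_abstain([0.8, 0.1, 0.1], vacuity=0.2)   # answers
unsure = answer_or_abstain([0.4, 0.3, 0.3], vacuity=0.9)      # abstains
```

The key design point is that abstention is driven by a quantity the architecture itself produces, rather than by post-hoc calibration of softmax probabilities.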

Performance and Efficiency

A common concern with new architectural changes is computational overhead. However, the Credal Transformer shows promising results in this area. Benchmarks comparing CAM against standard Softmax-based attention reveal that the GFLOPs (a measure of computational complexity) are identical. The Credal Transformer incurs only a minimal overhead in inference time (around +4.4%) and training step time (around +11.6%). This suggests that the significant benefits in reliability and uncertainty awareness come with almost no compromise in computational efficiency, making it a practical alternative for developing more robust AI.


A Step Towards Trustworthy AI

In conclusion, the Credal Transformer represents a foundational step towards building more reliable and trustworthy AI systems. By integrating uncertainty quantification directly into the model’s architecture, it moves beyond treating LLMs as black boxes. This approach allows models to not only generate fluent text but also to understand and communicate their own limitations, paving the way for AI that is intrinsically aware of what it knows and what it doesn’t.

Karthik Mehta
https://blogs.edgentiq.com
Karthik Mehta is a data journalist known for his data-rich, insightful coverage of AI news and developments. Armed with a degree in Data Science from IIT Bombay and years of newsroom experience, Karthik merges storytelling with metrics to surface deeper narratives in AI-related events. His writing cuts through hype, revealing the real-world impact of Generative AI on industries, policy, and society. You can reach him at: [email protected]
