TLDR: This paper argues that Large Language Model (LLM) hallucinations, often seen as defects, are an inevitable structural feature when operating under an ‘Open World assumption’ where environments are unbounded. By reframing hallucination as a manifestation of the generalization problem, the author distinguishes between corrigible ‘Type-I’ errors (false memorization) and inescapable ‘Type-II’ errors (false generalization). The paper suggests that instead of eliminating hallucinations, we should tolerate them as a byproduct of intelligence, design adaptive systems, and make errors intelligible to humans, especially for Artificial General Intelligence (AGI).
Large Language Models (LLMs) have become incredibly powerful tools, capable of generating human-like text for a vast array of applications, from information retrieval to scientific discovery. However, a persistent challenge with these advanced AI systems is the phenomenon known as “hallucination” – where LLMs produce outputs that are fluent and convincing, yet factually inaccurate or entirely fabricated.
Traditionally, hallucination has often been viewed as a defect, something to be minimized or eliminated through better training data, improved model architectures, or more sophisticated alignment procedures. However, a recent research paper titled “Hallucination is Inevitable for LLMs with the Open World Assumption” by Bowen Xu, offers a fresh perspective, arguing that under certain realistic conditions, hallucination isn’t just a bug, but an unavoidable structural feature of intelligence itself.
Understanding Hallucination: A Generalization Problem
The paper reframes hallucination not merely as an error, but as a manifestation of the fundamental generalization problem in machine learning. In essence, an LLM learns from a finite set of training data and then must apply that learned knowledge to new, unseen situations. This process of generalization is crucial for intelligence, but it’s also inherently fallible.
The distinction between two key assumptions is central to this argument: the “Closed World assumption” and the “Open World assumption.” Under the Closed World assumption, it’s believed that the training data adequately represents all possible future scenarios. In such a controlled environment, hallucinations might be significantly reduced, perhaps even to statistically negligible levels, with enough data and refined algorithms.
However, the paper argues for the adoption of the “Open World assumption,” which is more faithful to real-world intelligence and the ambitions of Artificial General Intelligence (AGI). The Open World assumption posits that intelligent systems must operate in an unbounded environment, facing limitless space, time, and tasks. In this setting, past experience cannot guarantee future accuracy. Just as humans make mistakes when generalizing from limited experience, LLMs operating in an open world cannot entirely avoid generating incorrect or fabricated information.
Two Types of Hallucination
The research distinguishes between two types of hallucination:
- Type-I Hallucination (HT-I): This arises from “false memorization.” It occurs when the model produces an answer that contradicts facts already present in its training data. These errors are, in principle, correctable. If the system knows the correct fact, it can be retrained or adjusted to align its output with that known information.
- Type-II Hallucination (HT-II): This is the more profound type, stemming from “false generalization” to cases not present in the training set. Under the Open World assumption, where the future is not guaranteed to align with the past, this form of hallucination becomes inescapable. No finite past experience can perfectly predict or guarantee correctness in an infinite, unpredictable future. Simply answering “I don’t know” isn’t a solution either, as it would prevent the system from generalizing and providing useful insights beyond its memorized data.
The paper draws parallels to the “No Free Lunch” theorem in machine learning, which states that no single optimization algorithm is universally superior across all possible problem distributions. Similarly, without assumptions about the future test set, perfect generalization is impossible. The paper also references Hume’s observation about the inherent limits of induction, famously illustrated by Russell’s “inductive turkey” – a turkey fed daily expects the pattern to continue, until it’s slaughtered.
Also Read:
- A New Framework for Classifying Language Model Hallucinations
- Unraveling LLM Hallucinations: A Framework for Tracing Semantic Failures
Coping with the Inevitable
Given that Type-II hallucination is deemed inevitable under open-world conditions, the paper suggests a shift in approach from complete elimination to constructive management. The proposed treatments include:
Tolerating Hallucination: Recognizing that hallucinations are a structural byproduct of deep learning, much like cognitive errors or optical illusions are part of human perception. Dismissing them as entirely “wrong” might impose an overly anthropocentric standard.
Maintaining Adaptivity: Since past experience is always limited, intelligent systems should be designed to continuously revise their internal models, update beliefs, and incorporate new information as it becomes available, thereby remaining adaptive in the face of insufficient knowledge.
Making Errors Intelligible and Acceptable: If errors cannot be eliminated, they should at least be understandable from a human perspective. This might involve adopting more transparent representational schemes, such as logical or concept-centered representations, that align more closely with human reasoning, making even erroneous outputs seem “reasonable” within a shared framework.
In conclusion, while efforts to reduce hallucinations in constrained environments remain valuable, for the pursuit of Artificial General Intelligence, the Open World assumption must be embraced. Hallucination, in this light, is not merely a defect to be engineered away, but a fundamental byproduct of intelligence operating in an unbounded reality. For more details, you can read the full research paper here.


