TLDR: A new research paper by Bill Cochran proposes a formal model demonstrating that the depth of a neural network directly limits its logical reasoning capabilities. Each layer adds only one level of logical reasoning, meaning a network of depth ‘n’ cannot faithfully represent predicates requiring ‘n+1’ levels of logic, such as complex counting. This inherent limitation explains phenomena like hallucination, repetition, and limited planning in large language models, suggesting these are not flaws but consequences of architectural constraints on logical expressiveness.
A new research paper titled “On the Limits of Hierarchically Embedded Logic in Classical Neural Networks” by Bill Cochran delves into the fundamental reasoning limitations of large neural network models, particularly those used for language. The paper proposes a formal model that grounds these limitations in the very depth of the neural architecture itself.
The core idea presented is that neural networks can be viewed as linear operators acting on a space of logical predicates. In this framework, each layer within a neural network can encode, at most, one additional level of logical reasoning. The paper defines logic classes, Lk, where L0 represents basic, atomic predicates, and Lk+1 is formed by building upon Lk through operations like quantification or composition.
A significant finding of the research is the proof that a neural network with a depth of ‘n’ layers cannot accurately represent predicates belonging to the Ln+1 class. This includes seemingly simple tasks like counting over complex predicates. This implies a strict upper bound on the logical expressiveness of these networks, directly tied to their architectural depth. This structural limitation creates a ‘null space’ during the processes of tokenization and embedding, effectively preventing higher-order predicates from being represented within the model.
This framework offers compelling explanations for several well-known phenomena observed in large language models, such as the tendency for ‘hallucination’ (generating factually incorrect but plausible-sounding information), repetition in output, and limitations in complex planning. These issues, the paper suggests, are not merely bugs but inherent consequences of the models’ architectural constraints on logical expressiveness.
The paper also touches upon how approximations to higher-order logic might still emerge within these models. It suggests that the network approximates complex logic by using a finite and fixed set of lower-order predicates. This approximation, often described as the “metaphor metaphor” in the paper, means that while certain aspects of a higher-order predicate can be preserved or evoked by a combination of lower-order logic, the result might be structurally useful but not logically precise, much like a metaphor.
The research builds upon classical approximation theory and existing work on logical expressivity and the representational power of neural networks. By formalizing semantic unit selection in language models using tensors, the paper provides a mathematical bound on logical expressiveness as a function of network depth. This work motivates further exploration into architectural extensions and new interpretability strategies for future language model development.
Also Read:
- Neuro-Symbolic AI Learns to Simplify Complex Formulas with Enhanced Efficiency
- Bridging Intent and Action: Integrating Reasoning and Action Models for Automated Service Composition
For a deeper dive into the technical details, you can read the full research paper available at arXiv:2507.20960.


