TLDR: CAMA is a two-stage framework that enhances Large Language Models’ mathematical reasoning by integrating causal knowledge. It constructs a Mathematical Causal Graph (MCG) representing solution strategies and their dependencies, then uses this graph to guide LLMs in solving complex math problems, leading to significant performance improvements over traditional methods.
Large Language Models, or LLMs, have shown impressive capabilities across many language-related tasks, from answering questions to generating code. However, when faced with complex mathematical problems, these powerful AIs often hit a wall. This struggle stems from their inherent design, which can limit deep, multi-step logical inferences and make them sensitive to slight changes in how a problem is phrased.
To tackle this challenge, researchers have introduced a new framework called CAusal MAthematician, or CAMA. This innovative approach aims to equip LLMs with explicit, reusable mathematical structures, moving beyond purely data-driven predictions to more guided, structured reasoning.
What is a Mathematical Causal Graph (MCG)?
At the heart of CAMA is the Mathematical Causal Graph (MCG). Imagine a map of mathematical knowledge where each point is a concept, like “Area of a circle” or “Volume of a cylinder.” The lines connecting these points aren’t just associations; they’re directed arrows showing causal dependencies. For instance, an arrow from “Area of a circle” to “Volume of a cylinder” means you typically need to understand how to calculate the area of a circle before you can compute the volume of a cylinder. This graph stores general problem-solving strategies and the correct order in which knowledge points should be applied.
How CAMA Works: A Two-Stage Process
CAMA operates in two main stages:
1. The Learning Stage
In this initial phase, CAMA builds and refines the MCG. It starts by taking pairs of mathematical questions and their detailed solutions. If solutions aren’t available, an LLM can generate them. From these solutions, CAMA extracts key knowledge points and then uses causal discovery algorithms to figure out the dependencies between them, forming an initial MCG. This graph is then iteratively refined. CAMA feeds the LLM questions and observes its accuracy. Based on this feedback, the graph’s connections are adjusted to better align with how the LLM needs to reason for accurate answers. Edges that lead to correct reasoning are strengthened, while those linked to errors are revised.
2. The Reasoning Stage
Once the MCG is optimized, it’s ready to guide the LLM in solving new, unseen mathematical problems. When a new question comes in, the LLM first generates a preliminary thought process. CAMA then uses this thought process, along with the question and the full MCG, to extract a smaller, task-relevant subgraph. This subgraph contains only the most pertinent knowledge points and their causal relationships for that specific problem. Finally, this structured guidance, verbalized into natural language (e.g., “Area of a circle is a prerequisite for Volume of a cylinder”), is injected back into the LLM to guide its reasoning and produce the final answer.
Key Findings and Benefits
Experiments on challenging mathematical benchmarks like AIME, Omni-MATH, and OlympiadBench have shown that CAMA significantly improves LLM performance. The research highlights several important points:
- Structured guidance from the MCG is more effective than simply providing raw text prompts.
- Capturing asymmetric (directed) causal relationships in the graph leads to greater improvements than using only symmetric associations.
- The iterative alignment step, where the graph is refined based on LLM feedback, is crucial for adapting the MCG to the LLM’s reasoning needs.
For example, in a specific AIME problem, CAMA guided the LLM to correctly apply modular arithmetic and the Chinese Remainder Theorem, leading to the right solution. Without this causal guidance, the LLM resorted to less efficient methods and produced an incorrect answer.
Also Read:
- RL-PLUS: A New Approach to Expand LLM Reasoning Capabilities Beyond Current Limits
- Adaptive Reasoning for Large Language Models: The SynAdapt Approach
Looking Ahead
While CAMA represents a significant leap forward, the researchers acknowledge areas for future improvement. The effectiveness of CAMA currently depends on a “granularity” parameter, which controls how detailed the extracted knowledge points are. Finding the optimal value for this parameter can be tricky, as what works best for training data might not generalize perfectly to new problems. Future work could explore ways for the LLM itself to help select this parameter automatically. Additionally, the MCG remains static during the reasoning process; enabling it to dynamically update or even discover new knowledge points on the fly could further enhance its capabilities.
CAMA offers a promising direction for making Large Language Models more robust and reliable in complex mathematical reasoning by integrating explicit causal knowledge. You can read the full research paper here.


