TLDR: “Curved Inference” is a new framework that analyzes how Large Language Models (LLMs) process information by tracking the geometric path of their internal “residual stream.” It measures how this path “bends” (curvature) and “moves” (salience) in response to semantic changes in prompts, such as emotional or moral shifts. The research found that concern-shifted prompts reliably alter these internal trajectories, with LLaMA showing stronger and more consistent responses than Gemma. This geometric approach offers insights into LLM decision-making, potential for real-time alignment monitoring, and understanding how models navigate meaning.
Large Language Models (LLMs) are incredibly powerful, but understanding how they make decisions remains a significant challenge. Traditional methods often look at isolated parts of the model. However, new research introduces a fascinating concept called “Curved Inference,” which offers a fresh perspective by examining the geometric path of information inside these models.
The paper, titled “Curved Inference: Concern-Sensitive Geometry in Large Language Model Residual Streams” by Rob Manson, proposes a framework to track how the internal “residual stream” of an LLM bends and shifts when the model encounters changes in semantic concern. Think of the residual stream as the model’s evolving internal thought process as it processes text, layer by layer.
The core idea is that when a prompt carries a heightened “concern” – whether it’s emotional tone, moral framing, or a logical shift – the model doesn’t just change its output; it actually bends its internal trajectory. This bending is a measurable deformation in the path of how token representations move through the model’s layers. The researchers focused on two key measurements: curvature (how sharply the path bends) and salience (how much the internal state moves between layers).
To make sure these measurements truly reflect meaningful changes, the team used a special “semantic metric” derived from the model’s unembedding matrix. This ensures that all measurements are aligned with how the model understands and predicts tokens, rather than just raw numerical coordinates.
The study analyzed two popular LLMs, Gemma3-1b and LLaMA3.2-3b, using 20 different prompt sets designed to shift semantic concern across various domains like emotional, moral, logical, and environmental topics. They compared how the models processed neutral prompts versus “concern-shifted” variants.
Key Findings from Curved Inference
The research revealed several consistent and interpretable patterns:
-
Curvature is inherent: Even neutral prompts cause the residual stream to bend. It’s not a straight line, but a path that meaningfully responds to the input’s semantic and structural features.
-
Concern initiates bending: When a concern-shifted token is introduced, the model’s internal trajectory visibly inflects, and these changes spread across subsequent layers.
-
Strength matters: Stronger concerns lead to more pronounced bending. The same internal positions bend, but they bend harder under increased semantic pressure, confirming the signal is semantically grounded.
-
Salience complements curvature: The magnitude of internal movement (salience) also intensifies around the same tokens and layers where bending occurs. This suggests that concern-shifted prompts reallocate the model’s “semantic effort,” emphasizing new internal paths.
A notable difference was observed between the two models: LLaMA consistently showed statistically significant increases in both curvature and salience as concern intensity grew. Gemma also responded but showed weaker differentiation between moderate and strong concerns. LLaMA tended to exhibit early, high-magnitude curvature that persisted through mid-depth layers, indicating a fast and distributed integration of concern information. Gemma, in contrast, showed shallower, more localized bending.
The study also highlighted a “representational trade-off” in LLaMA: when token trajectories were highly curved, they tended to be shorter in total length, and vice versa. This suggests an internal efficiency mechanism, balancing semantic effort with precision.
Also Read:
- Decoding Chain-of-Thought: Information Flow in Language Models
- Decoding Persona Influence in Language Model Reasoning
Implications for Understanding LLMs
This work supports a “two-layer view” of LLM geometry: a latent conceptual structure encoded in the embedding space (the model’s static knowledge) and a contextual trajectory shaped by prompt-specific inference (how it applies that knowledge dynamically). Curved Inference provides a way to link these layers, showing how latent semantic potential is bent or redirected by context.
The findings have significant implications for interpretability research. By focusing on the residual stream’s trajectory, this approach offers a continuous, geometry-grounded signal that complements traditional discrete methods. For instance, sharp spikes in curvature in response to certain concern-shifted prompts could serve as real-time “alignment monitors,” potentially flagging risky completions before they manifest in the output.
The contrast between Gemma and LLaMA also offers insights for architecture design, suggesting that larger models might defer semantic reorientation to later stages, while smaller models might benefit from architectural constraints to avoid premature bending.
This research marks a shift in interpretability, moving from static probes to dynamic trajectory analysis. It provides both a map of the model’s internal semantic geometry and a diagnostic tool, revealing what the model finds meaningful and how it adapts to meet it. For more technical details, you can read the full research paper here.


