AI Models Gain Human-Like Cognitive Control for Smarter, Faster Reasoning

TLDR: A new research paper introduces a method to give Large Reasoning Models (LRMs) dynamic control over their ‘thinking speed,’ similar to human System 1 (fast) and System 2 (slow) cognition. By identifying a ‘steering vector’ in the model’s internal representations and using real-time difficulty estimation, LRMs can now accelerate through easy tasks and delve deeper into complex ones. This ‘plug-and-play’ approach improves accuracy while significantly reducing computational overhead, making AI reasoning more efficient and adaptable.

Human intelligence often operates in two distinct modes: a quick, intuitive ‘System 1’ thinking and a slower, more deliberate ‘System 2’ thinking. While advanced Large Reasoning Models (LRMs) have shown remarkable capabilities in complex, System 2-like reasoning, this often comes with significant computational costs and delays. A new research paper introduces a groundbreaking approach to enable LRMs to dynamically adjust their thinking speed, mimicking human cognitive flexibility and optimizing the trade-off between accuracy and efficiency.

The core challenge addressed by the researchers was twofold: how to control the thinking speed within LRMs, and when to make these adjustments for optimal performance. Their innovative solution tackles both questions, offering a ‘plug-and-play’ method that requires no additional training or cost.

Controlling Thinking Speed: The Steering Vector

For the first challenge, the team identified a ‘steering vector’ within the LRM’s internal representation space. This vector acts like a dial, allowing the model to transition between fast and slow thinking. By subtly editing the model’s internal representations during inference, they achieved a novel ‘test-time scaling effect’. This means that without retraining the model, they could make it produce more concise, faster responses or engage in deeper, more deliberate analysis, depending on the desired outcome. This method significantly outperforms existing prompt-based techniques that try to influence thinking speed through explicit instructions.

When to Adjust: Real-Time Difficulty Estimation

To address the second challenge – knowing when to adjust thinking speed – the researchers developed a real-time difficulty estimation method. This mechanism allows the LRM to assess the complexity of a reasoning step as it’s being processed. Inspired by how humans quickly skim easy parts and focus on difficult ones, this system acts as a ‘traffic light’ for the model. When a complex segment is detected, the model is signaled to slow down and engage in deeper analysis. Conversely, for straightforward parts, it can accelerate its processing.

Also Read:

The Adaptive Reasoning Strategy

By combining the steering vector with real-time difficulty estimation, the researchers propose the first dynamic reasoning strategy for LRMs. This strategy enables fast processing for easy steps and more profound analysis for complex reasoning segments within a single reasoning trace. The results are impressive: the method yields an average +1.3% accuracy while reducing token usage by -8.6% across leading LRMs and advanced reasoning benchmarks. This means models can achieve better results using fewer computational resources.

The algorithms developed are implemented based on the vLLM framework, making them easily integrable into existing Large Language Model deployment systems. This work not only enhances current LRM capabilities but also opens new avenues for future research in creating more efficient and intelligent AI systems. For more in-depth technical details, you can refer to the full research paper: Controlling Thinking Speed in Reasoning Models.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

AI Models Gain Human-Like Cognitive Control for Smarter, Faster Reasoning

Controlling Thinking Speed: The Steering Vector

When to Adjust: Real-Time Difficulty Estimation

The Adaptive Reasoning Strategy

Gen AI News and Updates

Jorie AI Unveils SmartCore Engine: Revolutionizing Healthcare Intelligence and Automation

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates