TL;DR: The paper introduces a hierarchical learning framework in which a Graph Convolutional Network (GCN) learns maze navigation paths (first-order learning) and an MLP controller adapts the GCN’s parameters to new maze structures (second-order learning). The experiments show that this second-order adaptation promotes the emergence of internal “mental maps” that are structurally similar (isomorphic) to the environment, yielding large accuracy gains and robust generalization on unseen and varied maze tasks, including value prediction.
In the quest to build more intelligent and adaptive artificial systems, understanding how advanced cognition emerges is paramount. A recent research paper, Hierarchical Learning for Maze Navigation: Emergence of Mental Representations via Second-Order Learning, delves into this complex area, proposing a novel framework that allows AI to develop internal ‘mental maps’ of its environment, much like humans and animals do. This work, by Shalima Binta Manir and Tim Oates, explores how a concept called ‘second-order learning’ can drive the formation of these crucial cognitive structures.
Understanding Mental Maps in AI
At the heart of advanced cognition is the idea of ‘mental representation’ – internal models that mirror the external world. Think of how you navigate a familiar building; you don’t just react to immediate sensory input, but rather use an internal map of the layout. Historically, theories like Tolman’s cognitive maps have highlighted the importance of these internal structures for flexible navigation and problem-solving. More recently, theoretical models have suggested that these representations might emerge from ‘second-order learning’ – a sophisticated form of learning where the system learns how to adapt its primary learning mechanisms.
While the theory is compelling, direct empirical evidence for how second-order learning leads to these environment-aligned mental representations has been limited, especially in practical AI implementations. Many existing computational approaches rely on computationally intensive evolutionary algorithms, which can be difficult to interpret.
The Hierarchical Learning Approach
To address this gap, Manir and Oates introduce a hierarchical learning framework. This system comprises two main components:
- First-Order Learner: A Graph Convolutional Network (GCN) that directly learns to predict optimal paths in a maze. It takes in information about the maze’s nodes (like coordinates and connectivity) and outputs whether a node is part of the shortest path.
- Second-Order Learner: An MLP (Multi-Layer Perceptron) controller that dynamically adjusts the GCN’s parameters. When the GCN encounters a maze structure it hasn’t seen before, the MLP adapts the GCN’s weights, essentially teaching the GCN ‘how to learn’ more effectively in novel situations.
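The two-level structure described above can be sketched in plain Python. The sketch below is an illustrative assumption, not the authors’ implementation: it uses a single graph-convolution layer (normalized adjacency with self-loops) as the first-order learner, and a tiny MLP controller that maps a summary of the new maze to a multiplicative rescaling of the GCN’s weight matrix. All variable names, sizes, and the modulation scheme are hypothetical.

```python
import math

# --- tiny linear-algebra helpers (lists of lists, no dependencies) ---
def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def relu(M):
    return [[max(0.0, v) for v in row] for row in M]

# --- first-order learner: one graph-convolution layer ---
# H = ReLU(A_hat @ X @ W), with A_hat the adjacency plus self-loops,
# symmetrically normalized by node degree (the standard GCN propagation).
def normalize_adjacency(A):
    n = len(A)
    A_hat = [[A[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in A_hat]
    return [[A_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]

def gcn_layer(A_hat, X, W):
    return relu(matmul(matmul(A_hat, X), W))

# --- second-order learner: MLP controller (illustrative scheme) ---
# Maps a context vector summarizing the new maze to per-column scale
# factors for W, i.e. the controller adapts the GCN's parameters.
def mlp_controller(context, W1, W2):
    h = [max(0.0, sum(c * w for c, w in zip(context, col))) for col in zip(*W1)]
    return [1.0 + sum(h_i * w for h_i, w in zip(h, col)) for col in zip(*W2)]

def adapt(W, scales):
    # rescale column j of W by scales[j]
    return [[w * s for w, s in zip(row, scales)] for row in W]

# --- demo on a 4-node path graph 0-1-2-3 ---
A = [[0, 1, 0, 0], [1, 0, 1, 0], [0, 1, 0, 1], [0, 0, 1, 0]]
A_hat = normalize_adjacency(A)
X = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0], [3.0, 0.0]]  # node (x, y) coordinates
W = [[0.5, -0.2], [0.1, 0.3]]

H_before = gcn_layer(A_hat, X, W)

# a changed maze yields a context vector; the controller rescales W
context = [1.0, 0.0]                       # e.g. "an edge was blocked" summary
W1 = [[0.3, -0.1], [0.2, 0.4]]
W2 = [[0.2, 0.0], [0.0, 0.2]]
H_after = gcn_layer(A_hat, X, adapt(W, mlp_controller(context, W1, W2)))
print(len(H_after), len(H_after[0]))       # embeddings keep shape: 4 nodes x 2 dims
```

The point of the sketch is the division of labor: the GCN’s forward pass never changes, while the controller rewrites the GCN’s weights in response to structural change.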
The core idea is that this second-order adaptation, driven by the MLP, encourages the GCN to develop an internal mental map that is ‘isomorphic’ – structurally similar – to the actual maze environment. This means that the internal representation preserves the spatial relationships and topology of the maze.
Evidence of Internal Maps
The researchers conducted several experiments to validate their hypothesis. In one key experiment, they trained the GCN on standard mazes and then tested its ability to adapt to mazes with randomly blocked paths. The ‘Adapted GCN’ (with the MLP controller) achieved significantly higher accuracy (93.6%) compared to the ‘Unadapted GCN’ (61%), demonstrating the power of second-order learning in handling structural changes.
To visualize the internal representations, they used t-SNE, a technique that projects high-dimensional data into two dimensions for inspection. These visualizations revealed clear evidence of an emergent mental map:
- Nodes that were spatially close in the maze appeared close in the AI’s internal representation.
- The system developed distinct internal clusters for nodes with different functional roles, such as nodes on the optimal path versus nodes off it.
- Quantitative measures, like Pearson and Spearman correlation coefficients (both over 0.9), confirmed a strong alignment between the internal latent space and the actual maze geometry, indicating that the AI had indeed constructed an isomorphic mental map.
Generalization and the Importance of Structure
Further experiments highlighted the robustness and importance of this approach:
- Generalization Across Sizes: A GCN trained on smaller 8×8 mazes showed remarkable ability to generalize its spatial understanding to much larger mazes (10×10, 12×12, 14×14), maintaining high correlation with true grid distances. This suggests the AI learns general spatial principles, not just specific maze layouts.
- Impact of Non-Isomorphic Features: When mazes were given non-spatial, random features instead of meaningful spatial coordinates, the AI’s performance drastically declined. This crucial finding underscores that the effectiveness of second-order learning relies heavily on the presence of a meaningful, isomorphic relationship between the environment’s features and the AI’s internal representation. Without this structural alignment, adaptation fails.
- Value Adaptation: The framework was also successfully applied to value prediction tasks, where the AI had to predict cumulative rewards in dynamic environments. The adapted GCN significantly improved value prediction accuracy and policy alignment, further confirming the broad utility of second-order learning.
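To make the value-prediction target concrete, here is a small sketch (my own construction, not the paper’s setup): ground-truth state values are taken as discounted distances to the goal, V(s) = γ^d(s, goal), computed by BFS on the maze graph. Blocking a corridor lengthens shortest paths and thereby shifts the value targets a learner must adapt to, which is the kind of structural change the MLP controller handles.

```python
from collections import deque

GAMMA = 0.9  # discount factor (illustrative choice)

# Shortest-path distances from every node to the goal via BFS.
def bfs_dist(adj, goal):
    dist = {goal: 0}
    q = deque([goal])
    while q:
        u = q.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                q.append(v)
    return dist

# V(s) = GAMMA ** dist(s, goal): the discounted return for
# following a shortest path to the goal.
def values(adj, goal):
    return {s: GAMMA ** d for s, d in bfs_dist(adj, goal).items()}

# 2x3 grid maze: nodes are (row, col), corridors connect 4-neighbours.
nodes = [(r, c) for r in range(2) for c in range(3)]
adj = {s: [] for s in nodes}
for (r, c) in nodes:
    for (dr, dc) in ((0, 1), (1, 0)):
        t = (r + dr, c + dc)
        if t in adj:
            adj[(r, c)].append(t)
            adj[t].append((r, c))

goal = (0, 2)
v_open = values(adj, goal)

# Block the corridor (0,1)-(0,2): the forced detour lowers upstream values.
adj[(0, 1)].remove((0, 2))
adj[(0, 2)].remove((0, 1))
v_blocked = values(adj, goal)

print(v_open[(0, 0)], v_blocked[(0, 0)])  # value at (0,0) drops after blocking
```

In this toy maze the start node’s distance to the goal grows from 2 to 4 after the block, so its value drops from γ² to γ⁴; an unadapted predictor would keep reporting the stale γ² value.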
Conclusion: A Path to More Adaptive AI
This research provides compelling empirical validation for the theory that second-order learning can induce the emergence of structured internal cognitive maps. By combining GCNs for first-order learning and an MLP controller for second-order adaptation, the system demonstrates superior performance and robust generalization in dynamic, graph-structured environments like mazes. The findings suggest that for AI to achieve flexible and robust reasoning, its capacity to learn how to learn (second-order learning) fundamentally relies on the formation of isomorphic mappings during its primary learning process. This work offers valuable insights into the computational mechanisms underlying mental representation and paves the way for more adaptive and intelligent artificial systems.