
Simulating Human Perspective Taking in AI: A Developmental Approach

TLDR: A research paper introduces an extension to the PerspAct system, integrating Large Language Models (LLMs) with Selman’s theory of developmental perspective taking. The study evaluates GPT’s ability to generate internal narratives reflecting different developmental stages (Egocentric and Differentiated) and assesses their influence on collaborative task performance. Key findings indicate that GPT can reliably produce developmentally consistent narratives, and higher developmental stages generally enhance collaborative effectiveness. Notably, LLMs tend to shift towards more advanced perspective-taking during interaction, suggesting that linguistic exchanges help refine internal representations. This work highlights the potential of integrating developmental theory into AI models for more socially intelligent agents.

Understanding how others see the world is a fundamental human skill, crucial for effective communication and teamwork. This ability, known as perspective taking, is not just about seeing things from a different angle; it also involves understanding another person’s thoughts, feelings, and intentions. As artificial intelligence (AI) systems become more integrated into our lives, equipping them with similar social intelligence is becoming increasingly important.

A recent research paper, Growing Perspectives: Modelling Embodied Perspective Taking and Inner Narrative Development Using Large Language Models, delves into this challenge by exploring how Large Language Models (LLMs) can simulate the developmental stages of perspective taking, drawing inspiration from established theories in developmental psychology.

Bridging AI and Developmental Psychology

The study builds upon an existing system called PerspAct, which combines the ‘Reason and Act’ (ReAct) paradigm with LLMs to help AI agents engage in collaborative tasks. The new research extends PerspAct by incorporating Selman’s theory of perspective taking, a well-known framework that outlines how humans develop this ability through distinct stages, from early egocentric views to more sophisticated forms of understanding others.

The researchers, including Sabrina Patania, Luca Annese, Anna Lambiase, Anita Pellegrini, Tom Foulsham, Azzurra Ruggeri, Silvia Rossi, Silvia Serino, and Dimitri Ognibene, aimed to answer two key questions: Can LLMs reliably generate internal narratives that reflect different developmental stages of perspective taking? And how do these developmentally-informed narratives impact an AI agent’s performance in collaborative tasks?

Selman’s Stages Simplified for AI

Selman’s theory describes five stages of perspective taking. For this study, the researchers simplified these into two core stages to focus on key developmental transitions:

  • Egocentric (Stage 0, before age 6): At this stage, an individual physically distinguishes themselves from others but assumes everyone thinks and sees things exactly as they do. The AI agent’s internal narrative would reflect only its own viewpoint.
  • Self-other, Differentiated (Stage 1, ages 6–8): Here, individuals begin to acknowledge that others might have different perspectives, often due to having access to different information. The AI agent’s narrative would show an emerging distinction between its own view and that of another.
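The two simplified stages can be thought of as two different instruction templates handed to the language model. The sketch below is our own illustration of that idea, not the paper's actual prompts; the wording of the templates and the function name are assumptions.

```python
# Illustrative sketch (not the paper's code): stage-conditioned prompt
# templates for eliciting child-like internal narratives from an LLM.
STAGE_PROMPTS = {
    "egocentric": (
        "Describe the scene as a child under 6 would: report only what YOU "
        "see, assuming everyone else sees exactly the same thing."
    ),
    "differentiated": (
        "Describe the scene as a 6-8 year old would: report what you see, "
        "and note which objects your partner may or may not see from their side."
    ),
}

def build_narrative_prompt(stage: str, scene_description: str) -> str:
    """Compose the stage instruction plus the scene description."""
    if stage not in STAGE_PROMPTS:
        raise ValueError(f"unknown stage: {stage}")
    return f"{STAGE_PROMPTS[stage]}\n\nScene: {scene_description}"
```

In a setup like this, only the instruction changes between conditions; the scene description stays identical, so any difference in the generated narrative is attributable to the simulated developmental stage.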

How the Study Was Conducted

The team used GPT-4o, a powerful LLM, to generate ‘child-like’ internal narratives for various scenarios. These narratives were designed to mimic either the Egocentric or Differentiated stage. For example, an Egocentric narrative might simply list what the AI sees, while a Differentiated one might add, “I see this, but you might not see that from your side.”

These narratives were then fed into the PerspAct system, which acted as a ‘matcher agent’ in an extended director task. In this task, the agent had to identify objects in collaboration with a ‘director’ whose visibility and perspective differed. The performance was evaluated based on task accuracy and efficiency (number of steps taken).
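Scoring a collaborative episode on accuracy and step count might look like the following sketch. The metric definitions here are our assumptions for illustration; the paper reports task accuracy and number of steps but does not publish this exact formula.

```python
# Hypothetical scoring sketch for the extended director task: the matcher's
# picks are compared against the director's intended targets, and efficiency
# penalises extra interaction steps. Metric names are illustrative.
def score_episode(picked: list[str], targets: list[str], steps: int) -> dict:
    correct = sum(p == t for p, t in zip(picked, targets))
    accuracy = correct / len(targets) if targets else 0.0
    # Fewer steps per target identified = more efficient collaboration.
    efficiency = len(targets) / steps if steps else 0.0
    return {"accuracy": accuracy, "efficiency": efficiency}
```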

Three main experimental conditions were used to test collaborative performance:

  • Blind: The AI received the child-like narrative without explicit information about the developmental stage it was meant to represent.
  • Informed: The AI received the narrative along with explicit information about the developmental stage.
  • Objective-Informed: The AI received a neutral, objective description of the scene, but was still told which developmental stage to simulate.
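The three conditions differ only in what information reaches the matcher agent. A minimal sketch of assembling that input, under our own assumed field names and tags, could be:

```python
# Sketch of composing the matcher agent's input under the three experimental
# conditions described above. The bracketed stage tag is an assumption.
def compose_input(condition: str, narrative: str,
                  objective_scene: str, stage: str) -> str:
    if condition == "blind":
        # Narrative only, no stage label.
        return narrative
    if condition == "informed":
        # Narrative plus an explicit stage label.
        return f"[stage: {stage}]\n{narrative}"
    if condition == "objective-informed":
        # Neutral scene description plus the stage to simulate.
        return f"[stage: {stage}]\n{objective_scene}"
    raise ValueError(f"unknown condition: {condition}")
```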

Key Findings: AI’s Evolving Perspective

The study yielded several interesting results:

  • Narrative Consistency: GPT-4o proved capable of reliably generating internal narratives that aligned with the specified developmental stages before any interaction took place.
  • Improved Collaboration with Higher Stages: Generally, narratives reflecting the Differentiated stage led to better collaborative performance and efficiency compared to the Egocentric stage. This aligns with human developmental patterns, where more advanced perspective taking enhances social interaction.
  • A Dynamic Shift in Interaction: Perhaps the most intriguing finding was a behavioral shift during interaction. While GPT could initially generate Egocentric narratives, once the AI started interacting with the ‘director’ and received linguistic requests, it often shifted towards more advanced, Differentiated reasoning. This suggests that language exchanges act as a trigger, helping the LLM refine its internal representations and become more socially attuned.
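Detecting such a shift requires classifying each narrative as Egocentric or Differentiated. A naive keyword heuristic, shown below purely as an illustration (the paper does not describe its classification method), would flag narratives that explicitly reference the partner's distinct viewpoint:

```python
# Naive heuristic sketch (our illustration, not the paper's method) for
# flagging Differentiated reasoning: look for explicit references to the
# partner's different viewpoint in the generated narrative.
OTHER_VIEW_MARKERS = (
    "you might not see", "from your side", "you can't see", "your view",
)

def looks_differentiated(narrative: str) -> bool:
    """Return True if the narrative mentions the partner's distinct view."""
    text = narrative.lower()
    return any(marker in text for marker in OTHER_VIEW_MARKERS)
```

Applied over a transcript, a classifier like this would show Egocentric narratives before interaction giving way to Differentiated ones once the director's requests arrive.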


Implications for AI Development

These findings highlight the significant potential of integrating developmental psychology theories into AI models. By explicitly considering how perspective taking evolves, researchers can design LLMs that are not only more accurate and efficient in collaborative tasks but also exhibit more nuanced and human-like social cognition. The study also points to the importance of evaluating an AI’s internal ‘thought process’ during dynamic interactions, not just its initial static descriptions.

While the research focused on the initial stages of perspective taking, it opens doors for future work to explore more complex developmental shifts and to refine evaluation methods for AI in interactive social contexts. Ultimately, by viewing LLMs through a developmental lens, we gain a powerful new framework for understanding and building more socially intelligent artificial agents.

Meera Iyer (https://blogs.edgentiq.com)
Meera Iyer is an AI news editor who blends journalistic rigor with storytelling elegance. Formerly a content strategist at a leading tech firm, Meera now tracks the pulse of India's Generative AI scene, from policy updates to academic breakthroughs. She is particularly focused on bringing nuanced, balanced perspectives to the fast-evolving world of AI-powered tools and media. You can reach her at: [email protected]
