Advancing Digital Avatars: A New Method for Realistic Gaze Redirection

TLDR: Researchers introduce a novel 3D gaze redirection framework that uses an explicit 3D eyeball structure with 3D Gaussian Splatting to generate photorealistic images with accurate and controllable gaze. The method allows for explicit eyeball rotation and includes an adaptive deformation module for subtle muscle movements around the eyes, enabling independent eye control and outperforming previous state-of-the-art techniques in image quality and gaze accuracy.

A new research paper titled “Roll Your Eyes: Gaze Redirection via Explicit 3D Eyeball Rotation” introduces a groundbreaking approach to manipulating eye gaze in digital images, promising more realistic and controllable virtual interactions. This work, by YoungChan Choi, HengFei Wang, YiHua Cheng, Boeun Kim, Hyung Jin Chang, YoungGeun Choi, and Sang-Il Choi, addresses a significant challenge in computer vision and graphics: creating digital avatars that can express natural and accurate gaze.

Traditional methods for gaze redirection often rely on neural radiance fields (NeRFs), which use implicit neural representations. While effective, these approaches struggle with explicitly modeling the 3D structure of the eyeball, making precise rotation and translation difficult. This limitation can lead to less realistic eye movements in digital representations.

The core innovation of this new framework is its use of an explicit 3D eyeball structure, integrated with 3D Gaussian Splatting (3DGS). Unlike implicit methods, 3DGS provides a clear 3D representation, making it ideal for modeling the eyeball. By physically rotating and translating this dedicated 3D eyeball structure, the system can generate photorealistic images that accurately reflect a desired gaze direction.

Beyond just rotating the eyeball, the researchers also developed an adaptive deformation module. This module is crucial for replicating the subtle muscle movements around the eyes that naturally occur when a person changes their gaze. This attention to detail significantly enhances the realism of the generated faces, capturing nuances that are often missed by other methods.

A notable feature of this framework is its ability to control the left and right eyes independently. This opens up possibilities for synthesizing complex and even unconventional gaze patterns, such as cross-eyed looks, which are challenging to capture or simulate in real-world scenarios. This fine-grained control expands the creative potential for gaze synthesis in various applications.

The effectiveness of the framework was demonstrated through experiments on the ETH-XGaze dataset. The results show that the proposed method outperforms existing state-of-the-art techniques in terms of image quality and gaze estimation accuracy. It also maintains strong identity preservation, meaning the generated images look like the original subject, just with a different gaze. The system is also built upon 3DGS, which allows for higher detail preservation and faster rendering speeds compared to NeRF-based approaches.

Ablation studies confirmed the importance of the newly introduced loss functions and the gaze-guided deformation field. These components were shown to be critical for ensuring that the eyeball and facial representations are disentangled and that subtle muscle movements around the eyes are accurately captured, preventing visual distortions and improving overall realism.

Also Read:

This novel gaze redirection framework marks a significant step forward in creating more lifelike and expressive digital avatars. By enabling precise and independent control over eye movements and capturing the subtle surrounding muscle deformations, this technology has vast potential for applications in virtual reality, augmented reality, remote conferencing, and the development of highly immersive digital avatars. You can read the full research paper here.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Advancing Digital Avatars: A New Method for Realistic Gaze Redirection

Gen AI News and Updates

Iris Bolsters Leadership with New Innovation, AI, and Technology Director Amidst Senior Hires

Enhancing Text Legibility in AI-Generated Videos with Synthetic Data

Tailoring Image Edits: A Collaborative Approach to User Preferences in AI

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates