
Genetic Denoising: Accelerating Robotic Control with Minimal Steps

TLDR: A new research paper introduces the Genetic Diffusion Policy (GDP), a novel approach that significantly speeds up robotic manipulation tasks by tailoring diffusion model inference to the unique characteristics of embodied AI. GDP uses a population-based sampling strategy to filter denoising trajectories, reducing out-of-distribution risks and improving performance with as few as two neural function evaluations, outperforming traditional diffusion policies by up to 20%.

Diffusion models have emerged as a powerful tool in the field of robotic manipulation, allowing robots to learn complex tasks by observing expert demonstrations. These models, initially developed for generating images and videos, have shown great promise in enabling robots to perform intricate actions. However, a significant challenge remains: the speed at which these models can generate actions. Traditional diffusion models require many sequential steps to produce high-quality outputs, leading to latency that can hinder real-time robotic applications.

Researchers at Huawei Technologies Canada have tackled this problem head-on, proposing a novel approach that significantly accelerates diffusion policies without compromising performance. Their work, detailed in the paper “Two-Steps Diffusion Policy for Robotic Manipulation via Genetic Denoising”, highlights that directly applying inference strategies from vision tasks to robotics is not optimal. Robotic tasks, characterized by structured and low-dimensional action distributions, require a more tailored approach.

The Challenge of Slow Inference and Out-of-Distribution States

The core issue with existing diffusion policies in robotics is their computational expense during inference. Many denoising steps are needed to generate actions, which is too slow for robots operating in dynamic environments. A key finding by the researchers is that a common heuristic called “clipping,” used to constrain predictions, inadvertently creates “out-of-distribution” (OoD) intermediate states. These states cause the model to make larger errors and waste computational effort before returning to relevant action spaces. Counter-intuitively, they found that reducing the injected noise during denoising, a practice that degrades image generation, actually improves performance in robotic tasks due to the simpler, low-dimensional nature of robot action spaces.
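To make the clipping heuristic and the reduced-noise finding concrete, here is a minimal single-step sketch of a DDIM-style reverse update. This is an illustration, not the authors' code: the function name, the `noise_scale` knob, and the `(-1, 1)` action range are all assumptions for the sake of the example.

```python
import numpy as np

def ddim_step(x_t, eps_pred, abar_t, abar_prev, sigma,
              noise_scale=1.0, clip_range=(-1.0, 1.0), rng=None):
    """One illustrative DDIM-style denoising step for an action sample."""
    # Estimate the clean action x0 from the current noisy sample.
    x0_hat = (x_t - np.sqrt(1.0 - abar_t) * eps_pred) / np.sqrt(abar_t)
    # The "clipping" heuristic: constrain the x0 estimate to the valid
    # action range. The paper finds this can push intermediate states
    # out of distribution when the estimate lands far outside the range.
    x0_hat = np.clip(x0_hat, *clip_range)
    rng = rng or np.random.default_rng(0)
    noise = rng.standard_normal(x_t.shape)
    # Standard DDIM-style update; setting noise_scale < 1 reduces the
    # injected noise, which the authors report helps in simple,
    # low-dimensional robot action spaces (unlike in image generation).
    return (np.sqrt(abar_prev) * x0_hat
            + np.sqrt(max(1.0 - abar_prev - sigma**2, 0.0)) * eps_pred
            + noise_scale * sigma * noise)
```

With `noise_scale=0.0` the step becomes deterministic, which is one way to see why fewer, less noisy steps can suffice when the target action distribution is structured and low-dimensional.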

Introducing Genetic Diffusion Policy (GDP)

To overcome these limitations, the team introduced the Genetic Diffusion Policy (GDP). This innovative sampling strategy enhances both the performance and stability of robotic manipulation by adapting the denoising process specifically for embodied AI tasks. GDP employs a population-based selection mechanism, similar to genetic algorithms, to filter denoising trajectories. It works by generating a population of potential action samples. Before each denoising step, a “fitness score” is calculated for each sample, measuring how “in-distribution” it is. Samples with better scores are favored and duplicated, while less suitable ones are discarded. This process ensures that the denoising trajectory stays within a high-density region of valid actions, reducing clipping artifacts and improving the quality of the generated actions, especially when very few denoising steps are used.
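The selection loop described above can be sketched roughly as follows. This is a toy illustration in the spirit of GDP, not the paper's implementation: `eps_model` and `fitness` are assumed stand-ins for the learned noise predictor and the in-distribution score, and the simple Euler-style update replaces whatever sampler the authors actually use.

```python
import numpy as np

def genetic_denoise(eps_model, fitness, x_T_population, sigmas, rng=None):
    """Population-based denoising sketch: select before each step."""
    rng = rng or np.random.default_rng(0)
    pop = np.array(x_T_population, dtype=float)
    n = len(pop)
    for sigma, sigma_next in zip(sigmas[:-1], sigmas[1:]):
        # Score each candidate: higher fitness means more in-distribution.
        scores = np.array([fitness(x) for x in pop])
        # Selection: resample in proportion to fitness, duplicating
        # strong candidates and discarding weak ones, so trajectories
        # stay in a high-density region of valid actions.
        probs = scores / scores.sum()
        pop = pop[rng.choice(n, size=n, p=probs)]
        # One denoising step per survivor (simple Euler-style update).
        for i in range(n):
            pop[i] = pop[i] + (sigma_next - sigma) * eps_model(pop[i], sigma)
    # Execute the best remaining candidate.
    scores = np.array([fitness(x) for x in pop])
    return pop[int(np.argmax(scores))]
```

Because selection happens before every step, a bad trajectory is pruned early instead of consuming the few denoising steps the policy has, which is what makes the two-NFE regime workable.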

Remarkable Results with Fewer Steps

The effectiveness of GDP was rigorously tested across 14 challenging robotic manipulation tasks from D4RL and Robomimic benchmarks, including complex tasks like manipulating a pen, relocating a ball, hammering a nail, and opening a door. The results were striking: GDP consistently outperformed standard diffusion-based policies, achieving up to 20% performance gains while requiring significantly fewer inference steps. For instance, on Adroit Hand tasks, GDP achieved a 100% success rate with just two neural function evaluations (NFE), a dramatic improvement over baselines that often required 100 steps or more to achieve lower success rates.

This breakthrough means that robots can now generate actions much faster and more reliably, making diffusion policies more practical for real-world applications where quick and accurate decision-making is crucial. The research underscores the importance of tailoring AI techniques to the specific characteristics of the domain, rather than blindly transferring methods from one area to another.


Looking Ahead

While the Genetic Diffusion Policy shows immense promise, the researchers acknowledge that their genetic algorithm is still in its simplest form, without advanced features like cross-breeding or sophisticated mutations. Future work will explore more complex metaheuristics and further theoretical analysis. However, this initial success paves the way for a new generation of highly efficient and robust robotic control systems, capable of performing complex tasks with unprecedented speed and accuracy.

Meera Iyer (https://blogs.edgentiq.com)
Meera Iyer is an AI news editor who blends journalistic rigor with storytelling elegance. Formerly a content strategist at a leading tech firm, Meera now tracks the pulse of India's Generative AI scene, from policy updates to academic breakthroughs. She is particularly focused on bringing nuanced, balanced perspectives to the fast-evolving world of AI-powered tools and media. You can reach her at: [email protected]
