
MADS: A Multi-Agent System for Generating Persuasive Dialogue Data

TLDR: MADS (Multi-Agent Dialogue Simulation) is a framework that uses three coordinated AI agents (User, Dialog, and Optimization) to generate diverse and persuasive multi-turn dialogues. It addresses challenges like lack of user data and prompt inefficiency by simulating user behaviors and self-optimizing dialogue strategies. MADS employs a Chain-of-Attitude model to track user attitude changes and has shown significant improvements in persuasive performance for small language models and real-world business metrics, such as increasing conversion rates by 22.4% in a marketing scenario.

In the rapidly evolving landscape of artificial intelligence, the ability of conversational agents to persuade users is becoming increasingly vital, especially in sectors like marketing, healthcare, and finance. However, developing such persuasive dialogue systems faces significant hurdles: a scarcity of authentic multi-turn user data, difficulties in evaluating new systems (cold-start problem), and the inefficiency of manually designed prompts.

To tackle these challenges, researchers from Baidu Inc. have introduced MADS (Multi-Agent Dialogue Simulation), a groundbreaking framework designed to generate diverse and persuasive multi-turn dialogues through agent self-play. MADS offers a scalable and cost-effective solution for creating high-quality training data without extensive human annotation, making it particularly suitable for new or early-stage deployments.

Understanding the MADS Framework

MADS operates as a closed-loop system, continuously refining its dialogue generation process. It comprises three interconnected agents:

  • User Agents: These agents simulate a wide array of user personas, each driven by distinct personality traits and business contexts. They are designed to mimic realistic user behaviors.
  • Dialog Agent: This agent engages in multi-turn conversations with the User Agents, employing various task-oriented persuasion strategies.
  • Optimization Agent: This crucial agent evaluates and refines the dialogue outcomes. It automates dialogue annotation and prompt refinement through three sub-modules: Summary, Evaluation, and Evolution.

The entire process follows a cycle: Meta Instruction → Simulation → Optimization → Domain-Specific LLMs, yielding self-optimizing, high-fidelity modeling of user interactions.
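The closed loop above can be sketched in a few lines of Python. This is purely illustrative: the class and function names (`UserAgent`, `DialogAgent`, `simulate`, `optimize`) are hypothetical stand-ins, and the LLM calls a real system would make are replaced with placeholder strings.

```python
from dataclasses import dataclass

# Hypothetical sketch of the MADS closed loop; all names here are
# illustrative, not taken from the paper's codebase.

@dataclass
class UserAgent:
    persona: dict  # e.g. {"mbti": "INTJ", "context": "insurance lead"}

    def reply(self, history):
        # A real system would prompt an LLM conditioned on the persona.
        return f"user({self.persona.get('mbti', '?')}) turn {len(history)}"

@dataclass
class DialogAgent:
    prompt: str

    def reply(self, history):
        # A real system would prompt an LLM with the persuasion strategy.
        return f"dialog turn {len(history)}"

def simulate(user, dialog, turns=4):
    """Run one multi-turn self-play dialogue between the two agents."""
    history = []
    for _ in range(turns):
        history.append(("dialog", dialog.reply(history)))
        history.append(("user", user.reply(history)))
    return history

def optimize(prompt, dialogues):
    """Stand-in for the Optimization Agent's Summary/Evaluation/Evolution step."""
    return prompt + " [refined]"

# Meta Instruction -> Simulation -> Optimization, iterated.
prompt = "Persuade the user politely."
for _ in range(3):
    dialogues = [simulate(UserAgent({"mbti": m}), DialogAgent(prompt))
                 for m in ("INTJ", "ESFP")]
    prompt = optimize(prompt, dialogues)
```

Each pass through the loop produces self-play dialogues under the current prompt, then hands them to the optimization step, which emits a revised prompt for the next round.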

Modeling User Attitudes with Chain-of-Attitude (CoA)

A core component of MADS is the Chain-of-Attitude (CoA) model, which tracks how user attitudes change throughout a dialogue. Inspired by classic marketing models like AIDA (Attention, Interest, Desire, Action), CoA defines a structured progression of user attitudes, from initial resistance to eventual acceptance. It uses a hierarchical state space of 16 attitude states, classified by an LLM-based classifier, to capture these dynamics turn by turn. This allows MADS to quantify the diversity of attitude changes using metrics like average information entropy and Jensen-Shannon divergence.
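The two diversity metrics mentioned above are standard information-theoretic quantities, and can be computed directly from the sequence of classified attitude states. A minimal sketch, assuming attitude states are represented as labels drawn from the 16-state CoA space:

```python
import math
from collections import Counter

def normalized_entropy(states, n_states=16):
    """Shannon entropy of the attitude-state distribution, normalized
    by log2 of the state-space size (16 CoA states)."""
    counts = Counter(states)
    total = sum(counts.values())
    h = -sum((c / total) * math.log2(c / total) for c in counts.values())
    return h / math.log2(n_states)

def js_divergence(p, q):
    """Jensen-Shannon divergence between two distributions (base 2, in bits)."""
    def kl(a, b):
        return sum(x * math.log2(x / y) for x, y in zip(a, b) if x > 0)
    m = [(x + y) / 2 for x, y in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)
```

A uniform spread over all 16 states gives a normalized entropy of 1.0 (maximal diversity), while a dialogue stuck in one attitude gives 0; the JS divergence likewise ranges from 0 (identical strategy distributions) to 1 bit (completely disjoint ones).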

Self-Optimizing Dialogue Strategies

The Dialog Agent continuously improves its persuasive strategies through a reflection mechanism. Starting with a basic prompt, the system iteratively generates improved prompts and corresponding training dialogues. It dynamically calculates task-level metrics, such as intent compliance and CoA quality. If the user acceptance rate meets a predefined target and dialogue diversity is high, the system returns the optimized dialogues and agent. Otherwise, the Optimization Agent refines the prompt, learning from both successful and unsuccessful interactions.

Evaluating Persuasiveness

To validate the effectiveness of MADS-simulated data, the researchers used two primary evaluation benchmarks:

  • Make Me Pay (MMP): An OpenAI evaluation that assesses an LLM’s ability to persuade a user to donate money over multiple turns, focusing on dialogue guidance, emotional engagement, and persuasive strategies.
  • Persuasion For Good (P4G): A dataset that identifies ten typical persuasive strategies in donation scenarios, used here with an LLM-based classifier to analyze the distribution of strategies employed by the Dialog Agent.

Experimental Insights and Real-World Impact

Experiments demonstrated that incorporating personality traits like Zodiac Signs and MBTI types into user profiles significantly enhances the diversity of simulated user behaviors and enriches the persuasive strategies used by the Dialog Agent. For instance, the ‘Dmbti’ group showed a nearly 40% increase in normalized attitude entropy compared to a baseline with only demographic attributes.

Fine-tuning small language models (like Mistral-7B, ERNIE-Lite, and Qwen2.5-7B) with MADS-generated data from insurance scenarios led to substantial improvements in donation success rates and reductions in user withdrawal rates on the MMP benchmark. For example, ERNIE-Lite’s donation success rate increased from 18% to 30%.

In simulated marketing scenarios, MADS showed significant improvements in task completion rates after just a few iterations of prompt optimization. For instance, the success rate in automotive scenarios increased from 32.5% to 45%.

Perhaps the most compelling evidence of MADS’s value comes from its real-world deployment. An end-to-end Audio LLM trained with MADS for an insurance scenario achieved remarkable results. Compared to conventional systems, the MADS-trained Audio-LLM-16b increased the organic traffic conversion rate by 22.4% (from 1.83% to 2.24%) and boosted the user intention rate by 28.5%. This demonstrates clear business value and the practical applicability of the framework.
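As a quick sanity check, the 22.4% figure is the relative lift implied by the two absolute conversion rates reported:

```python
# Relative lift from a 1.83% to a 2.24% organic-traffic conversion rate.
baseline, mads = 0.0183, 0.0224
relative_lift = (mads - baseline) / baseline
print(f"{relative_lift:.1%}")  # prints 22.4%
```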

Looking Ahead

While MADS presents a powerful solution, the researchers acknowledge limitations, including a focus on single-dimension evaluation (persuasion), the potential for representational bias from a fixed attitude taxonomy, and the need for more stratified prompt optimization for diverse user types. Future work will explore multi-dimensional assessments, data-driven attitude representations, and more nuanced prompt strategies.

The MADS framework represents a significant step forward in generating high-quality, diverse persuasive dialogue data, offering a scalable and efficient approach to enhance conversational AI systems. For more details, you can refer to the full research paper.

Nikhil Patel (https://blogs.edgentiq.com)
Nikhil Patel is a tech analyst and AI news reporter who brings a practitioner's perspective to every article. With prior experience working at an AI startup, he decodes the business mechanics behind product innovations, funding trends, and partnerships in the GenAI space. Nikhil's insights are sharp, forward-looking, and trusted by insiders and newcomers alike. You can reach him at: [email protected]
