
A Two-Stage Framework to Reduce AI Hallucinations in Multilingual Models

TLDR: CCL-XCoT is a two-stage fine-tuning framework designed to reduce hallucinations in multilingual large language models (MLLMs), particularly for low-resource languages. It first improves cross-lingual semantic alignment through curriculum-based contrastive learning during continued pre-training, and then guides the model to reason in a high-resource language before generating answers in the target low-resource language via a Cross-lingual Chain-of-Thought (XCoT) prompting strategy. The method reduces hallucination rates by up to 62% and improves factual knowledge transfer without external retrieval or multi-model ensembles.

Large Language Models (LLMs) have shown remarkable abilities across many tasks, but they often struggle with generating accurate and factual information, a problem commonly known as ‘hallucination.’ This issue is particularly challenging for Multilingual Large Language Models (MLLMs), especially when dealing with languages that have limited training data, often referred to as ‘low-resource languages.’ These hallucinations can lead to outputs that are inaccurate or completely fabricated, which is a significant concern for real-world applications.

The core problem stems from an imbalance in training data: MLLMs are typically trained more extensively on high-resource languages like English, leading to a knowledge gap when they try to generalize to other languages. Existing methods like Retrieval-Augmented Generation (RAG) or Chain-of-Thought (CoT) prompting have shown some success in English, but their effectiveness diminishes in low-resource settings due to unreliable retrieval or the assumption that the model already possesses sufficient internal knowledge in the target language.

To tackle this, researchers Weihua Zheng, Roy Ka-Wei Lee, Zhengyuan Liu, Kui Wu, AiTi Aw, and Bowei Zou have proposed an innovative solution called CCL-XCoT. This framework, detailed in their paper CCL-XCoT: An Efficient Cross-Lingual Knowledge Transfer Method for Mitigating Hallucination Generation, is a two-stage fine-tuning process designed to improve factual generation in low-resource languages without needing external retrieval systems.

How CCL-XCoT Works

The CCL-XCoT framework operates in two key stages:

First, during a phase called ‘continued pre-training,’ the model undergoes ‘curriculum-based contrastive learning’ combined with next-token prediction. This stage is crucial for enhancing cross-lingual semantic alignment. In simpler terms, it teaches the model to recognize that semantically similar sentences or paragraphs across different languages should be represented closely in its internal knowledge space. This process starts with aligning individual sentences and then progresses to aligning longer, paragraph-level texts, helping the model build a more robust understanding across languages.
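To make the first stage concrete, here is a minimal sketch of how contrastive alignment can be combined with next-token prediction. This is an illustration under assumptions, not the authors' implementation: it assumes an InfoNCE-style loss, mean-pooled hidden states as sentence representations, a fixed loss weight, and a Hugging Face-style causal LM interface; the paper's exact formulation may differ.

```python
# Sketch: curriculum-based contrastive learning + next-token prediction
# (assumptions: InfoNCE loss, mean-pooled hidden states, fixed loss weight).
import torch
import torch.nn.functional as F

def contrastive_loss(src_emb, tgt_emb, temperature=0.05):
    """Pull parallel (source, target) embeddings together and push apart
    non-parallel pairs within the batch (InfoNCE)."""
    src = F.normalize(src_emb, dim=-1)
    tgt = F.normalize(tgt_emb, dim=-1)
    logits = src @ tgt.T / temperature                  # (batch, batch) similarities
    labels = torch.arange(src.size(0), device=src.device)
    return F.cross_entropy(logits, labels)

def training_step(model, batch, align_weight=0.5):
    """Combine the language-modeling loss with cross-lingual alignment."""
    # Standard next-token prediction on the pre-training text.
    lm_out = model(input_ids=batch["lm_ids"], labels=batch["lm_ids"])
    # Mean-pool hidden states of a parallel sentence (or paragraph) pair.
    src_h = model(input_ids=batch["src_ids"], output_hidden_states=True).hidden_states[-1].mean(dim=1)
    tgt_h = model(input_ids=batch["tgt_ids"], output_hidden_states=True).hidden_states[-1].mean(dim=1)
    return lm_out.loss + align_weight * contrastive_loss(src_h, tgt_h)

# Curriculum: train on sentence-level pairs first, then paragraph-level pairs.
# for loader in (sentence_pairs_loader, paragraph_pairs_loader):
#     for batch in loader:
#         loss = training_step(model, batch)
#         loss.backward(); optimizer.step(); optimizer.zero_grad()
```

The curriculum aspect appears only in the training loop: the same objective is applied first to short, sentence-level parallel pairs and later to longer, paragraph-level ones.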

Second, during ‘instruction fine-tuning,’ the framework introduces a ‘Cross-lingual Chain-of-Thought (XCoT)’ prompting strategy. This is where the model is guided to think and reason in a high-resource language (like English) first, before generating its final answer in the target low-resource language. For example, if asked a question in Chinese, the model would first internally outline its reasoning steps in English, then formulate an English answer, and finally translate that answer back into Chinese. This leverages the model’s stronger reasoning and factual knowledge base in high-resource languages to improve accuracy in less-resourced ones.
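The prompt structure below is a hedged illustration of that XCoT idea. The exact wording used in the paper is not given in this article; the template, function name, and example question are assumptions meant only to show the reason-in-English, answer-in-target-language flow.

```python
# Sketch of an XCoT-style prompt (illustrative wording, not the paper's template).
XCOT_TEMPLATE = """Question (in {target_lang}): {question}

Step 1: Outline your reasoning in English.
Step 2: State the answer in English.
Step 3: Translate that answer into {target_lang} and output only the translation.
"""

def build_xcot_prompt(question: str, target_lang: str = "Chinese") -> str:
    """Build a prompt that elicits English reasoning before a target-language answer."""
    return XCOT_TEMPLATE.format(question=question, target_lang=target_lang)

print(build_xcot_prompt("《红楼梦》的作者是谁？"))
```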


Promising Results

The experimental results for CCL-XCoT are highly encouraging. The framework has been shown to reduce hallucination rates by up to 62% and significantly improve the transfer of factual knowledge between different language pairs. Importantly, these improvements are achieved without relying on external retrieval systems or complex multi-model setups. The curriculum-based contrastive learning alone led to gains of up to 20% on cross-lingual natural language understanding tasks, demonstrating its effectiveness in semantic alignment. Furthermore, analysis revealed that fine-tuning the mid-level layers of the model was most effective for cross-lingual knowledge transfer, offering insights for more efficient future adaptations.
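The finding about mid-level layers suggests a natural efficiency trick: restrict fine-tuning to those layers. The snippet below is a rough sketch under assumptions, not the authors' procedure: it assumes "mid-level" means the middle third of a LLaMA-style decoder stack reachable via `model.model.layers`, which is a guess for illustration.

```python
# Sketch: fine-tune only the middle third of transformer layers
# (assumption: "mid-level layers" = middle third; layer path is LLaMA-style).
def freeze_except_mid_layers(model, num_layers: int):
    """Freeze all parameters, then unfreeze only the middle third of decoder layers."""
    for p in model.parameters():
        p.requires_grad = False
    lo, hi = num_layers // 3, 2 * num_layers // 3
    for i in range(lo, hi):
        for p in model.model.layers[i].parameters():
            p.requires_grad = True
```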

This research marks a significant step forward in making multilingual LLMs more reliable and accurate, especially for languages that have historically been underrepresented in large datasets. By explicitly aligning semantic spaces and guiding reasoning through high-resource languages, CCL-XCoT offers a practical and efficient way to bridge linguistic and factual gaps in AI-generated content.

Karthik Mehta (https://blogs.edgentiq.com)
Karthik Mehta is a data journalist known for his data-rich, insightful coverage of AI news and developments. Armed with a degree in Data Science from IIT Bombay and years of newsroom experience, Karthik merges storytelling with metrics to surface deeper narratives in AI-related events. His writing cuts through hype, revealing the real-world impact of Generative AI on industries, policy, and society. You can reach him at: [email protected]
