Enhancing Travel Booking Platforms with Large Language Models: An Industrial Case Study

TLDR: A research paper details Fervento’s integration of Large Language Models (LLMs) into their CALEIDOHOTELS platform to improve the consistency and completeness of property descriptions. The study compared Mistral 7B (fine-tuned) and Mixtral 8x7B, finding Mixtral 8x7B superior in completeness, precision, and hallucination reduction, despite its significantly higher computational cost. The project, CaleidoGen, aims to enhance user experience, enable customization, and differentiate the platform through high-quality, consistent accommodation data.

The online travel industry, particularly property booking platforms, faces a significant challenge: maintaining consistent and up-to-date information about accommodations. Data often comes from various third-party providers, leading to incomplete or inconsistent details that can frustrate users and impact market share. To tackle this, Fervento, a software development startup, has integrated Large Language Models (LLMs) into its property reservation platform, CALEIDOHOTELS.

This initiative, detailed in the research paper Large Language Models in the Travel Domain: An Industrial Experience, explores the practical application of LLMs to enhance the quality and consistency of facility descriptions. The core idea is to leverage the LLMs’ ability to process and generate human-like text to fill data gaps and standardize property information across the platform.

The CaleidoGen System

Fervento developed a system called CaleidoGen, which uses Generative AI (GenAI) to create a coherent catalog of property descriptions from existing data sources. This system aims to reduce resource utilization compared to traditional methods and offers several key advantages:

Enhanced User Experience: By unifying descriptions across different providers, CaleidoGen ensures a consistent brand feel and simplifies comparisons for users.
Customization and Personalization: The system can highlight specific features or amenities, tailoring descriptions to seasonal interests or unique selling points.
Ease of Extension: A dedicated generation module makes it easier to integrate new or lower-quality data sources with minimal overhead.
Differentiation from Competitors: CALEIDOHOTELS can stand out with unique and easily updated property descriptions, turning its catalog into a strategic asset.

Evaluating LLM Performance

The researchers evaluated two prominent LLMs for this task: Mistral 7B and Mixtral 8x7B. Mistral 7B, a smaller and more resource-efficient model, was fine-tuned using the QLoRA technique. Mixtral 8x7B, a larger and more powerful model, was utilized with a refined system prompt. Both models were assessed on their ability to generate consistent and homogeneous descriptions while minimizing inaccuracies, often referred to as ‘hallucinations’.

The evaluation involved generating 20 facility descriptions for each model, using a distinct testing dataset to ensure unbiased results. The quality of the generated content was measured using several metrics:

Completeness: How much of the available context information was included in the description.
Precision: The factual accuracy of the information presented.
Hallucinations: The presence of fabricated or incorrect details.
Length of generation: The number of words in the generated text.

Also Read:

Key Findings and Trade-offs

The results showed that both models performed well, but Mixtral 8x7B consistently outperformed Mistral 7B-FT (fine-tuned) across most metrics. Mixtral 8x7B achieved 99.6% completeness compared to Mistral 7B-FT’s 93%, and a precision of 98.8% versus 96%. Crucially, Mixtral 8x7B had a significantly lower hallucination rate of 1.2% compared to Mistral 7B-FT’s 4%, making its content more reliable. Although Mixtral 8x7B produced slightly shorter descriptions (averaging 249 words vs. 277), it did so without compromising essential details, indicating more concise output.

However, this superior performance came with a notable trade-off in computational cost. Mixtral 8x7B required approximately 50GB of VRAM and an hourly cost of about $1.61, roughly ten times more than Mistral 7B-FT, which needed only 5GB of VRAM and cost around $0.16 per hour. This highlights the balance between achieving high output quality and managing infrastructure and operational expenses.

Despite its higher computational demands, Mixtral 8x7B was ultimately chosen as the core model for CALEIDOHOTELS due to the critical importance of providing accurate and complete information in the travel domain. This industrial experience demonstrates the significant potential of advanced LLMs in solving complex data consistency challenges within the tourism and technology sectors.

The work also sets the stage for future developments, including LLM-driven features like generating personalized itineraries or guided tour plans based on user preferences and nearby attractions.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Enhancing Travel Booking Platforms with Large Language Models: An Industrial Case Study

The CaleidoGen System

Evaluating LLM Performance

Key Findings and Trade-offs

Gen AI News and Updates

AI’s Hyper-Growth Unlocked: OpenAI’s $500B Valuation Forces a Capital Re-evaluation for Investors

PASA Unveils New ‘Data for AI’ Guidance to Foster Responsible Innovation in Pensions Administration

Ghana Navigates Complexities in AI Regulatory Development Amidst Coordination Challenges

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates