Salesforce Faces Copyright Infringement Lawsuit Over AI Model Training Data

TLDR: Salesforce is facing a proposed class-action lawsuit filed by authors who allege the company used thousands of copyrighted books without permission to train its artificial intelligence models, including its xGen AI series and models powering Einstein Copilot. The lawsuit highlights the growing legal challenges faced by tech companies over the use of copyrighted material in AI development.

Cloud-computing giant Salesforce is facing a proposed class-action lawsuit from a group of authors who claim the company unlawfully used their copyrighted works to train its artificial intelligence software. The complaint, filed in federal court on Wednesday, October 15, 2025, alleges that Salesforce infringed intellectual property rights by incorporating thousands of books into the training datasets for its xGen AI series of large language models (LLMs) and other generative AI tools such as Einstein Copilot, which are powered by models from AI firm Cohere.

Authors E. Molly Tanzer and Jennifer Gilmore are among the plaintiffs. Other reports also mention bestselling writers such as Jonathan Franzen, Jodi Picoult, and George Saunders, indicating a potentially broader legal action. The lawsuit specifically points to the inclusion of the ‘notorious RedPajama and The Pile datasets’ in Salesforce’s training regimen. These datasets are said to contain the ‘Books3 corpus’, a collection of hundreds of thousands of copyrighted books acquired without the authorization or consent of their creators.

Joseph Saveri, the attorney representing the authors, underscored the critical need for transparency from companies developing AI products that rely on copyrighted material. ‘It’s important that companies that use copyrighted material for AI products are transparent,’ Saveri stated, adding, ‘It’s also only fair that our clients are fairly compensated when this happens.’ The plaintiffs are seeking substantial damages and an injunction to prevent Salesforce from any further unauthorized use of their content in AI training.

Adding a layer of irony to the situation, the lawsuit reportedly cites previous statements by Salesforce CEO Marc Benioff, who has publicly criticized other AI companies for using ‘stolen’ training data and suggested that compensating content creators for their work would be ‘very easy to do.’ The complaint argues that Benioff’s own company should adhere to these principles.

Salesforce has not issued a public statement on the lawsuit, and a company spokesperson declined to comment. The legal action is not an isolated incident but part of a growing trend. Numerous authors and content owners have filed similar lawsuits against other major tech firms, including OpenAI, Microsoft, and Meta Platforms, all alleging the misuse of copyrighted material for AI model training. Notably, Anthropic recently reached a landmark $1.5 billion settlement in August with a separate group of authors over similar copyright infringement claims, signaling the potential financial stakes of such disputes.
