Britannica and Merriam-Webster File Major AI Copyright Lawsuit Against OpenAI
Publishers allege OpenAI scraped reference works to train ChatGPT and misattributed content.
On March 13, 2026, a major legal challenge was initiated in the U.S. District Court for the Southern District of New York. Encyclopaedia Britannica, Inc., along with its well-known subsidiary Merriam-Webster, Inc., filed a high-stakes lawsuit against OpenAI.
The complaint alleges that the AI giant systematically copied nearly 100,000 reference entries to train its GPT models and power the new ChatGPT search product.
As reported by Bloomberg, the lawsuit claims these actions resulted in “near-verbatim” outputs that effectively cannibalized web traffic. By seeking both monetary damages and a permanent injunction, the publishers are challenging the “fair use” defense that has long protected AI development.
Encyclopaedia Britannica Sues OpenAI
The complaint, filed in the Southern District of New York and publicly reported by legal and technology outlets, including Reuters, alleges that OpenAI ingested nearly 100,000 Britannica articles and Merriam-Webster entries to train GPT-series models and to power the search and answer features of ChatGPT.
Plaintiffs say the company’s chatbot reproduces close to verbatim passages and dictionary definitions and sometimes attributes those passages to Britannica or Merriam-Webster in inaccurate “hallucinations.” The filing names the defendants and attaches side-by-side comparisons of Britannica content and ChatGPT outputs as examples.
The complaint was filed March 13, 2026 (S.D.N.Y. case no. 1:26-cv-02097) and was prepared by lead counsel at Susman Godfrey LLP, the firm that has represented other major publishers in related AI litigation.
Britannica’s filings say the alleged copying occurred at scale, and they ask a judge both for unspecified monetary relief and for a court order blocking the alleged infringements. OpenAI responded in public statements that its models are trained on publicly available data and that its practices are grounded in fair use.
Why the OpenAI Lawsuit Matters
The OpenAI copyright legal case adds to a growing number of legal disputes over how generative AI systems are trained. Publishers, authors, and media organizations have increasingly argued that AI companies rely on copyrighted materials without authorization.
The dispute reflects broader industry debates about whether AI training qualifies as fair use, a legal doctrine that allows limited use of copyrighted material without permission. OpenAI has previously argued that training models on publicly available data falls within fair-use protections, a position cited in multiple ongoing cases.
Legal analysts say these disputes could shape the future of AI training datasets and licensing models. Tom’s Guide publishers increasingly view licensing deals as a possible path forward, especially after a $1.5 billion settlement involving Anthropic and authors highlighted the financial stakes of AI copyright disputes.
Merriam-Webster OpenAI Lawsuit Reactions
Executives and legal observers have framed the case as part of a broader fight over how AI companies interact with trusted information sources.
Legal experts cited in Bloomberg Law note that the case also focuses on alleged trademark misuse under the Lanham Act, particularly when AI systems incorrectly attribute generated content to trusted publishers.
Industry observers note that the Britannica-Merriam-Webster lawsuit follows prior legal challenges faced by OpenAI, including cases over Scarlett Johansson’s voice likeness and other content replication.
Analysts tracking generative AI litigation, such as the “ChatGPT Is Eating the World” legal tracker, highlight this as part of a growing trend of lawsuits targeting AI developers for copyright, trademark, and data usage practices.
ChatGPT Copyright Case Industry Impact
The ChatGPT copyright case could have implications beyond the companies directly involved. Developers building AI systems may need clearer guidelines on how training datasets are compiled, especially when they include copyrighted material.
According to TechCrunch, the lawsuit is part of a broader wave of litigation that includes the ongoing New York Times vs. OpenAI case and a separate lawsuit involving Britannica and the AI search company Perplexity.
If courts rule that training AI systems on copyrighted material without permission constitutes infringement, technology companies may need to rely more heavily on licensed datasets or agreements with publishers.
At the same time, publishers may seek compensation when their content contributes to AI systems that generate automated answers competing with traditional websites.
As Moneycontrol reports, the case highlights a growing tension between AI innovation and the protection of intellectual property created by professional writers, editors, and researchers.
What’s Next For OpenAI
The OpenAI-Britannica lawsuit will proceed through preliminary court proceedings in the Southern District of New York. The next steps could include motions to dismiss and early discovery related to OpenAI’s training data.
Legal experts expect the case to unfold alongside several other high-profile AI copyright lawsuits, including those brought by media companies and authors.
Decisions in these cases could influence how AI developers collect training data and negotiate licensing agreements with publishers in the coming years.
Source: Britannica, Merriam-Webster Accuse OpenAI of Copying



