OpenAI has notched a victory in its ongoing legal fight against publishers over how its AI tools use creative work. On November 7, a judge dismissed a copyright case against the startup brought by independent publishers Alternet and Raw Story.
While several publishers have lined up content deals with OpenAIâincluding WIRED parent company Condé Nastâdozens of copyright lawsuits against AI startups are winding their way through the US court system. Many of these complaints allege direct copyright infringement, arguing that itâs illegal for AI companies to train their tools on news articles, books, paintings, and other copyrighted materials without permission. Some also include other claims ranging from trademark law violation to violations of the Digital Millennium Copyright Act, a copyright law intended as an anti-piracy statute that is now broadly deployed by intellectual property rights holders.
The complaint brought by Alternet and Raw Story focused on the DMCA, arguing that OpenAI broke the law by scraping thousands of news articles and stripping them of âcopyright management informationâ (CMI) like the authorâs name, the terms and conditions for use of the work, and the title of the work. The outlets asked for statutory damages of no less than $2,500 per violation, arguing OpenAI knew that stripping training data of CMI would result in copyright infringement from ChatGPT when it summarized or âregurgitatedâ articles without proper attribution.
OpenAI argued that the publishers had no legal standing to bring this claim, stating they failed to offer proof that ChatGPT was trained on their material, let alone that the training was harmful. Judge Colleen McMahon of the US Southern District of New York agreed with OpenAIâs argument, dismissing the case for lack of standing.
âWe build our AI models using publicly available data, in a manner protected by fair use and related principles, and supported by long-standing and widely accepted legal precedents,â says OpenAI spokesperson Jason Deutrom.
Although this is a major setback for Alternet and Raw Story, itâs not necessarily the end. âWe do intend to continue the case,â says Raw Story founder and CEO John Byrne. The next step is requesting permission from the judge to file an amended complaint.
âWeâre confident that we can address the courtâs concerns in an amended complaint,â says Matt Topic, a partner at Loevy & Loevy, the firm representing Raw Story Media. While Judge McMahon describes herself as âskepticalâ that the outlets could âallege a cognizable injuryâ in the dismissal, her ruling does indicate that sheâs open to considering a new filing.
Topic, who also represents The Intercept in a similar DMCA case against OpenAI, as well as the nonprofit newsroom the Center for Investigative Reporting in a copyright infringement case against both OpenAI and Microsoft, says he is âconfident that these kinds of DMCA claims are permitted under the Constitution.â
Not all experts agree. âThese claims make no sense and should all be dismissed, so I am not surprised by this ruling,â says Matthew Sag, a professor of law and artificial intelligence at Emory University. He believes the publishers failed to prove that OpenAI broke the law in part because they did not offer concrete examples that ChatGPT distributed copies of their work after stripping CMI.
Ann G. Fort, an intellectual property lawyer and partner at Eversheds Sutherland, suspects that the news outlets will need to provide specific examples of how ChatGPT produces infringing responses. âTheyâre going to need to show output,â she says.
DMCA claims have been especially contentious in a number of AI lawsuits. In The Intercept case, OpenAI filed a motion to dismiss over standing, too, but the court procedure was slightly different, and the publisher was given leave to file an amended complaint. It did so this past summer, bolstering its case by adding 600 pages of exhibits, including examples of how OpenAIâs models could be prompted to produce snippets of text that were in at least one case nearly identical to an Intercept article. The court is expected to rule later this month.
Whether or not Raw Story and Alternet are ultimately allowed to file an amended complaint, this weekâs dismissal appears not to foreclose other legal arguments; the judge pointedly noted that she found the specific DMCA claims lacking rather than the broader concept of infringement. âLet us be clear about what is really at stake here. The alleged injury for which the plaintiffs truly seek redress is not the exclusion of CMI from defendantâs training sets, but rather the defendantâs use of plaintiffâs articles to develop ChatGPT without compensation to plaintiff,â Judge McMahon writes. âWhether there is another statute or legal theory that does elevate this type of harm remains to be seen. But that question is not before the court today.â
However, some experts believe this ruling could, indeed, have far-reaching consequences. âThis theory of no standing is actually a potential earthquake far beyond AI,â says James Grimmelmann, a professor of digital and internet law at Cornell University. âIt has the potential to significantly restrict the kinds of IP cases that federal courts can hear.â He suspects that the logic applied in this case could be extended to argue that publishers donât have standing âto sue over model training at all, even for copyright infringement.â