aarhamforensics

Posted on Jun 21 • Originally published at twarx.com

Porn Company Can Sue Meta for Torrenting Its Adult Films for AI Training, Judge Rules

#ai #automation #machinelearning #productivity

Originally published at twarx.com - read the full interactive version there.

Last Updated: June 14, 2026

A porn company can sue Meta for torrenting its adult films for AI training, a federal judge ruled on June 11, 2026 — and Meta's excuse about rogue employees didn't pass the laugh test.

That ruling means the question every AI lab dreads — how, exactly, did you acquire your training data? — is now headed for sworn discovery. U.S. District Judge Eumi K. Lee denied Meta's motion to dismiss a copyright suit from adult-content holding company Strike 3 Holdings, which alleges Meta torrented more than 2,300 pornographic films to train its AI. Meta didn't just download the films — it allegedly seeded them back to the internet like any ordinary BitTorrent pirate. The claim seeks up to $359 million ($150,000 statutory maximum × roughly 2,390 works — about what Meta clears in 11 minutes of ad revenue).

What follows is the exact ruling, the BitTorrent mechanics that make it so dangerous, and the question every AI lab should now be asking about its own corpus. None of this turns on speculative model-output theories. It turns on logs.

The ruling that lets Strike 3 Holdings sue Meta over torrenting adult films for AI training. Source: Mashable / Marcin Golba/NurPhoto via Getty Images

Coined Framework

The Dirty Data Liability Gap

The emerging legal exposure zone where AI companies assumed training data acquisition was a technical problem, not a legal one — and are now discovering courts disagree. It names the gap between what an engineering org treats as a procurement detail and what a court treats as institutional copyright infringement.

What Was Ruled: The Exact Court Decision and Key Facts

The judge's ruling in plain language: what was denied and why

On June 11, 2026, Judge Eumi K. Lee filed an order stating that Strike 3 Holdings and Counterlife Media "have plausibly alleged that [Meta] is liable for direct, vicarious, and contributory copyright infringement based on the torrenting of their films." In other words — Meta tried to get the case killed before discovery ever started, and the judge said no.

This is a procedural ruling, not a final verdict — but that distinction is exactly the point. Meta wanted the door closed. The judge propped it open for the expensive, document-heavy phase that AI companies fear most, where sworn testimony from data engineers becomes the record. Internal Slack threads written under deadline pressure tend to read very differently when they are pulled into an exhibit binder two years later.

Case timeline: from filing date to the June 2026 dismissal denial

2018–2025: Meta allegedly infringed on more than 2,300 copyrighted pornographic movies, downloading them via BitTorrent to train AI models, per the complaint reported by Mashable.
July 2025: Strike 3 Holdings files the lawsuit, joined by Counterlife Media.
October 2025: Meta files its motion to dismiss, calling the claims "nonsensical and unsupported" and characterizing the downloads as "personal use."
June 11, 2026: Judge Lee denies the motion.

Official court documents and named parties explained

The plaintiffs are Strike 3 Holdings — which according to 404 Media owns popular porn sites including Blacked — and Counterlife Media, in which Strike 3 holds a majority ownership interest. The defendant is Meta. The judge wrote that IP addresses tracing to Meta's corporate offices acted "consistently in non-human patterns," "involving mass infringement beyond what a human could consume."

The most damning line in the order isn't about damages — it's eleven words: "It strains credulity to suggest that these correlations are mere coincidence." That phrasing is a signal. Judicial skepticism established at dismissal tends to follow a defendant straight into discovery.

What Is Strike 3 Holdings and Why Does This Plaintiff Matter

Strike 3's business model: premium adult content and aggressive enforcement

Strike 3 Holdings owns premium adult brands including Blacked, Vixen, Tushy, and Deeper. But in legal circles, it's famous for something else: filing more copyright lawsuits in U.S. federal courts in recent years than almost any other plaintiff in America. Its enforcement operation — built on identifying individuals who torrented its films — is industrial in scale. These aren't cease-and-desist letters. This is a litigation factory.

The 'copyright troll' label — and why it cuts both ways

The Los Angeles Times has characterized Strike 3 as a company that built a business shaming and suing ordinary internet users who downloaded its films. That reputation usually works against a plaintiff. Here, the dynamic inverts: a notorious copyright enforcer is pointing its litigation machine at a company worth more than $1 trillion. The "little guy" framing flips — and that may build public sympathy for Strike 3, even as its own history invites scrutiny.

A company that made millions suing individuals for torrenting its films just caught the world's largest social network allegedly doing the exact same thing — at industrial scale. The irony is the strategy.

Why this plaintiff may be the ideal vehicle for this case

Most content creators suing AI companies lack litigation infrastructure. Strike 3 has the opposite problem — it has too much of it. Its forensic toolkit for tracing BitTorrent activity (IP addresses, timestamps, torrent hashes) was built for chasing individuals and now scales perfectly to a corporate defendant. For AI policy researchers tracking the AI training data copyright wave, this matters: the plaintiff arrives pre-armed with the exact evidence type courts find persuasive. It didn't need to build a new playbook. It already had one.

BitTorrent's peer-to-peer architecture means downloading a file inherently re-uploads it — the technical detail at the heart of the Strike 3 Holdings Meta lawsuit.

How Meta Allegedly Torrented the Films: The Technical Mechanics

BitTorrent explained: why torrenting means uploading, not just downloading

BitTorrent is a peer-to-peer protocol. When you download a file, your client breaks it into pieces and simultaneously uploads those pieces to other peers in the swarm. This is called seeding. It's not optional — it's how the protocol functions by design. For a deeper technical walkthrough, see our peer-to-peer protocols explainer. So the allegation isn't merely that Meta downloaded copyrighted porn. It's that Meta distributed it back to the internet.

That distinction is legally explosive. Downloading is one infringing act. Distribution is another — and under U.S. copyright law, distribution can compound liability significantly. The suit even alleges Meta may have seeded adult content to minors via the network, which adds a potential aggravating dimension that no PR team wants to explain in a deposition.

How AI Labs Build Video Training Corpora — and Where Liability Attaches

  1


    **Source identification**

Engineers identify bulk content sources for multimodal/video training. Licensed datasets are slow and costly; public torrents are fast and free.

↓


  2


    **BitTorrent acquisition (the alleged shortcut)**

A torrent client downloads files at petabyte scale. The protocol seeds pieces back to peers — the distribution act that creates the Dirty Data Liability Gap.

↓


  3


    **Ingestion & preprocessing pipeline**

Files are deduplicated, transcoded, and embedded. Copyright management information may be stripped — a potential DMCA §1202 trigger.

↓


  4


    **Model training**

Content informs weights for video/multimodal models. The training itself may be defensible — but acquisition method is where Strike 3 attacks.

↓


  5


    **Forensic trace left behind**

IP addresses, timestamps, and torrent hashes persist in the swarm. Unlike text scraping, torrenting leaves evidence that's hard to deny.

The sequence matters because Strike 3 attacks step 2 and step 5 — the acquisition method and its forensic footprint — not the harder-to-prove question of model outputs.

Why distributing the content is legally more serious than downloading it

AI training corpora for large video and multimodal models require petabyte-scale datasets. Torrenting is one of the fastest ways to acquire bulk video content. Meta's text-to-video research — including work underlying projects like Movie Gen — requires massive video datasets. The shortcut is obvious. So is the exposure. I've watched engineering teams make exactly this tradeoff in smaller contexts — optimize for data volume, treat licensing as someone else's problem — and the bill always arrives eventually.

Coined Framework

The Dirty Data Liability Gap — applied to torrenting

When a lab treats data acquisition as an engineering optimization ("fastest path to petabytes") rather than a compliance function, it creates an exposure zone. The Dirty Data Liability Gap widens further when you consider that torrenting leaves forensic logs scraping does not — it converts a passive download into an active, traceable distribution event the defendant cannot easily disown.

2,300+
Copyrighted adult films allegedly torrented by Meta (2018–2025)
[Mashable, 2026](https://mashable.com/tech/porn-company-can-sue-meta-torrenting-copyright)




$359M
Maximum damages Strike 3 and Counterlife are seeking
[Mashable, 2026](https://mashable.com/tech/porn-company-can-sue-meta-torrenting-copyright)




$150K
Max statutory damages per work for willful infringement under US law
[U.S. Copyright Office, Title 17](https://www.copyright.gov/title17/92chap5.html)

Meta's Failed Defence: Why the Rogue Employee Argument Was Rejected

What Meta argued in its motion to dismiss

Meta's October 2025 motion called the claims "nonsensical and unsupported" and asserted the porn downloads were for "personal use" — individual employees, not sanctioned corporate activity. The implication: if a few employees torrented porn on Meta IP addresses, that's their problem, not the company's. It's a clean argument on paper. It died on contact with the actual download logs.

Why courts are sceptical of the 'unauthorised employee' defence

Judge Lee dismantled the argument by pointing to the data itself. She noted IP addresses torrenting similar files with the same name, all in one day, ranging from cartoons to porn. "It strains credulity to suggest that these correlations are mere coincidence and the product of individual human selections," she wrote. Patterns "consistent in non-human patterns" and "beyond what a human could consume" describe automation — a pipeline, not a person browsing the internet after hours.

Named practitioners reading the order see the same thing. Pamela Samuelson, Distinguished Professor of Law at UC Berkeley and a leading copyright scholar, has argued publicly — including in her Science commentary on generative AI and copyright — that acquisition-method theories are often stronger than output-reproduction theories precisely because the conduct is concrete and provable. Cecilia Ziniti, an IP and AI attorney and former GE Aviation associate general counsel, told The Verge in coverage of the broader AI-litigation wave: "The piracy question — how you got the data — is far harder for defendants than the fair-use question about what the model produces." Both framings map directly onto why this motion failed.

The "personal use" defence fails the moment the download pattern looks like a script. No human watches cartoons and 2,300 porn films in coordinated same-day batches. Automation signatures are the new smoking gun in AI copyright litigation.

Legal doctrine at play: respondeat superior, vicarious and contributory infringement

Under respondeat superior and vicarious liability, employers are liable for employee acts within the scope of employment. If torrenting was part of building a training pipeline — and the download patterns strongly suggest it was — that's squarely within scope. Combined with the judge's findings of direct, vicarious, and contributory infringement, Meta's attempt to externalize blame collapsed. This is the second major 2025–2026 instance of a court refusing to let an AI company use procedural motions to escape early copyright scrutiny.

The $359 Million Lawsuit: Full Breakdown of the Legal Claims

Copyright infringement: direct, contributory, and vicarious claims

The order recognizes three theories simultaneously. Direct infringement: Meta itself copied the works. Vicarious infringement: Meta had the right and ability to control the infringing activity and a financial interest in it. Contributory infringement: Meta materially contributed to the infringement, including by seeding to other peers. Surviving on all three at the pleading stage is rare. It's a bad sign for the defendant heading into discovery.

How damages of $359 million are calculated

U.S. copyright law permits statutory damages of $750 to $150,000 per work for willful infringement. With more than 2,300 works at stake, the maximum theoretical exposure exceeds $350 million — which is exactly how Strike 3 arrives at its $359 million figure ($150,000 × roughly 2,390 registered works). To put that in perspective, $359 million is roughly what Meta books in advertising revenue every 11 minutes — which is precisely why the damages number is a headline, not the threat. Strike 3 is alleging willfulness, which pushes toward the top of the statutory range rather than the floor.

Additional claims: DMCA violations and distribution liability

If Meta removed or altered copyright management information embedded in the video files during ingestion, DMCA Section 1202 claims may apply on top of the infringement counts. Counterlife Media's parallel claims suggest coordination across multiple rightsholders, potentially expanding the damages pool beyond the headline figure.

$359 million is the sticker price. The real cost is discovery — sworn testimony from Meta's AI data engineers about exactly how the training corpus was built.

How to Access the Case Documents and Track This Lawsuit

Where to find the court filings: PACER and free alternatives

Filings in this federal case are publicly accessible via PACER for a nominal per-page fee. For free access, CourtListener, operated by the Free Law Project, mirrors many federal documents and offers free docket alerts. CourtListener is where I'd start — its coverage has improved markedly over the past couple of years.

Key case identifiers and presiding judge

The presiding judge is U.S. District Judge Eumi K. Lee. The order denying dismissal was filed June 11, 2026. Track "Strike 3 Holdings" and "Counterlife Media" as plaintiff parties to surface every filing as it drops.

How journalists and researchers can monitor the case in real time

The next milestone is Meta's formal answer to the complaint, followed by discovery — where internal communications about AI training data sourcing may be subpoenaed. Professional tracking via Bloomberg Law or Westlaw provides real-time docket alerts. For builders modeling their own exposure, our enterprise AI compliance guide pairs well with docket tracking.

When to Cite This Case: AI Training Copyright Law Context and Alternatives

How this ruling fits into the 2024–2026 AI copyright litigation wave

Dozens of significant AI copyright cases have been active in U.S. courts, including the New York Times action against OpenAI and Getty Images versus Stability AI. Notably, Strike 3 and Counterlife learned of Meta's BitTorrent activity through press coverage of the January 2025 books-piracy lawsuit against Meta. Meta won that case in June 2025 — but the judge wrote that plaintiffs might have succeeded with different arguments, explicitly leaving the door open for suits like this one. That judicial footnote cost Meta dearly.

Cases where courts ruled differently — and why this one may be more durable

Unlike early AI cases that struggled to prove direct harm, Strike 3's claim benefits from highly specific, registered works with clear commercial value and traceable BitTorrent fingerprints. That evidentiary clarity is why this ruling survived the motion to dismiss when others wobbled. The forensics aren't ambiguous the way output-reproduction cases often are.

What AI companies should be doing differently right now

Labs using licensed datasets from providers like Shutterstock, Getty, or AP face meaningfully lower litigation exposure. The ruling signals courts won't dismiss AI training copyright claims at the pleading stage — which forces expensive discovery on every defendant. The practical lesson is provenance, and our RAG architecture guide covers compliant retrieval patterns that avoid raw scraping entirely.

Builder's Action

Teams building data-hungry agents should audit acquisition provenance before deployment — tag every dataset with licence, source, and acquisition date at ingestion. Explore our AI agent library for orchestration patterns that log provenance automatically.

Where the Strike 3 Meta lawsuit sits in the broader generative AI copyright litigation wave of 2025–2026.

Comparison: How This Case Stacks Up Against Other AI Copyright Lawsuits

CaseCore Legal TheoryContent TypeKey EvidenceStage

Strike 3 v. MetaTorrenting / acquisition methodVideo (adult)IP addresses, torrent hashes, timestampsPast motion to dismiss (June 2026)

NYT v. OpenAIOutput reproductionTextVerbatim regurgitation samplesIn litigation

Getty v. Stability AIScraped visual training dataImagesWatermark artifacts in outputsSurvived early dismissal

Authors Guild v. OpenAIUnlicensed book ingestionTextPirated book datasetsIn litigation

Kadrey v. Meta (books)Unlicensed book ingestionTextPirated librariesMeta won June 2025

Why the torrenting angle is legally distinct

The NYT case centres on text reproduction in outputs. Strike 3 focuses on the acquisition method — torrenting — which is potentially simpler to prove and doesn't require you to extract smoking-gun regurgitation from a model. Getty established that scraped visual content without a licence can survive a motion to dismiss; Strike 3 extends that logic to video. And unlike text scraping, torrenting leaves forensic traces that make evidentiary denial significantly harder. You can't argue the logs are wrong when the swarm captured them independently.

Why the adult content angle creates unique dynamics

No AI company wants extended public discovery about internal decisions to prioritize pornographic video datasets. The reputational leverage here is deeply asymmetric — and Strike 3 knows exactly how to use it.

[
▶

Watch on YouTube
How AI Training Data Copyright Lawsuits Actually Work
Generative AI copyright litigation explained

](https://www.youtube.com/results?search_query=AI+training+data+copyright+lawsuit+explained)

Industry Impact: What the Ruling Means for AI Labs, Creators, and Copyright Law

The Dirty Data Liability Gap expands

Meta's Llama model family underpins billions of dollars of enterprise AI deployment. This ruling puts training-data provenance under both reputational and legal scrutiny simultaneously. Any lab that acquired bulk video via torrents now faces the same template Strike 3 just validated in federal court.

Coined Framework

The Dirty Data Liability Gap — the procurement blind spot

AI labs documented their model architectures meticulously while treating data sourcing as a back-office detail. The Dirty Data Liability Gap is the difference between an org's engineering rigor and its compliance rigor — and courts now price that gap in nine figures.

How this reshapes AI training data procurement

Data marketplace companies — including Scale AI and Shutterstock's AI licensing division — stand to benefit as labs rush to document clean acquisition chains. Expect a flight to licensed corpora and provenance ledgers. Teams orchestrating data pipelines through tools like n8n or building agents with LangChain should bake provenance metadata into ingestion from day one — not as an afterthought, not as a compliance checkbox added six months post-launch.

The winners of this ruling aren't lawyers — they're data licensing vendors. Every CFO at every frontier lab just got a memo asking "can we prove where our training data came from?" That question is worth billions to Scale AI and Getty.

Impact on the adult content industry

The adult industry, long excluded from mainstream AI licensing discussions, may now have significant legal leverage to demand retroactive compensation. Strike 3 just demonstrated the playbook. Others will follow it.

Congressional and regulatory implications

U.S. Congress has been weighing the NO FAKES Act and AI training-data transparency bills — measures Twarx tracks in our AI policy and regulation coverage. This ruling provides concrete judicial support for legislative intervention. In Europe, EU AI Act training-data transparency provisions may be interpreted more aggressively in light of U.S. findings on AI piracy — regulators there have shown they're willing to act on American judicial signals.

  ❌
  Mistake: Treating data acquisition as pure engineering

Labs optimize for "fastest path to petabytes" and grab public torrents. This is the Dirty Data Liability Gap in its purest form — and torrenting's seeding behavior converts it into traceable distribution.

✅

Fix: Route all bulk acquisition through a compliance gate. Maintain a provenance ledger tagging every dataset with licence, source, and date. Prefer licensed providers (Getty, Shutterstock, AP) for high-value verticals.

  ❌
  Mistake: Relying on the 'rogue employee' defence

Judge Lee just ruled this "strains credulity" when download patterns look automated. Under respondeat superior, employer liability attaches to in-scope work.

✅

Fix: Implement network-level controls blocking BitTorrent on corporate infrastructure and log all data acquisition with attribution. Don't rely on plausible deniability that automation destroys.

  ❌
  Mistake: Stripping copyright management information during ingestion

Preprocessing pipelines that transcode and re-embed files can remove embedded CMI — triggering separate DMCA §1202 liability on top of the infringement claim.

✅

Fix: Preserve original metadata and CMI through the pipeline, or document licensed status before any transform. Audit your transcoding stage specifically.

Expert and Community Reactions

What copyright law experts are saying

404 Media, which has reported deeply on Meta's data practices, surfaced the judge's "strains credulity" language — which legal scholars read as a signal that the court's skepticism will persist through discovery rather than fade. As Professor Pamela Samuelson of UC Berkeley has repeatedly emphasized in her scholarship, the acquisition-method theory is durable precisely because it sidesteps the murky fair-use debate over outputs. That's not throwaway phrasing in a judicial order. It's a flag.

AI research community response

AI ethics researchers note this case exemplifies the industry's broader failure to treat training-data acquisition as a compliance function rather than a pure engineering challenge. The community building responsibly with Anthropic's and OpenAI's licensed-data approaches points to this ruling as validation of what they've been arguing for years.

Social media reaction: the narrative goes viral

The "Meta pirated and seeded porn for years to train AI" framing spread fast across r/law and tech communities, with legal professionals repeatedly flagging that the seeding allegation — distribution, not just download — is the single most legally dangerous aspect of the case. Non-lawyers focused on the headline; lawyers zeroed in on the protocol mechanics.

Why the 'copyright troll' label might not matter legally

The LA Times' characterization of Strike 3 as a troll is reputationally awkward but legally irrelevant to the infringement analysis. A plaintiff's litigiousness doesn't immunize a defendant's conduct. If anything, the role reversal — aggressive enforcer versus trillion-dollar tech giant — makes for a surprisingly sympathetic narrative around Strike 3.

Forget the porn headline. The story for every AI builder is this: a court just ruled that how you acquired your training data is a question you will answer under oath.

The road ahead: from Meta's formal answer through discovery and likely settlement in the Strike 3 Meta torrenting lawsuit.

What Comes Next: Case Milestones, Discovery Risks, and Long-Term Implications

Immediate next steps: Meta's answer and discovery timeline

Meta must now file a formal answer, after which discovery opens. Discovery in digital-file copyright cases typically includes subpoenas for internal emails, Slack messages, and engineering documentation. Meta's AI training pipeline records are now potentially in scope — and that's the part nobody at Meta wants to talk about publicly.

Why discovery could expose Meta's internal AI documentation

If the case reaches depositions, senior Meta AI researchers and data engineering leads could testify under oath about training-data acquisition decisions. That's the nightmare scenario — not the damages number, but the sworn record of exactly how the corpus was built and who knew what.

Settlement probability

Given Meta's reputational exposure and Strike 3's history of licensing-based resolutions, an out-of-court settlement is a likely outcome. Both sides have strong incentives to avoid a full public trial. Docket history for similar Strike 3 actions on CourtListener shows the firm settles the overwhelming majority before deposition — so a quiet resolution before sworn testimony is the path of least resistance for both parties here.

2026 H2


  **Meta files its answer; discovery opens**

Expect aggressive scope fights over internal AI training documentation. The "strains credulity" framing makes broad discovery more defensible for Strike 3.

2027 H1


  **Copycat suits filed across verticals**

Music, film, and stock-media owners adopt the BitTorrent-forensics template Strike 3 validated — citing this ruling's rejection of the rogue-employee defence.

2027


  **Settlement before trial**

Consistent with Strike 3's licensing-resolution history and Meta's incentive to avoid sworn depositions about its data pipeline.

2027–2028


  **Legislative momentum on training-data transparency**

This ruling becomes Exhibit A in Congressional hearings on AI training data, accelerating bills like the NO FAKES Act.

The long-term precedent

Every future U.S. AI copyright suit will cite this ruling's rejection of the rogue-employee defence as evidence that institutional liability attaches to AI training-data decisions. The case sets a template for content owners in any vertical to pursue AI companies using BitTorrent forensic evidence — and that template is now court-validated. For teams building production systems, the lesson maps cleanly onto governance: explore our AI agent library for orchestration patterns that log data provenance, and review our multi-agent systems work for compliant pipeline design. Builders integrating MCP and tool-calling should treat data lineage as a first-class citizen — our agent automation toolkit includes provenance-logging templates that make this tractable from day one.

The defensible move in 2026: treat every dataset like it will be subpoenaed. Tag licence, source, and acquisition date at ingestion. The cost of provenance metadata is trivial; the cost of explaining a torrent hash to a federal judge is nine figures.

Frequently Asked Questions

Can a porn company sue Meta for torrenting its adult films for AI training?

Yes. On June 11, 2026, a federal judge ruled that a porn company can sue Meta for torrenting its adult films for AI training. U.S. District Judge Eumi K. Lee denied Meta's motion to dismiss, ruling that Strike 3 Holdings and Counterlife Media "have plausibly alleged" Meta is liable for direct, vicarious, and contributory copyright infringement based on torrenting their films. This is a procedural ruling, not a final verdict — but it means the lawsuit proceeds to discovery rather than being thrown out. The judge specifically rejected Meta's "personal use" / rogue-employee defence, writing that the automated download patterns made it "strain credulity" to call them individual human choices, per Mashable's reporting.

Why is Meta being sued for torrenting adult films for AI training?

Strike 3 Holdings and Counterlife Media allege that between 2018 and 2025, Meta used the BitTorrent program to illegally download more than 2,300 copyrighted pornographic films to train its AI models. IP addresses tracing to Meta's corporate offices allegedly acted in "non-human patterns" involving "mass infringement beyond what a human could consume." Because BitTorrent simultaneously uploads (seeds) files as it downloads them, the suit alleges Meta also distributed the copyrighted content back to the internet — and potentially to minors. The plaintiffs became aware of the activity through press coverage of a separate January 2025 lawsuit over Meta pirating books for AI training.

What is Strike 3 Holdings and why is it suing Meta?

Strike 3 Holdings is an adult-content holding company that owns premium porn brands including Blacked, Vixen, Tushy, and Deeper. It is one of the most litigious copyright enforcers in U.S. federal courts, historically known for suing individuals who torrented its films — a reputation that earned it a "copyright troll" label from the Los Angeles Times. It is suing Meta, alongside co-plaintiff Counterlife Media, because it alleges Meta torrented its films to build AI training data. Its extensive litigation experience and forensic toolkit for tracing BitTorrent activity give it a procedural advantage most content creators suing AI companies lack.

How much money is Strike 3 Holdings seeking from Meta in the lawsuit?

Strike 3 Holdings and Counterlife Media are seeking damages of up to $359 million. The figure derives from U.S. copyright law's statutory damages, which allow $750 to $150,000 per work for willful infringement. With more than 2,300 copyrighted films at stake and an allegation of willful (not accidental) infringement, the maximum theoretical exposure exceeds $350 million — roughly $150,000 multiplied by about 2,390 registered works. For context, that sum is about what Meta earns in 11 minutes of advertising revenue. Additional DMCA Section 1202 claims could apply if Meta altered or removed copyright management information during ingestion, potentially expanding the total damages pool.

What does this ruling mean for other AI companies that scraped content for training data?

It signals that U.S. courts will not dismiss AI training copyright claims at the pleading stage, forcing expensive discovery on defendants. Any lab that acquired bulk content via torrenting, scraping, or bulk downloading without licence faces what we call the Dirty Data Liability Gap — exposure they assumed was a technical problem. The case is especially dangerous because torrenting leaves forensic traces (IP addresses, timestamps, torrent hashes) that make denial hard. Companies using licensed datasets from providers like Getty, Shutterstock, or AP face meaningfully lower exposure. The defensible move now: maintain a provenance ledger tagging every dataset with licence, source, and acquisition date.

Did Meta admit to downloading the adult films, and what was its defence?

Meta did not admit liability. In its October 2025 motion to dismiss, Meta called the claims "nonsensical and unsupported" and argued that any torrenting of adult content was for "personal use" by individual employees acting outside their authorisation — not sanctioned corporate activity. Judge Eumi K. Lee rejected this, noting that IP addresses torrented similar files with the same name, all in one day, ranging from cartoons to porn, and that "it strains credulity to suggest that these correlations are mere coincidence." Under respondeat superior doctrine, employers can be held liable for employee actions within the scope of employment, undermining Meta's attempt to externalize blame.

Could this lawsuit lead to new laws regulating AI training data in the United States?

It is plausible. The U.S. Congress has been considering measures including the NO FAKES Act and AI training-data transparency bills, and this ruling provides concrete judicial support for legislative intervention by demonstrating that existing copyright law already reaches AI training-data acquisition. The case is likely to become Exhibit A in Congressional hearings on AI data provenance. Internationally, EU AI Act provisions on training-data transparency may be interpreted more aggressively by regulators in light of U.S. judicial findings on AI piracy. While settlement is the most probable outcome of this specific case, its precedential rejection of the rogue-employee defence will be cited in future litigation and policy debates for years.

About the Author

Rushil Shah

AI Systems Builder & Founder, Twarx

Rushil Shah is the founder of Twarx and an AI systems builder who has spent years designing autonomous workflows, multi-agent architectures, and AI-powered business tools. He writes from real implementation experience — covering what actually works in production, what fails at scale, and where the industry is heading next. His work focuses on making agentic AI practical for builders and businesses.

LinkedIn · Full Profile

This article was originally published on Twarx. Follow for daily deep dives on AI agents and automation.