CNN has officially filed a lawsuit against Perplexity in New York. The media giant accuses the AI startup of generating verbatim copies of its journalism. This legal move marks a significant escalation in the ongoing conflict between traditional publishers and generative AI companies.
According to the complaint, Perplexity provides users with access to material behind CNN's paywall. This suggests the AI engine is not just summarizing but reproducing protected content directly. The allegation implies a bypass of standard subscription barriers that fund journalistic operations.
The lawsuit highlights a critical detail about data scraping ethics. CNN claims it repeatedly attempted to block Perplexity's crawlers from accessing its site. Perplexity allegedly ignored these technical barriers and continued harvesting data anyway. This disregard for robots.txt protocols strengthens CNN's argument regarding unauthorized use.
As the original outlet reported, the core issue is about who pays for the content AI tools consume. CNN emphasizes that human beings report and create the material Perplexity takes without permission. The complaint argues there is no compensation for this extensive use of intellectual property.
Perplexity’s business model relies on its AI answer engine and recently launched Comet browser. These products function by pulling information from across the web to answer queries. The alleged copying stems directly from this mechanism of aggregating and presenting web data.
This case reflects a broader tension in the AI industry regarding content licensing. Other publishers have launched similar legal challenges against AI firms. They argue these companies monetize journalistic work without paying for it. The outcome could set a precedent for how AI answer engines operate legally.
What this means for you: If you build or use AI tools, assume that unlicensed scraping of paywalled or copyrighted material carries legal risk. You should prioritize using APIs from licensed partners or clearly summarize public domain data. Try this prompt with your AI assistant to ensure ethical usage: "Summarize the key legal risks of using copyrighted content in AI training data without a license, focusing on recent lawsuits like CNN vs Perplexity."