Reddit Lawsuit Accuses Perplexity, Other AI Firms, of Stealing Data

Reddit filed a lawsuit against Perplexity, along with several other data mining companies, accusing them of stealing the social media platform’s valuable data.

Reddit’s lawsuit, filed on Wednesday in Manhattan federal court, said Perplexity and the three other firms it sued — Oxylabs UAB, AWM Proxy, and SerpApi — illegally circumvented Reddit’s digital guardrails by scraping its content through Google’s search engine results.

“These Defendants are similar to would-be bank robbers, who, knowing they cannot get into the bank vault, break into the armored truck carrying the cash instead,” Reddit’s lawsuit alleges.

Reddit said it sent a cease-and-desist letter to Perplexity in May 2024 demanding it stop scraping Reddit data unless it made a deal with the social media company, as Google and OpenAI had done.

Perplexity said it “was not using Reddit content to train any AI models and that it would respect Reddit’s robots.txt,” according to the lawsuit.

But Perplexity’s citations to Reddit increased “forty-fold after Reddit told it to stop,” the lawsuit added.

“Rather than respect Reddit and its users’ rights, what Perplexity has done in response is simply come up with increasingly devious schemes to circumvent Reddit’s security systems and policies,” the lawsuit says.

According to the lawsuit, Perplexity appears to have used at least one of the data scrapers to ingest the platform’s data into its AI models.

“In other words, Perplexity’s business model is effectively to take Reddit’s content from Google search results, feed them into a third party’s LLM, and call it a new product,” the lawsuit says. “While that business model has somehow translated into a $20 billion valuation, it has not resulted in a willingness to pay for what others (including Google) have.”

Perplexity spokesperson Jesse Dwyer said the company “will always fight vigorously for users’ rights to freely and fairly access public knowledge.”

“Our approach remains principled and responsible as we provide factual answers with accurate AI, and we will not tolerate threats against openness and the public interest,” Dwyer said.

A SerpApi representative said the company disagrees with Reddit’s allegations and plans to “vigorously” defend itself in court.

Oxylabs did not immediately respond to a request for comment by Business Insider. AWMProxy, identified in the lawsuit as a former Russian botnet, could not immediately be reached for comment.

A Reddit spokesperson confirmed to Business Insider that the company has spent tens of millions of dollars on anti-scraping systems, which the lawsuit says these companies circumvented.

The lawsuit said Reddit caught Perplexity bypassing its guardrails by setting up a test post that acted as a digital “marked bill.”

The test post could only be viewed by Google’s search engine, the lawsuit said, so Perplexity and other AI companies should not have been able to use it for their models.

The contents of the post soon appeared in Perplexity, indicating that it or another data scraper it worked with had taken the content without permission.

“Within hours, queries to Perplexity’s ‘answer engine’ produced the contents of that test post,” Reddit’s lawsuit says.

Reddit’s lawsuit quotes a social media post from Cloudflare’s CEO comparing Perplexity to “North Korean hackers” for appearing to try to hide its web-crawling activity.

“Some supposedly ‘reputable’ AI companies act more like North Korean hackers,” Matthew Prince wrote on X in August. “Time to name, shame, and hard block them.”

In a statement to Business Insider, Reddit’s chief legal officer Ben Lee said Oxylabs UAB, AWM Proxy, and SerpApi were “textbook examples” of illegal scrapers.

“Scrapers bypass technological protections to steal data, then sell it to clients hungry for training material,” he said. “Reddit is a prime target because it’s one of the largest and most dynamic collections of human conversation ever created.”

Reddit launched in 2005 as an online discussion forum, but is now trying to add value through a new strategy: search traffic. The decision has put Reddit in competition with companies like Perplexity.

“Reddit is one of the few platforms positioned to become a true search destination. We offer something special: a breadth of conversations and knowledge you can’t find anywhere else,” the company said in its Q2 report in July. “Every week, hundreds of millions of people come to Reddit looking for advice, and we’re turning more of that intent into active users of Reddit’s native search.”

Online search traffic has become a profitable industry led by companies like Google, which announced an expanded partnership with Reddit in March 2024 to train its AI models on the platform’s content. On its end, Reddit gained access to Google’s Vertex AI, allowing the platform to add enhanced search and other features. One month later, Reddit went public with a $6.4 billion valuation.



Source link

Visited 1 times, 1 visit(s) today

Related Article

Judge gives Trump admin a basic sex ed lesson while rebuking its planned funding cuts

Judge gives Trump admin a basic sex ed lesson while rebuking its planned funding cuts

A federal judge said Monday that she’ll likely block the Trump administration’s efforts to withhold billions of dollars in federal funding for sexual health education grants. The revelation came as the judge reportedly reprimanded the administration and gave it a basic sex ed lesson from the bench. I’ve written previously on the administration’s bigoted rationale

Trump's past demand that the DOJ pay him $230 million is part of a bigger plan

Trump’s past demand that the DOJ pay him $230 million is part of a bigger plan

I thought there was little President Donald Trump could do to surprise me. I was wrong. The New York Times reported Tuesday that “President Trump is demanding that the Justice Department pay him about $230 million in compensation for the federal investigations into him.” The two claims were filed in 2023 and 2024 under the

Dr. Daniel Mayer speaks Wednesday, Oct. 22, 2025 at Memorial Hospital West in Pembroke Pines. (joe Cavaretta/South Florida Sun Sentinel)

Therapy dog awakens woman from coma at Pembroke Pines hospital

Priscilla Timmons of Cooper City had been in a coma at Memorial Hospital West for more than 24 hours when something wonderfully odd occurred. She felt Scrunchie, a Golden Retriever, nuzzle her finger, and reached out to the therapy dog who had been brought to her bedside in the intensive care unit at Memorial West

Twin-engined fighters of the second world war

Twin-engined fighters of the second world war

Why would anyone opt for a twin-engine setup for a fighter? Well, sometimes a bigger aircraft was required for greater range, or armament or a second crewmember to navigate or operate a radar. Here we choose the ten best of this exciting class of aeroplanes, assessing both their performance and their importance in World War

Should You Invest In Crypto Income ETFs? The Shocking Truth

Should You Invest In Crypto Income ETFs? The Shocking Truth

The first wave of crypto ETFs allowed investors to onboard crypto assets into traditional brokerage accounts – and tax-advantaged retirement accounts. Given the long-term return potential of cryptocurrencies, that’s a win-win. But cryptos are still volatile. Last week’s $19 billion leveraged wipeout in bitcoin surpassed the wipeout at the Covid bottom in March 2020. And