Reddit Lawsuit Accuses Perplexity, Other AI Firms, of Stealing Data

Reddit filed a lawsuit against Perplexity and three data-scraping companies, accusing them of stealing the social media platform’s valuable data.

Reddit’s lawsuit, filed on Wednesday in Manhattan federal court, said Perplexity and the three other firms it sued — Oxylabs UAB, AWM Proxy, and SerpApi — illegally circumvented Reddit’s digital guardrails by scraping its content through Google’s search engine results.

“These Defendants are similar to would-be bank robbers, who, knowing they cannot get into the bank vault, break into the armored truck carrying the cash instead,” Reddit’s lawsuit alleges.

Reddit said it sent a cease-and-desist letter to Perplexity in May 2024 demanding it stop scraping Reddit data unless it made a deal with the social media company, as Google and OpenAI had done.

Perplexity said it “was not using Reddit content to train any AI models and that it would respect Reddit’s robots.txt,” according to the lawsuit.

But Perplexity’s citations to Reddit increased “forty-fold after Reddit told it to stop,” the lawsuit added.

“Rather than respect Reddit and its users’ rights, what Perplexity has done in response is simply come up with increasingly devious schemes to circumvent Reddit’s security systems and policies,” the lawsuit says.

According to the lawsuit, Perplexity appears to have used at least one of the data scrapers to ingest the platform’s data into its AI models.

“In other words, Perplexity’s business model is effectively to take Reddit’s content from Google search results, feed them into a third party’s LLM, and call it a new product,” the lawsuit says. “While that business model has somehow translated into a $20 billion valuation, it has not resulted in a willingness to pay for what others (including Google) have.”

Perplexity spokesperson Jesse Dwyer said the company “will always fight vigorously for users’ rights to freely and fairly access public knowledge.”

“Our approach remains principled and responsible as we provide factual answers with accurate AI, and we will not tolerate threats against openness and the public interest,” Dwyer said.

A SerpApi representative said the company disagrees with Reddit’s allegations and plans to “vigorously” defend itself in court.

Oxylabs did not immediately respond to a request for comment from Business Insider. AWM Proxy, identified in the lawsuit as a former Russian botnet, could not immediately be reached for comment.

A Reddit spokesperson confirmed to Business Insider that the company has spent tens of millions of dollars on anti-scraping systems, which the lawsuit says these companies circumvented.

The lawsuit said Reddit caught Perplexity bypassing its guardrails by setting up a test post that acted as a digital “marked bill.”

The test post could only be viewed by Google’s search engine, the lawsuit said, so Perplexity and other AI companies should not have been able to use it for their models.

The contents of the post soon appeared in Perplexity’s answers, indicating that Perplexity or a data scraper it worked with had taken the content without permission.

“Within hours, queries to Perplexity’s ‘answer engine’ produced the contents of that test post,” Reddit’s lawsuit says.

Reddit’s lawsuit quotes a social media post from Cloudflare’s CEO comparing Perplexity to “North Korean hackers” for appearing to try to hide its web-crawling activity.

“Some supposedly ‘reputable’ AI companies act more like North Korean hackers,” Matthew Prince wrote on X in August. “Time to name, shame, and hard block them.”

In a statement to Business Insider, Reddit’s chief legal officer Ben Lee said Oxylabs UAB, AWM Proxy, and SerpApi were “textbook examples” of illegal scrapers.

“Scrapers bypass technological protections to steal data, then sell it to clients hungry for training material,” he said. “Reddit is a prime target because it’s one of the largest and most dynamic collections of human conversation ever created.”

Reddit launched in 2005 as an online discussion forum, but is now trying to add value through a new strategy: search traffic. The decision has put Reddit in competition with companies like Perplexity.

“Reddit is one of the few platforms positioned to become a true search destination. We offer something special: a breadth of conversations and knowledge you can’t find anywhere else,” the company said in its Q2 report in July. “Every week, hundreds of millions of people come to Reddit looking for advice, and we’re turning more of that intent into active users of Reddit’s native search.”

Online search has become a profitable industry led by companies like Google, which announced an expanded partnership with Reddit in February 2024 to train its AI models on the platform’s content. In return, Reddit gained access to Google’s Vertex AI, allowing the platform to add enhanced search and other features. One month later, Reddit went public at a $6.4 billion valuation.


