1. Website Planet
  2. >
  3. News
  4. >
  5. Perplexity AI Accused of Scraping Explicitly Blocked Websites
Perplexity AI Accused of Scraping Explicitly Blocked Websites

Perplexity AI Accused of Scraping Explicitly Blocked Websites

Headshot of Andrés Gánem Written by:
Headshot of Maggy Di Costanzo Reviewed by: Maggy Di Costanzo
Last updated: August 15, 2025
AI-powered search startup Perplexity AI is allegedly bypassing restrictions set by websites to stop AI agents from scraping their content, according to an August 4th report by internet infrastructure provider Cloudflare. The provider has since delisted Perplexity’s crawlers as verified bots.

According to Cloudflare, the company originally became aware of the issue when several customers reported encountering crawling activity by Perplexity’s bots even after explicitly including rules to block the AI company.

Crawlers, as the name suggests, are bots designed to “crawl” websites in search of specific content or information. The use of non-AI crawlers, for example, is fundamental for sites to get indexed by Google or other popular search engines. Sites also have files with explicit instructions regarding what information crawlers may access.

To test these claims, researchers at Cloudflare created a series of new domains, making sure they were in no way publicly accessible or indexed by any search engines. The researchers then included explicit instructions in the websites’ code to block bots from accessing the website in any way. They then asked Perplexity AI to fetch them data from each of the newly created domains.

The report found that Perplexity managed to access key information about the websites regardless, and further analysis suggested that Perplexity engaged in “stealth” practices to bypass the restrictions.

“We observed that Perplexity uses not only their declared user-agent, but also a generic browser intended to impersonate Google Chrome on macOS when their declared crawler was blocked,” reads the report.

As a response, Cloudflare has now implemented rules to block all known Perplexity crawlers at the infrastructure level.

“This controversy reveals that Cloudflare’s systems are fundamentally inadequate for distinguishing between legitimate AI assistants and actual threats. If you can’t tell a helpful digital assistant from a malicious scraper, then you probably shouldn’t be making decisions about what constitutes legitimate web traffic,” reads an in-depth response from Perplexity’s official blog.

This isn’t the first time Perplexity has been accused of unethically circumventing restrictions on scraping. Last year, the startup found itself amidst controversy after news outlets accused it of plagiarising would-be protected content.

Despite the controversy, the AI startup is rapidly gaining popularity from both the public and private investors. Last month, reports surfaced that Apple is considering acquiring Perplexity AI.

Senior Writer:
Rate this Article
4.3 Voted by 3 users
You already voted! Undo
This field is required Maximal length of comment is equal 80000 chars Minimal length of comment is equal 10 chars
Any comments?
Reply
View %s replies
View %s reply
More news
Show more
We check all user comments within 48 hours to make sure they are from real people like you. We're glad you found this article useful - we would appreciate it if you let more people know about it.
Popup final window
Share this blog post with friends and co-workers right now:
1 1 1

We check all comments within 48 hours to make sure they're from real users like you. In the meantime, you can share your comment with others to let more people know what you think.

Once a month you will receive interesting, insightful tips, tricks, and advice to improve your website performance and reach your digital marketing goals!

So happy you liked it!

Share it with your friends!

1 < 1 1

Or review us on 1

3717351
50
5000
143203015