Now you can block OpenAI’s web crawler

By

Aug 7, 2023

Image: OpenAI

OpenAI now lets you block its web crawler from scraping your site to help train GPT models.

In a blog post, OpenAI said website operators can specifically disallow its GPTBot crawler on their site’s Robots.txt file or block its IP address. “Web pages crawled with the GPTBot user agent may potentially be used to improve future models and are filtered to remove sources that require paywall access, are known to gather personally identifiable information (PII), or have text that violates our policies,” OpenAI said in the blog post. For sources that don’t fit the excluded criteria, “allowing GPTBot to access your site can help AI models become more accurate and improve their general capabilities and safety.”

Blocking the GPTBot may be the…

By

EPGN Tech The Verge

Now you can block OpenAI’s web crawler

By

By

Related Post

60 million households watched the Tyson vs. Paul fight on Netflix

The FTC says spam call complaints are way down since 2021

Narwal’s Freo X Ultra, the best mopping robot available, is on sale for a new low price

Leave a Reply Cancel reply

You missed

60 million households watched the Tyson vs. Paul fight on Netflix

The FTC says spam call complaints are way down since 2021

Narwal’s Freo X Ultra, the best mopping robot available, is on sale for a new low price

Activists’ Alternative to COP29 Brings Frontline Communities Together