AI Companies Bypass Web Standards, Face Legal Threats Over Content Scraping

1 min read
Source: Business Insider
AI Companies Bypass Web Standards, Face Legal Threats Over Content Scraping
Photo: Business Insider
TL;DR Summary

OpenAI and Anthropic are reportedly ignoring or bypassing the robots.txt rule, which prevents automated scraping of websites, to collect data for training their AI models. Despite public claims of respecting these blocks, findings by TollBit suggest otherwise. This practice has raised concerns among media publishers and highlights the ongoing tension between AI companies' data needs and copyright protections.

Share this article

Reading Insights

Total Reads

0

Unique Readers

0

Time Saved

2 min

vs 3 min read

Condensed

87%

43758 words

Want the full story? Read the original article

Read on Business Insider