AI Companies Accused of Ignoring Web Standards and Copyright Laws

TL;DR Summary
Several AI companies are reportedly ignoring the Robots Exclusion Protocol (robots.txt) to scrape content from websites without permission, leading to disputes with publishers. TollBit, a content licensing startup, has highlighted widespread non-compliance, with AI firms using data for training without authorization. This has resulted in legal actions and negotiations for licensing deals, as the debate over the legality and value of using content to train generative AI continues.
- Several AI companies said to be ignoring robots dot txt exclusion, scraping content without permission: report Tom's Hardware
- Exclusive: Multiple AI companies bypassing web standard to scrape publisher sites, licensing firm says Reuters
- Forbes letter threatens legal action against Perplexity AI over copyright Axios
- Wired: AI startup Perplexity is 'BS machine' CNBC
- Perplexity AI Results Include Plagiarism and Made-Up Content, Reports Say CNET
Reading Insights
Total Reads
0
Unique Readers
0
Time Saved
2 min
vs 3 min read
Condensed
86%
486 → 68 words
Want the full story? Read the original article
Read on Tom's Hardware