Google Harnesses Public Web Data to Train AI Models

July 5, 2023 at 03:11 PM


1 min read

Photo: The Verge

TL;DR Summary

Google has updated its privacy policy to disclose that its AI services may be trained on publicly available data scraped from the web. The policy now names services such as Bard and Cloud AI among those that may use this public data. However, it does not say how copyrighted material will be kept out of the data pool. The approach raises questions about compliance with regulations such as GDPR and about the limits of the fair use doctrine. AI companies' use of scraped data has already prompted lawsuits and calls for stricter regulation. The working conditions of the people who sort through the vast amounts of training data have also come under scrutiny.



