"Tech Giants' Unconventional Data Harvesting for A.I. Training"

1 min read
Source: 9to5Google
"Tech Giants' Unconventional Data Harvesting for A.I. Training"
Photo: 9to5Google
TL;DR Summary

OpenAI used over a million hours of YouTube video transcripts to train its GPT-4 AI model, despite YouTube's rules against unauthorized scraping or downloading of content. Google has also used similar methods to train its AI models. As the demand for training data increases and existing data sources are depleted, companies are resorting to aggressive means to capture new data for training more advanced AI models.

Share this article

Reading Insights

Total Reads

0

Unique Readers

1

Time Saved

2 min

vs 3 min read

Condensed

84%

42066 words

Want the full story? Read the original article

Read on 9to5Google