Researchers Challenge Industry Claims on Ethical AI and Copyright

1 min read
Source: futurism.com
Researchers Challenge Industry Claims on Ethical AI and Copyright
Photo: futurism.com
TL;DR Summary

A team of researchers from MIT, Cornell, and other institutions successfully trained a large language model using only ethically-sourced, publicly licensed data, challenging the industry belief that such development is impossible without vast resources. They created the Common Pile dataset, manually curated over eight terabytes of data, and trained a seven billion-parameter AI that rivals older industry models, highlighting ethical concerns around data use and copyright in AI development.

Share this article

Reading Insights

Total Reads

0

Unique Readers

0

Time Saved

3 min

vs 4 min read

Condensed

89%

64269 words

Want the full story? Read the original article

Read on futurism.com