The Unbelievable Scale of AI’s Pirated-Books Problem - The Atlantic

peter.suber's bookmarks 2025-03-21

Summary:

"When employees at Meta started developing their flagship AI model, Llama 3, they faced a simple ethical question. The program would need to be trained on a huge amount of high-quality writing to be competitive with products such as ChatGPT, and acquiring all of that text legally could take time. Should they just pirate it instead? ..."

Link:

https://archive.is/CY6yl

From feeds:

Open Access Tracking Project (OATP) » peter.suber's bookmarks

Tags:

oa.new oa.meta oa.ai oa.books oa.mining oa.copyright oa.ethics oa.libgen oa.paywalled oa.search oa.fair_use oa.guerrilla

Date tagged:

03/21/2025, 09:35

Date published:

03/21/2025, 05:34