The Unbelievable Scale of AI’s Pirated-Books Problem - The Atlantic
peter.suber's bookmarks 2025-03-21
Summary:
"When employees at Meta started developing their flagship AI model, Llama 3, they faced a simple ethical question. The program would need to be trained on a huge amount of high-quality writing to be competitive with products such as ChatGPT, and acquiring all of that text legally could take time. Should they just pirate it instead? ..."