Google and Harvard debut dataset with 1m public domain books for AI training
peter.suber's bookmarks 2025-01-16
Summary:
"Harvard University, in conjunction with Google, has released a dataset of a million public domain books to train the next generation of AI."