Harvard Is Releasing a Massive Free AI Training Dataset Funded by OpenAI and Microsoft
beSpacific 2024-12-13
Summary:
Wired – “The project’s leader says that allowing everyone to access the collection of public-domain books will help ‘level the playing field’ in the AI industry. Harvard University announced Thursday it’s releasing a high-quality dataset of nearly 1 million public-domain books that could be used by anyone to train large language models and other AI […]