Training Generative AI Models on Copyrighted Works Is Fair Use

peter.suber's bookmarks 2024-01-24

Summary:

"OpenAI cites the Library Copyright Alliance (LCA) position that “based on well-established precedent, the ingestion of copyrighted works to create large language models or other AI training databases generally is a fair use.” LCA explained this position in our submission to the US Copyright Office notice of inquiry on copyright and AI, and in the LCA Principles for Copyright and AI.

 

LCA is not involved in any of the AI lawsuits. But as champions of fair use, free speech, and freedom of information, libraries have a stake in maintaining the balance of copyright law so that it is not used to block or restrict access to information. We drafted the principles on AI and copyright in response to efforts to amend copyright law to require licensing schemes for generative AI that could stunt the development of this technology, and undermine its utility to researchers, students, creators, and the public. The LCA principles hold that copyright law as applied and interpreted by the Copyright Office and the courts is flexible and robust enough to address issues of copyright and AI without amendment. The LCA principles also make the careful and critical distinction between input to train an LLM, and output—which could potentially be infringing if it is substantially similar to an original expressive work...."

Link:

https://www.arl.org/blog/training-generative-ai-models-on-copyrighted-works-is-fair-use/

From feeds:

Open Access Tracking Project (OATP) » peter.suber's bookmarks
Fair Use Tracker » ARL Policy Notes

Tags:

oa.new oa.ai oa.copyright oa.fair_use oa.usa oa.lca oa.libraries

Authors:

Katherine Klosek, Director of Information Policy and Federal Relations, Association of Research Libraries (ARL), and Marjory S. Blumenthal, Senior Policy Fellow, American Library Association (ALA) Office of Public Policy and Advocacy

Date tagged:

01/24/2024, 02:44

Date published:

01/23/2024, 04:44