FORCE2019: Perpetual access machines: archiving web-published scholarship at scale | Jefferson Bailey | Oct. 2019

ab1630's bookmarks 2019-12-19

Summary:

October 16, 2019 presentation by Jefferson Bailey and Bryan Newbold, at Force2019.

Event summary description: "In 2018, the Internet Archive undertook a large-scale project to build as complete a collection as possible of scholarly outputs published on the web, as well as to improve the discoverability and accessibility of scholarly works archived as part of these global web harvests. This project involved a number of areas of work: targeted archiving of known OA publications (especially at-risk “long tail” publications); extraction and augmentation of bibliographic metadata and full text; integration and preservation of related identifier, registry, and aggregation services and datastores; partnerships with affiliated initiatives and joint service developments; and creation of new tools and machine learning approaches for identifying archived scholarly work in existing born-digital and web collections. The project also identified and archived associated research outputs such as blogs, datasets, code repositories and other secondary research objects. The beta API and public interface - code-named "fatcat" - can be found at https://fatcat.wiki/...."

slides: https://zenodo.org/record/3497352#.XftrGy2ZNE4

video: https://www.youtube.com/watch?v=PARqfbYIdXQ

 

Link:

https://www.infodocket.com/2019/11/02/video-perpetual-access-machines-archiving-web-published-scholarship-at-scale-force2019-conference-presentation/

From feeds:

Open Access Tracking Project (OATP) » ab1630's bookmarks

Tags:

oa.new oa.preservation oa.versions oa.ml oa.metadata oa.journals oa.repositories oa.internet_archive

Date tagged:

12/19/2019, 06:33

Date published:

12/19/2019, 01:33