SU Library News & Events » Blog Archive » HathiTrust Research Center releases Extracted Features Dataset

lterrat's bookmarks 2016-12-13

Summary:

"HathiTrust has announced the release of a significantly expanded open dataset, HathiTrust Research Center (HTRC) Extracted Features (EF) Dataset, Version 1.0. This dataset provides researchers with open access to data from the full text of the HathiTrust Digital Library (HTDL), representing 13.7 million volumes, over 5 billion pages, and consisting of over 2 trillion tokens (words). Syracuse University Libraries hold an institutional membership in the HathiTrust partnership."

Link:

http://libnews.syr.edu/hathitrust-research-center-releases-extracted-features-dataset/

From feeds:

Open Access Tracking Project (OATP) » lterrat's bookmarks

Tags:

oa.repositories

Date tagged:

12/13/2016, 23:04

Date published:

12/13/2016, 18:04