IFLA -- A free Workshop on Text Mining and the HathiTrust Research Center at the IFLA in Kuala Lampur, Malaysia.

lkfitz's bookmarks 2018-06-13

Summary:

"Here are some of the exciting things you can expect to learn and become familiar with during this session:

  • Building a corpus of texts in a HTRC Workset, and using it to conduct text analysis on your collection of works;
  • Gathering data through web scraping;
  • Cleaning data, dirty OCR, and clean OCR;
  • Using Python for text mining;
  • Topic modeling and other approaches for text analysis."

Link:

https://www.ifla.org/node/57616

From feeds:

Open Access Tracking Project (OATP) ยป lkfitz's bookmarks

Tags:

oa.new oa.hathi oa.mining oa.malaysia oa.data oa.rdm oa.code4oa oa.events

Date tagged:

06/13/2018, 11:09

Date published:

06/13/2018, 07:09