IFLA -- A free Workshop on Text Mining and the HathiTrust Research Center at the IFLA in Kuala Lampur, Malaysia.
lkfitz's bookmarks 2018-06-13
Summary:
"Here are some of the exciting things you can expect to learn and become familiar with during this session:
- Building a corpus of texts in a HTRC Workset, and using it to conduct text analysis on your collection of works;
- Gathering data through web scraping;
- Cleaning data, dirty OCR, and clean OCR;
- Using Python for text mining;
- Topic modeling and other approaches for text analysis."