Text mine millions of research papers with the CORE dataset - OpenMinTeD
lterrat's bookmarks 2016-11-27
" CORE is an aggregation service that harvests open access journals and repositories,institutional and disciplinary, from around the world. It offers one of the largest collections of scientific content via its Datasets, ready to be text-mined. We encourage everyone to use it as part of OpenMinTeD and beyond.
The current version of the dataset was released last October and contains 24 million metadata records and 4 million full-text records of research articles. Comparing to the past years, the amount of data in our dataset has massively increased and our collection has doubled since the previous dataset release in September 2015. CORE is a great Open Accesssupporter and with its service it aims to provide content that can be text-mined mainly for research purposes. Our dataset collection dates back to April 2013."