Ancient OCR: Storing, Cataloguing, Relating, and Exposing OCR Objects from the Open Philology Project - EUDAT
peter.suber's bookmarks 2016-10-09
Summary:
"The Open Philology Project at the University of Leipzig has developed a modular, multi-threaded OCR pipeline to reach our goal of digitizing 100,000 books in the next three years. This pilot project gives us a way to store, catalogue, and expose the results of this pipeline, from original image to final OCR results. The users of the EUDAT system will be at the University of Leipzig and Tufts University (USA). The users of the data would be the same as those of the Perseus Digital Library, i.e., researchers and students in classical languages worldwide...."