A corpus and a modular infrastructure for the empirical study of (an)notated music - PubMed
peter.suber's bookmarks 2025-04-27
Summary:
Abstract: The present corpus is the outcome of a long-term collaborative effort to produce analytically annotated music scores suitable for the computer-assisted study of European compositions since 1600. With 1283 analytically annotated, symbolically encoded music scores by 36 composers, our corpus amounts to one of the largest published resources of its kind. At the same time, it provides a modular digital infrastructure for the accountable, collaborative curation of annotated scores ("sheet music"). All annotations were created and reviewed by a team of trained music theorists, who collaborated online using the git version control software according to a formally codified workflow. To improve the consistency of analytical practices given the diversity of represented eras and genres, the corpus has been automatically parsed for notational well-formedness and cross-reviewed by annotators for adherence to our music-analytical guidelines. The computational infrastructure has been designed with "data persistence" and open access in mind.