A corpus and a modular infrastructure for the empirical study of (an)notated music