FAIR Derived Data in TEI and Its Publication in the TextGrid Repository
Many research projects face legal restrictions on the publication of texts. In recent decades, several projects have circumvented these restrictions by both deleting some parts of the data and publishing derived data from the original files. We discuss the limitations of the commonly used ad-hoc sol...
Saved in:
| Main Authors: | , , , , , |
|---|---|
| Format: | Article |
| Language: | deu |
| Published: |
Text Encoding Initiative Consortium
2025-03-01
|
| Series: | Journal of the Text Encoding Initiative |
| Subjects: | |
| Online Access: | https://journals.openedition.org/jtei/5622 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Many research projects face legal restrictions on the publication of texts. In recent decades, several projects have circumvented these restrictions by both deleting some parts of the data and publishing derived data from the original files. We discuss the limitations of the commonly used ad-hoc solutions and the deprecation of the FAIR status that they cause. In contrast, we propose to model derived data in TEI, and present several variants with five corpora from different languages, genres, and periods. We also present the implementation of several features for publishing such data in the TextGrid Repository and the publication of derived data from a corpus of Spanish novels and a corpus of American plays. |
|---|---|
| ISSN: | 2162-5603 |