In search of comity: TEI for distant reading

Any expansion of the TEI beyond its traditional user base involves a recognition that there are many differing answers to the traditional question “What is text, really?” We report on some work carried out in the context of the COST Action Distant Reading for European Literary History (CA16204), in...

Full description

Saved in:
Bibliographic Details
Main Authors: Lou Burnard, Christof Schöch, Carolin Odebrecht
Format: Article
Language:deu
Published: Text Encoding Initiative Consortium 2021-07-01
Series:Journal of the Text Encoding Initiative
Subjects:
Online Access:https://journals.openedition.org/jtei/3500
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832578470859767808
author Lou Burnard
Christof Schöch
Carolin Odebrecht
author_facet Lou Burnard
Christof Schöch
Carolin Odebrecht
author_sort Lou Burnard
collection DOAJ
description Any expansion of the TEI beyond its traditional user base involves a recognition that there are many differing answers to the traditional question “What is text, really?” We report on some work carried out in the context of the COST Action Distant Reading for European Literary History (CA16204), in particular on the TEI-conformant schemas developed for one of its principal deliverables: the European Literary Text Collection (ELTeC). The ELTeC will contain comparable corpora for each of at least a dozen European languages, each being a balanced sample of one hundred novels from the period 1840 to 1920, together with metadata concerning their production and reception. We hope that it will become a reliable basis for comparative work in data-driven textual analytics. The focus of the ELTeC encoding scheme is not to represent texts in all their original complexity, nor to duplicate the work of scholarly editors. Instead, we aim to facilitate a richer and better-informed distant reading than a transcription of lexical content alone would permit. At the same time, where the TEI encourages diversity, we enforce consistency by permitting representation of only a specific and quite small set of textual features, both structural and analytical. These constraints are expressed by a master TEI ODD, from which we derive three different schemas by ODD chaining, each associated with appropriate documentation.
format Article
id doaj-art-0318e2878dac4a9892f7378a5da632ab
institution Kabale University
issn 2162-5603
language deu
publishDate 2021-07-01
publisher Text Encoding Initiative Consortium
record_format Article
series Journal of the Text Encoding Initiative
spelling doaj-art-0318e2878dac4a9892f7378a5da632ab2025-01-30T13:56:38ZdeuText Encoding Initiative ConsortiumJournal of the Text Encoding Initiative2162-56032021-07-011410.4000/jtei.3500In search of comity: TEI for distant readingLou BurnardChristof SchöchCarolin OdebrechtAny expansion of the TEI beyond its traditional user base involves a recognition that there are many differing answers to the traditional question “What is text, really?” We report on some work carried out in the context of the COST Action Distant Reading for European Literary History (CA16204), in particular on the TEI-conformant schemas developed for one of its principal deliverables: the European Literary Text Collection (ELTeC). The ELTeC will contain comparable corpora for each of at least a dozen European languages, each being a balanced sample of one hundred novels from the period 1840 to 1920, together with metadata concerning their production and reception. We hope that it will become a reliable basis for comparative work in data-driven textual analytics. The focus of the ELTeC encoding scheme is not to represent texts in all their original complexity, nor to duplicate the work of scholarly editors. Instead, we aim to facilitate a richer and better-informed distant reading than a transcription of lexical content alone would permit. At the same time, where the TEI encourages diversity, we enforce consistency by permitting representation of only a specific and quite small set of textual features, both structural and analytical. These constraints are expressed by a master TEI ODD, from which we derive three different schemas by ODD chaining, each associated with appropriate documentation.https://journals.openedition.org/jtei/3500distant readingELTeCODD chainingcorpus designthe European novelliterary studies
spellingShingle Lou Burnard
Christof Schöch
Carolin Odebrecht
In search of comity: TEI for distant reading
Journal of the Text Encoding Initiative
distant reading
ELTeC
ODD chaining
corpus design
the European novel
literary studies
title In search of comity: TEI for distant reading
title_full In search of comity: TEI for distant reading
title_fullStr In search of comity: TEI for distant reading
title_full_unstemmed In search of comity: TEI for distant reading
title_short In search of comity: TEI for distant reading
title_sort in search of comity tei for distant reading
topic distant reading
ELTeC
ODD chaining
corpus design
the European novel
literary studies
url https://journals.openedition.org/jtei/3500
work_keys_str_mv AT louburnard insearchofcomityteifordistantreading
AT christofschoch insearchofcomityteifordistantreading
AT carolinodebrecht insearchofcomityteifordistantreading