Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account
This paper introduces an integrative and comprehensive method for the linguistic annotation of parliamentary discourse. Initially conceived as documentation for a specific and small-scale research project, the annotation scheme takes into account national specificities and is geared to proposing an...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | deu |
Published: |
Text Encoding Initiative Consortium
2022-06-01
|
Series: | Journal of the Text Encoding Initiative |
Subjects: | |
Online Access: | https://journals.openedition.org/jtei/4164 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832578466337259520 |
---|---|
author | Naomi Truan Laurent Romary |
author_facet | Naomi Truan Laurent Romary |
author_sort | Naomi Truan |
collection | DOAJ |
description | This paper introduces an integrative and comprehensive method for the linguistic annotation of parliamentary discourse. Initially conceived as documentation for a specific and small-scale research project, the annotation scheme takes into account national specificities and is geared to proposing an annotation scheme that is both highly standardized and adaptable to other research contexts. In this paper we present a specific application of the Text Encoding Initiative (TEI) framework applied to a subset of official transcripts of plenary proceedings in three parliamentary cultures. The TEI annotation scheme proposed here has two main applications: first, it serves as a basis for encoding parliamentary corpora by providing a systematic way of annotating both elements within the text (e.g., turns, incidents, and interruptions) and the metadata associated with it (e.g., variables pertaining to the speaker or the speech event); second, it provides a cross-linguistic empirical basis for further annotation projects. |
format | Article |
id | doaj-art-9d70f0b145804f4fbe405d7a0b45542d |
institution | Kabale University |
issn | 2162-5603 |
language | deu |
publishDate | 2022-06-01 |
publisher | Text Encoding Initiative Consortium |
record_format | Article |
series | Journal of the Text Encoding Initiative |
spelling | doaj-art-9d70f0b145804f4fbe405d7a0b45542d2025-01-30T13:56:41ZdeuText Encoding Initiative ConsortiumJournal of the Text Encoding Initiative2162-56032022-06-011410.4000/jtei.4164Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic AccountNaomi TruanLaurent RomaryThis paper introduces an integrative and comprehensive method for the linguistic annotation of parliamentary discourse. Initially conceived as documentation for a specific and small-scale research project, the annotation scheme takes into account national specificities and is geared to proposing an annotation scheme that is both highly standardized and adaptable to other research contexts. In this paper we present a specific application of the Text Encoding Initiative (TEI) framework applied to a subset of official transcripts of plenary proceedings in three parliamentary cultures. The TEI annotation scheme proposed here has two main applications: first, it serves as a basis for encoding parliamentary corpora by providing a systematic way of annotating both elements within the text (e.g., turns, incidents, and interruptions) and the metadata associated with it (e.g., variables pertaining to the speaker or the speech event); second, it provides a cross-linguistic empirical basis for further annotation projects.https://journals.openedition.org/jtei/4164annotationcontrastive linguisticsparliamentary debates |
spellingShingle | Naomi Truan Laurent Romary Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account Journal of the Text Encoding Initiative annotation contrastive linguistics parliamentary debates |
title | Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account |
title_full | Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account |
title_fullStr | Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account |
title_full_unstemmed | Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account |
title_short | Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account |
title_sort | building encoding and annotating a corpus of parliamentary debates in tei xml a cross linguistic account |
topic | annotation contrastive linguistics parliamentary debates |
url | https://journals.openedition.org/jtei/4164 |
work_keys_str_mv | AT naomitruan buildingencodingandannotatingacorpusofparliamentarydebatesinteixmlacrosslinguisticaccount AT laurentromary buildingencodingandannotatingacorpusofparliamentarydebatesinteixmlacrosslinguisticaccount |