Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account

This paper introduces an integrative and comprehensive method for the linguistic annotation of parliamentary discourse. Initially conceived as documentation for a specific and small-scale research project, the annotation scheme takes into account national specificities and is geared to proposing an...

Full description

Saved in:
Bibliographic Details
Main Authors: Naomi Truan, Laurent Romary
Format: Article
Language:deu
Published: Text Encoding Initiative Consortium 2022-06-01
Series:Journal of the Text Encoding Initiative
Subjects:
Online Access:https://journals.openedition.org/jtei/4164
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832578466337259520
author Naomi Truan
Laurent Romary
author_facet Naomi Truan
Laurent Romary
author_sort Naomi Truan
collection DOAJ
description This paper introduces an integrative and comprehensive method for the linguistic annotation of parliamentary discourse. Initially conceived as documentation for a specific and small-scale research project, the annotation scheme takes into account national specificities and is geared to proposing an annotation scheme that is both highly standardized and adaptable to other research contexts. In this paper we present a specific application of the Text Encoding Initiative (TEI) framework applied to a subset of official transcripts of plenary proceedings in three parliamentary cultures. The TEI annotation scheme proposed here has two main applications: first, it serves as a basis for encoding parliamentary corpora by providing a systematic way of annotating both elements within the text (e.g., turns, incidents, and interruptions) and the metadata associated with it (e.g., variables pertaining to the speaker or the speech event); second, it provides a cross-linguistic empirical basis for further annotation projects.
format Article
id doaj-art-9d70f0b145804f4fbe405d7a0b45542d
institution Kabale University
issn 2162-5603
language deu
publishDate 2022-06-01
publisher Text Encoding Initiative Consortium
record_format Article
series Journal of the Text Encoding Initiative
spelling doaj-art-9d70f0b145804f4fbe405d7a0b45542d2025-01-30T13:56:41ZdeuText Encoding Initiative ConsortiumJournal of the Text Encoding Initiative2162-56032022-06-011410.4000/jtei.4164Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic AccountNaomi TruanLaurent RomaryThis paper introduces an integrative and comprehensive method for the linguistic annotation of parliamentary discourse. Initially conceived as documentation for a specific and small-scale research project, the annotation scheme takes into account national specificities and is geared to proposing an annotation scheme that is both highly standardized and adaptable to other research contexts. In this paper we present a specific application of the Text Encoding Initiative (TEI) framework applied to a subset of official transcripts of plenary proceedings in three parliamentary cultures. The TEI annotation scheme proposed here has two main applications: first, it serves as a basis for encoding parliamentary corpora by providing a systematic way of annotating both elements within the text (e.g., turns, incidents, and interruptions) and the metadata associated with it (e.g., variables pertaining to the speaker or the speech event); second, it provides a cross-linguistic empirical basis for further annotation projects.https://journals.openedition.org/jtei/4164annotationcontrastive linguisticsparliamentary debates
spellingShingle Naomi Truan
Laurent Romary
Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account
Journal of the Text Encoding Initiative
annotation
contrastive linguistics
parliamentary debates
title Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account
title_full Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account
title_fullStr Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account
title_full_unstemmed Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account
title_short Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account
title_sort building encoding and annotating a corpus of parliamentary debates in tei xml a cross linguistic account
topic annotation
contrastive linguistics
parliamentary debates
url https://journals.openedition.org/jtei/4164
work_keys_str_mv AT naomitruan buildingencodingandannotatingacorpusofparliamentarydebatesinteixmlacrosslinguisticaccount
AT laurentromary buildingencodingandannotatingacorpusofparliamentarydebatesinteixmlacrosslinguisticaccount