Converting raw transcripts into an annotated and turn-aligned TEI-XML corpus: the example of the Corpus of Serbian Forms of Address
This paper describes the procedure for building a TEI-XML corpus of spoken Serbian from raw transcripts. The corpus consists of semi-structured interviews gathered to investigate forms of address in Serbian. The interviews were thoroughly transcribed according to G...
| Main Author: | Dolores Lemmenmeier-Batinić |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | University of Ljubljana Press (Založba Univerze v Ljubljani), 2021-07-01 |
| Series: | Slovenščina 2.0: Empirične, aplikativne in interdisciplinarne raziskave |
| Subjects: | |
| Online Access: | https://journals.uni-lj.si/slovenscina2/article/view/9869 |
Similar Items
- Introduction : Grammaticalité et annotations de corpus d’anglais oral – perspectives et problèmes
  by: Sylvie Hancil
  Published: (2018-07-01)
- IT-SR-NER: SERBIAN-ITALIAN PARALLEL CORPUS FOR LEARNING SERBIAN AS A FOREIGN LANGUAGE
  by: Olja Perišić, et al.
  Published: (2025-06-01)
- VERBAL COLLOCATIONS IN THE SPOKEN ALBANIAN CORPUS IN KOSOVO
  by: Adelajda Baftiu, et al.
  Published: (2023-11-01)
- Tradurre un corpus di memoria storico-biografica. Strategie di semplificazione e adattamento per il pubblico italiano
  by: Lucia Cinato
  Published: (2025-08-01)
- A TEI-based Approach to Standardising Spoken Language Transcription
  by: Thomas Schmidt
  Published: (2011-06-01)