Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora

Academic dictionary writing is making greater and greater use of the TEI Guidelines’ dictionary module. And as increasing numbers of TEI dictionaries become available, there is an ever more palpable need to work towards greater interoperability among dictionary writing systems and other language res...

Full description

Saved in:
Bibliographic Details
Main Authors: Karlheinz Mörth, Laurent Romary, Gerhard Budin, Daniel Schopper
Format: Article
Language:deu
Published: Text Encoding Initiative Consortium 2015-12-01
Series:Journal of the Text Encoding Initiative
Subjects:
Online Access:https://journals.openedition.org/jtei/1356
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832578492881960960
author Karlheinz Mörth
Laurent Romary
Gerhard Budin
Daniel Schopper
author_facet Karlheinz Mörth
Laurent Romary
Gerhard Budin
Daniel Schopper
author_sort Karlheinz Mörth
collection DOAJ
description Academic dictionary writing is making greater and greater use of the TEI Guidelines’ dictionary module. And as increasing numbers of TEI dictionaries become available, there is an ever more palpable need to work towards greater interoperability among dictionary writing systems and other language resources that are needed by dictionaries and dictionary tools. In particular this holds true for the crucial role that statistical data obtained from language resources play in lexicographic workflow—a role that also has to be reflected in the model of the data produced in these workflows. Presenting a range of current projects, the authors address two main questions in this area: How can the relationship between a dictionary and other language resources be conceptualized, irrespective of whether they are used in the production of the dictionary or to enrich existing lexicographic data? And how can this be documented using the TEI Guidelines? Discussing a variety of options, this paper proposes a customization of the TEI dictionary module that tries to respond to the emerging requirements in an environment of increasingly intertwined language resources.
format Article
id doaj-art-663c0dd436f84f66b77b6798b05c76a8
institution Kabale University
issn 2162-5603
language deu
publishDate 2015-12-01
publisher Text Encoding Initiative Consortium
record_format Article
series Journal of the Text Encoding Initiative
spelling doaj-art-663c0dd436f84f66b77b6798b05c76a82025-01-30T13:56:25ZdeuText Encoding Initiative ConsortiumJournal of the Text Encoding Initiative2162-56032015-12-01810.4000/jtei.1356Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and CorporaKarlheinz MörthLaurent RomaryGerhard BudinDaniel SchopperAcademic dictionary writing is making greater and greater use of the TEI Guidelines’ dictionary module. And as increasing numbers of TEI dictionaries become available, there is an ever more palpable need to work towards greater interoperability among dictionary writing systems and other language resources that are needed by dictionaries and dictionary tools. In particular this holds true for the crucial role that statistical data obtained from language resources play in lexicographic workflow—a role that also has to be reflected in the model of the data produced in these workflows. Presenting a range of current projects, the authors address two main questions in this area: How can the relationship between a dictionary and other language resources be conceptualized, irrespective of whether they are used in the production of the dictionary or to enrich existing lexicographic data? And how can this be documented using the TEI Guidelines? Discussing a variety of options, this paper proposes a customization of the TEI dictionary module that tries to respond to the emerging requirements in an environment of increasingly intertwined language resources.https://journals.openedition.org/jtei/1356lexicographylanguage resourcesdigital corporastatistics
spellingShingle Karlheinz Mörth
Laurent Romary
Gerhard Budin
Daniel Schopper
Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
Journal of the Text Encoding Initiative
lexicography
language resources
digital corpora
statistics
title Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
title_full Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
title_fullStr Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
title_full_unstemmed Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
title_short Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
title_sort modeling frequency data methodological considerations on the relationship between dictionaries and corpora
topic lexicography
language resources
digital corpora
statistics
url https://journals.openedition.org/jtei/1356
work_keys_str_mv AT karlheinzmorth modelingfrequencydatamethodologicalconsiderationsontherelationshipbetweendictionariesandcorpora
AT laurentromary modelingfrequencydatamethodologicalconsiderationsontherelationshipbetweendictionariesandcorpora
AT gerhardbudin modelingfrequencydatamethodologicalconsiderationsontherelationshipbetweendictionariesandcorpora
AT danielschopper modelingfrequencydatamethodologicalconsiderationsontherelationshipbetweendictionariesandcorpora