By a Thread: Encoding Online Forum Data in TEI

Online forums are platforms where users interact in conversations organized around common topics. In this paper, we make a proposal for encoding forum data according to the TEI Guidelines using a unified format, which covers both traditional online forums as well as Reddit, the largest platform that...

Full description

Saved in:
Bibliographic Details
Main Authors: Sebastian Reimann, Lina Rodenhausen, Frederik Elwert, Tatjana Scheffler
Format: Article
Language:deu
Published: Text Encoding Initiative Consortium 2024-05-01
Series:Journal of the Text Encoding Initiative
Subjects:
Online Access:https://journals.openedition.org/jtei/5084
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832578468422877184
author Sebastian Reimann
Lina Rodenhausen
Frederik Elwert
Tatjana Scheffler
author_facet Sebastian Reimann
Lina Rodenhausen
Frederik Elwert
Tatjana Scheffler
author_sort Sebastian Reimann
collection DOAJ
description Online forums are platforms where users interact in conversations organized around common topics. In this paper, we make a proposal for encoding forum data according to the TEI Guidelines using a unified format, which covers both traditional online forums as well as Reddit, the largest platform that offers forum functionality. We first discuss the specific properties of various types of forums, including most prominently their treelike thread structure. We argue that this tree structure is best represented in a nested XML tree, and does not follow existing stream- or timestamp-based CMC schemas. We present a solution that makes use of a wide range of previously available elements from the TEI Guidelines and the CMC-core schema to encode forums with different thread structures, types of post reactions, and sets of available emojis. Moreover, we propose a TEI header for storing forum metadata within the context of interdisciplinary research, which addresses the challenges of applying TEI elements to born-digital data. Finally, we propose customizations to preexisting TEI elements that are necessary to cover several peculiarities of online forums.
format Article
id doaj-art-cbbbc04aa7774edb996ef83fefac07b7
institution Kabale University
issn 2162-5603
language deu
publishDate 2024-05-01
publisher Text Encoding Initiative Consortium
record_format Article
series Journal of the Text Encoding Initiative
spelling doaj-art-cbbbc04aa7774edb996ef83fefac07b72025-01-30T13:56:45ZdeuText Encoding Initiative ConsortiumJournal of the Text Encoding Initiative2162-56032024-05-011710.4000/1209kBy a Thread: Encoding Online Forum Data in TEISebastian ReimannLina RodenhausenFrederik ElwertTatjana SchefflerOnline forums are platforms where users interact in conversations organized around common topics. In this paper, we make a proposal for encoding forum data according to the TEI Guidelines using a unified format, which covers both traditional online forums as well as Reddit, the largest platform that offers forum functionality. We first discuss the specific properties of various types of forums, including most prominently their treelike thread structure. We argue that this tree structure is best represented in a nested XML tree, and does not follow existing stream- or timestamp-based CMC schemas. We present a solution that makes use of a wide range of previously available elements from the TEI Guidelines and the CMC-core schema to encode forums with different thread structures, types of post reactions, and sets of available emojis. Moreover, we propose a TEI header for storing forum metadata within the context of interdisciplinary research, which addresses the challenges of applying TEI elements to born-digital data. Finally, we propose customizations to preexisting TEI elements that are necessary to cover several peculiarities of online forums.https://journals.openedition.org/jtei/5084computer-mediated communicationdigital religionemojisonline forumsRedditthread structure
spellingShingle Sebastian Reimann
Lina Rodenhausen
Frederik Elwert
Tatjana Scheffler
By a Thread: Encoding Online Forum Data in TEI
Journal of the Text Encoding Initiative
computer-mediated communication
digital religion
emojis
online forums
Reddit
thread structure
title By a Thread: Encoding Online Forum Data in TEI
title_full By a Thread: Encoding Online Forum Data in TEI
title_fullStr By a Thread: Encoding Online Forum Data in TEI
title_full_unstemmed By a Thread: Encoding Online Forum Data in TEI
title_short By a Thread: Encoding Online Forum Data in TEI
title_sort by a thread encoding online forum data in tei
topic computer-mediated communication
digital religion
emojis
online forums
Reddit
thread structure
url https://journals.openedition.org/jtei/5084
work_keys_str_mv AT sebastianreimann byathreadencodingonlineforumdataintei
AT linarodenhausen byathreadencodingonlineforumdataintei
AT frederikelwert byathreadencodingonlineforumdataintei
AT tatjanascheffler byathreadencodingonlineforumdataintei