SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference

This paper describes SUC-CORE, a subset of the Stockholm Ume°a Corpus and the Swedish Treebank annotated with noun phrase coreference. While most coreference annotated corpora consist of exts of similar types within related domains, SUC-CORE consists of both informative and imaginative prose and co...

Full description

Saved in:
Bibliographic Details
Main Author: Kristina Nilsson Björkenstam
Format: Article
Language:English
Published: Linköping University Electronic Press 2013-09-01
Series:Northern European Journal of Language Technology
Online Access:https://nejlt.ep.liu.se/article/view/1656
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832590635913183232
author Kristina Nilsson Björkenstam
author_facet Kristina Nilsson Björkenstam
author_sort Kristina Nilsson Björkenstam
collection DOAJ
description This paper describes SUC-CORE, a subset of the Stockholm Ume°a Corpus and the Swedish Treebank annotated with noun phrase coreference. While most coreference annotated corpora consist of exts of similar types within related domains, SUC-CORE consists of both informative and imaginative prose and covers a wide range of literary genres and domains. This allows for exploration of coreference cross different text types, but it also means that there are limited amounts of data within each type. Future work on coreference resolution for Swedish should include making more annotated data vailable for the research community.
format Article
id doaj-art-d8159be2d39a4cb687722496e2288f97
institution Kabale University
issn 2000-1533
language English
publishDate 2013-09-01
publisher Linköping University Electronic Press
record_format Article
series Northern European Journal of Language Technology
spelling doaj-art-d8159be2d39a4cb687722496e2288f972025-01-23T10:36:34ZengLinköping University Electronic PressNorthern European Journal of Language Technology2000-15332013-09-01310.3384/nejlt.2000-1533.1332SUC-CORE: A Balanced Corpus Annotated with Noun Phrase CoreferenceKristina Nilsson Björkenstam0Stockholm University, Computational Linguistics, Department of Linguistics This paper describes SUC-CORE, a subset of the Stockholm Ume°a Corpus and the Swedish Treebank annotated with noun phrase coreference. While most coreference annotated corpora consist of exts of similar types within related domains, SUC-CORE consists of both informative and imaginative prose and covers a wide range of literary genres and domains. This allows for exploration of coreference cross different text types, but it also means that there are limited amounts of data within each type. Future work on coreference resolution for Swedish should include making more annotated data vailable for the research community. https://nejlt.ep.liu.se/article/view/1656
spellingShingle Kristina Nilsson Björkenstam
SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference
Northern European Journal of Language Technology
title SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference
title_full SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference
title_fullStr SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference
title_full_unstemmed SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference
title_short SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference
title_sort suc core a balanced corpus annotated with noun phrase coreference
url https://nejlt.ep.liu.se/article/view/1656
work_keys_str_mv AT kristinanilssonbjorkenstam succoreabalancedcorpusannotatedwithnounphrasecoreference