SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference
This paper describes SUC-CORE, a subset of the Stockholm Ume°a Corpus and the Swedish Treebank annotated with noun phrase coreference. While most coreference annotated corpora consist of exts of similar types within related domains, SUC-CORE consists of both informative and imaginative prose and co...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Linköping University Electronic Press
2013-09-01
|
Series: | Northern European Journal of Language Technology |
Online Access: | https://nejlt.ep.liu.se/article/view/1656 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832590635913183232 |
---|---|
author | Kristina Nilsson Björkenstam |
author_facet | Kristina Nilsson Björkenstam |
author_sort | Kristina Nilsson Björkenstam |
collection | DOAJ |
description |
This paper describes SUC-CORE, a subset of the Stockholm Ume°a Corpus and the Swedish Treebank annotated with noun phrase coreference. While most coreference annotated corpora consist of exts of similar types within related domains, SUC-CORE consists of both informative and imaginative prose and covers a wide range of literary genres and domains. This allows for exploration of coreference cross different text types, but it also means that there are limited amounts of data within each type. Future work on coreference resolution for Swedish should include making more annotated data vailable for the research community.
|
format | Article |
id | doaj-art-d8159be2d39a4cb687722496e2288f97 |
institution | Kabale University |
issn | 2000-1533 |
language | English |
publishDate | 2013-09-01 |
publisher | Linköping University Electronic Press |
record_format | Article |
series | Northern European Journal of Language Technology |
spelling | doaj-art-d8159be2d39a4cb687722496e2288f972025-01-23T10:36:34ZengLinköping University Electronic PressNorthern European Journal of Language Technology2000-15332013-09-01310.3384/nejlt.2000-1533.1332SUC-CORE: A Balanced Corpus Annotated with Noun Phrase CoreferenceKristina Nilsson Björkenstam0Stockholm University, Computational Linguistics, Department of Linguistics This paper describes SUC-CORE, a subset of the Stockholm Ume°a Corpus and the Swedish Treebank annotated with noun phrase coreference. While most coreference annotated corpora consist of exts of similar types within related domains, SUC-CORE consists of both informative and imaginative prose and covers a wide range of literary genres and domains. This allows for exploration of coreference cross different text types, but it also means that there are limited amounts of data within each type. Future work on coreference resolution for Swedish should include making more annotated data vailable for the research community. https://nejlt.ep.liu.se/article/view/1656 |
spellingShingle | Kristina Nilsson Björkenstam SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference Northern European Journal of Language Technology |
title | SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference |
title_full | SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference |
title_fullStr | SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference |
title_full_unstemmed | SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference |
title_short | SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference |
title_sort | suc core a balanced corpus annotated with noun phrase coreference |
url | https://nejlt.ep.liu.se/article/view/1656 |
work_keys_str_mv | AT kristinanilssonbjorkenstam succoreabalancedcorpusannotatedwithnounphrasecoreference |