SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference
This paper describes SUC-CORE, a subset of the Stockholm Ume°a Corpus and the Swedish Treebank annotated with noun phrase coreference. While most coreference annotated corpora consist of exts of similar types within related domains, SUC-CORE consists of both informative and imaginative prose and co...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Linköping University Electronic Press
2013-09-01
|
Series: | Northern European Journal of Language Technology |
Online Access: | https://nejlt.ep.liu.se/article/view/1656 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | This paper describes SUC-CORE, a subset of the Stockholm Ume°a Corpus and the Swedish Treebank annotated with noun phrase coreference. While most coreference annotated corpora consist of exts of similar types within related domains, SUC-CORE consists of both informative and imaginative prose and covers a wide range of literary genres and domains. This allows for exploration of coreference cross different text types, but it also means that there are limited amounts of data within each type. Future work on coreference resolution for Swedish should include making more annotated data vailable for the research community.
|
---|---|
ISSN: | 2000-1533 |