SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference

Bibliographic Details
Main Author: Kristina Nilsson Björkenstam
Format: Article
Language: English
Published: Linköping University Electronic Press 2013-09-01
Series: Northern European Journal of Language Technology
Online Access: https://nejlt.ep.liu.se/article/view/1656
Description
Summary: This paper describes SUC-CORE, a subset of the Stockholm Umeå Corpus and the Swedish Treebank annotated with noun phrase coreference. While most coreference-annotated corpora consist of texts of similar types within related domains, SUC-CORE consists of both informative and imaginative prose and covers a wide range of literary genres and domains. This allows for exploration of coreference across different text types, but it also means that there are limited amounts of data within each type. Future work on coreference resolution for Swedish should include making more annotated data available to the research community.
ISSN: 2000-1533