Resource type
Date created
2018-03
Authors/Contributors
Author: Das, Debopam
Author: Taboada, Maite
Abstract
We present the RST Signalling Corpus (Das et al. in RST signalling corpus, LDC2015T10. https://catalog.ldc.upenn.edu/LDC2015T10, 2015), a corpus annotated for signals of coherence relations. The corpus is developed over the RST Discourse Treebank (Carlson et al. in RST Discourse Treebank, LDC2002T07. https://catalog.ldc.upenn.edu/LDC2002T07, 2002) which is annotated for coherence relations. In the RST Signalling Corpus, these relations are further annotated with signalling information. The corpus includes annotation not only for discourse markers which are considered to be the most typical (or sometimes the only type of) signals in discourse, but also for a wide array of other signals such as reference, lexical, semantic, syntactic, graphical and genre features as potential indicators of coherence relations. We describe the research underlying the development of the corpus and the annotation process, and provide details of the corpus. We also present the results of an inter-annotator agreement study, illustrating the validity and reproducibility of the annotation. The corpus is available through the Linguistic Data Consortium, and can be used to investigate the psycholinguistic mechanisms behind the interpretation of relations through signalling, and also to develop discourse-specific computational systems such as discourse parsing applications.
Document
Published as
Das, D., and Taboada, M. (2018). RST Signalling Corpus: A corpus of signals of coherence relations. Language Resources and Evaluation 52: 149. 10.1007/s10579-017-9383-x
Publication details
Publication title
Language Resources and Evaluation
Document title
RST Signalling Corpus: A Corpus of Signals of Coherence Relations
Date
2018
Volume
52
Issue
149
Publisher DOI
10.1007/s10579-017-9383-x
Copyright statement
Copyright is held by the author(s).
Scholarly level
Peer reviewed?
Yes
Language
English
Member of collection
Download file | Size |
---|---|
Das_Taboada_LRE_pre-pub.pdf | 419.28 KB |