RiQuA: A Corpus of Rich Quotation Annotation for English Literary Text.
Sean PapaySebastian PadóPublished in: LREC (2020)
Keyphrases
- broad coverage
- open domain
- english words
- multiword
- english text
- training corpus
- english language
- annotated corpus
- text analysis
- text data
- word sense
- machine translation system
- supervised machine learning
- natural language processing
- recognizing textual entailment
- link grammar
- statistical machine translation
- information extraction
- mono lingual
- native language
- spontaneous speech
- reading comprehension
- text to speech
- person names
- information retrieval
- wide coverage
- semi automatically
- text collections
- text retrieval
- relation extraction
- lexical features
- question answering
- text classification
- text mining
- parallel corpus
- text corpus
- machine translation
- text corpora
- automatic annotation
- natural language text
- metadata
- temporal expressions
- document level
- sentence level
- named entities
- source language
- language identification
- text documents
- document corpus
- semantic annotation
- natural language
- penn treebank
- manually annotated
- target language