The Project Dialogism Novel Corpus: A Dataset for Quotation Attribution in Literary Texts.
Krishnapriya Vishnubhotla, Adam Hammond, Graeme Hirst
Published in: LREC (2022)
Keyphrases
- writing style
- natural language text
- training corpus
- project management
- case study
- newspaper articles
- text corpus
- natural language
- English words
- natural language generation
- benchmark datasets
- data collection
- text analysis
- training dataset
- word sense
- manually annotated
- statistical machine translation
- training data
- knowledge base
- textual features
- neural network
- database