Aggregating Crowdsourced and Automatic Judgments to Scale Up a Corpus of Anaphoric Reference for Fiction and Wikipedia Texts.
Juntao Yu, Silviu Paun, Maris Camilleri, Paloma Carretero Garcia, Jon Chamberlain, Udo Kruschwitz, Massimo Poesio
Published in: EACL (2023)
Keyphrases
- world knowledge
- noun phrases
- natural language text
- named entities
- text corpus
- reference resolution
- anaphora resolution
- Wikipedia articles
- linguistic features
- semantic relations
- GENIA corpus
- scale space
- short texts
- named entity disambiguation
- knowledge sources
- hand crafted
- knowledge base
- coreference resolution
- training corpus
- newspaper articles
- English words
- computing semantic relatedness
- text corpora
- relevance assessments
- word sense
- text documents
- bag of words
- semi automatic
- information extraction
- natural language
- linguistic information
- link structure
- semantic information
- text classification
- topic tracking
- language model
- information extraction systems
- text mining