SciDMT: A Large-Scale Corpus for Detecting Scientific Mentions.
Huitong Pan, Qi Zhang, Cornelia Caragea, Eduard Dragut, Longin Jan Latecki
Published in: CoRR (2024)
Keyphrases
- coreference resolution
- reference resolution
- real life
- real world
- scientific papers
- open domain
- test set
- small scale
- user generated
- named entity disambiguation
- scientific disciplines
- supervised machine learning
- manually annotated
- real time
- artificial intelligence
- web scale
- statistical machine translation
- scientific data
- co-occurrence
- probabilistic model
- natural language
- data sets