CUILESS2016: a clinical corpus applying compositional normalization of text mentions.
John David OsborneMatthew B. NeuMaria I. DanilaThamar SolorioSteven J. BethardPublished in: J. Biomed. Semant. (2018)
Keyphrases
- named entity disambiguation
- reference resolution
- broad coverage
- open domain
- text data
- coreference resolution
- supervised machine learning
- relation extraction
- anaphora resolution
- natural language text
- textual content
- plain text
- newspaper articles
- text corpus
- recognizing textual entailment
- manually annotated
- text documents
- multiword
- named entities
- noun phrases
- text retrieval
- scientific papers
- information extraction
- free text
- text mining
- english words
- text collections
- named entity recognition
- world knowledge
- linguistic information
- keywords
- spontaneous speech
- natural language processing
- linguistic patterns
- patient records
- medical data
- topic segmentation
- document level
- clinical data
- entity extraction
- sentence level
- lexical features
- word pairs
- text fragments
- textual features
- statistical machine translation
- information retrieval
- text corpora
- patient data
- web documents
- web pages