Compasses, Magnets, Water Microscopes: Annotation of Terminology in a Diachronic Corpus of Scientific Texts.
Anne-Kathrin Schumann, Stefan Fischer
Published in: LREC (2016)
Keyphrases
- scientific papers
- terminology extraction
- annotated corpus
- natural language text
- training corpus
- newspaper articles
- information extraction systems
- controlled vocabulary
- text corpus
- English words
- scientific literature
- magnetic field
- hand-crafted
- scientific data
- inter-annotator agreement
- manual annotation
- manually annotated
- word sense
- semantic annotation
- domain-specific
- image annotation
- writing style
- world knowledge
- information extraction
- automatic annotation
- linguistic features
- free text
- active learning
- named entity recognition
- comparable corpora
- water quality
- GENIA corpus
- cross-language information retrieval
- linguistic information
- statistical machine translation
- textual features
- named entities
- annotation tool
- text classification
- linguistic patterns
- image retrieval
- data mining
- sentence level
- semi-automatically
- science education
- text documents
- text mining
- metadata