Active Learning-Based Corpus Annotation - The PathoJen Experience.
Udo HahnElena BeisswangerEkaterina BuykoErik FaesslerPublished in: AMIA (2012)
Keyphrases
- active learning
- annotation effort
- supervised machine learning
- random sampling
- learning algorithm
- supervised learning
- machine learning
- semi supervised
- experimental design
- annotated corpus
- selective sampling
- training examples
- learning strategies
- manually annotated
- test set
- cost sensitive
- relevance feedback
- learning process
- training set
- semantic annotation
- labeled data
- user experience
- unlabeled data
- co occurrence
- linguistic features
- natural language processing
- sample selection
- inter annotator agreement