Lemmatisation and morphosyntactic annotation for the spoken Dutch corpus.
Frank Van Eynde, Jakub Zavrel, Walter Daelemans
Published in: CLIN (1999)
Keyphrases
- annotated corpus
- manually annotated
- automatic annotation
- metadata
- spoken language
- spontaneous speech
- semantic annotation
- manual annotation
- relation extraction
- image annotation
- hand-crafted
- inter-annotator agreement
- speech recognition
- active learning
- named entities
- conversational speech
- language understanding
- linguistically motivated
- named entity recognition
- automatic speech recognition
- annotation tool
- artificial intelligence
- automatic image annotation
- linguistic features
- training corpus
- noun phrases
- semi-automatically
- machine learning
- part of speech
- semi-automatic
- test set
- text classification
- probabilistic model
- training data
- learning algorithm
- information retrieval