Linguistic Annotation of the Spoken Dutch Corpus: If We Had To Do It All Over Again.
Ineke SchuurmanWim GoedertierHeleen HoekstraNelleke OostdijkRichard PiepenbrockMachteld SchouppePublished in: LREC (2004)
Keyphrases
- hand crafted
- linguistic features
- annotated corpus
- linguistic information
- natural language text
- linguistic patterns
- reference resolution
- conversational speech
- inter annotator agreement
- spontaneous speech
- automatic annotation
- natural language processing
- manually annotated
- semantic annotation
- natural language
- automatically generated
- active learning
- manual annotation
- linguistic knowledge
- named entity recognition
- spoken language
- automatic image annotation
- speech recognition
- image annotation
- relation extraction
- text classification
- wordnet
- metadata
- word sense disambiguation
- higher level
- automatic speech recognition
- language understanding
- test set
- semantic features
- topic models
- coreference resolution
- semi automatically