STTS als Part-of-Speech-Tagset in Tübinger Baumbanken [STTS as a Part-of-Speech Tagset in the Tübingen Treebanks].
Heike Telljohann, Yannick Versley, Kathrin Beck, Erhard W. Hinrichs, Thomas Zastrow
Published in: J. Lang. Technol. Comput. Linguistics (2013)
Keyphrases
- part of speech
- n-gram
- POS tagging
- word sense disambiguation
- natural language processing
- unsupervised grammar induction
- training corpus
- syntactic categories
- parse tree
- linguistic information
- noun phrases
- tf-idf
- text documents
- lexical information
- text classification
- Chinese word segmentation
- POS taggers
- feature vectors
- feature selection
- unknown words
- language model
- training data