Investigating the Best Configuration of HMM Spanish PoS Tagger when Minimum Amount of Training Data Is Available.
Sergio FerrándezJesús PeralPublished in: NLDB (2005)
Keyphrases
- training data
- training corpora
- hidden markov models
- pos taggers
- part of speech
- training corpus
- pos tagging
- learning algorithm
- supervised learning
- data sets
- decision trees
- domain adaptation
- test data
- training set
- text summarization
- classification accuracy
- question answering
- prior knowledge
- speech recognition
- machine translation system
- machine translation
- bayesian networks
- test set
- unlabeled data
- knowledge base
- feature selection