Inducing Word and Part-of-Speech with Pitman-Yor Hidden Semi-Markov Models.
Kei UchiumiHiroshi TsukaharaDaichi MochihashiPublished in: ACL (1) (2015)
Keyphrases
- part of speech
- n gram
- word sense disambiguation
- syntactic information
- hidden semi markov models
- unknown words
- chinese word segmentation
- lexical information
- syntactic categories
- noun phrases
- training corpus
- linguistic information
- pos tagging
- multiword
- pos taggers
- natural language processing
- word sense
- word segmentation
- language model
- language independent
- wordnet
- ambiguous words
- text classification
- abnormality detection
- semi markov
- tf idf
- co occurrence
- hidden markov models
- parse tree
- training data
- text documents
- bag of words
- data analysis