Extracting Thai Compounds Using Collocations and POS Bigram Probabilities without a POS Tagger.
Wirote Aroonmanakun
Published in: IALP (2009)
Keyphrases
- n gram
- part of speech
- word segmentation
- pos tagging
- pos taggers
- language model
- language independent
- bag of words
- language modeling
- text classification
- training corpus
- word sense disambiguation
- parse tree
- syntactic categories
- query expansion
- penn treebank
- probabilistic model
- sentiment analysis
- retrieval systems
- web documents