Part-of-Speech for Old Malay Manuscript Corpus: A Review.
Juhaida Abu BakarKhairuddin OmarMohammad Faidzul NasrudinMohd Zamri MurahPublished in: M-CAIT (2013)
Keyphrases
- part of speech
- pos tagging
- training corpus
- linguistic features
- multiword
- noun phrases
- n gram
- syntactic features
- linguistic information
- unknown words
- natural language processing
- penn treebank
- word sense disambiguation
- word sense
- tree bank
- unsupervised grammar induction
- text documents
- chinese word segmentation
- pos taggers
- syntactic categories
- ambiguous words
- parse tree
- text classification
- information retrieval systems