A Part-of-Speech Tag Clustering for a Word Prediction System in Portuguese Language.
Daniel Cruz Cavalieri, Teodiano Freire Bastos Filho, Mário Sarcinelli Filho, Sira E. Palazuelos-Cagigas, Javier Macías Guarasa, José Luis Martín Sánchez
Published in: Proces. del Leng. Natural (2011)
Keyphrases
- part of speech
- n gram
- portuguese language
- word sense disambiguation
- chinese word segmentation
- natural language processing
- syntactic categories
- bag of words
- linguistic information
- lexical information
- unknown words
- syntactic information
- noun phrases
- pos tagging
- training corpus
- multiword
- pos taggers
- clustering algorithm
- clustering method
- word sense
- k means
- text classification
- ambiguous words
- keywords
- unsupervised learning
- document clustering
- cluster analysis
- language model
- text documents
- data points
- named entity recognition
- information retrieval
- natural language
- probabilistic model