Farsi Text Classification Using N-Grams and Knn Algorithm A Comparative Study.
Bahareh BinaMohamad Hasan AhmadiMaseud RahgozarPublished in: DMIN (2008)
Keyphrases
- text classification
- knn algorithm
- n gram
- knn
- bag of words
- k nearest neighbor
- text categorization
- language independent
- feature selection
- naive bayes
- language modeling
- variable length
- machine learning
- labeled data
- text mining
- viterbi algorithm
- text documents
- inside outside algorithm
- unlabeled data
- semi supervised
- semantic features
- statistical language modeling
- nearest neighbor
- artificial neural networks
- part of speech
- databases