Gender prediction on a real life blog data set using LSI and KNN.
Jianle Chen, Tianqi Xiao, Jie Sheng, Ankur Teredesai
Published in: CCWC (2017)
Keyphrases
- knn
- real life
- k nearest neighbor
- data sets
- nearest neighbor
- text categorization
- k nearest neighbour
- prediction accuracy
- knn algorithm
- classification algorithm
- distance function
- dimensionality reduction methods
- support vector machine svm
- text classification
- imputation methods
- feature selection
- classification method
- similarity search
- training data
- knn classifier
- support vector machine
- high dimensional data
- information retrieval
- latent semantic indexing
- test instances
- shows significant improvements
- input space
- text retrieval
- euclidean distance
- training set
- machine learning
- neural network