A new differential LSI space-based probabilistic document classifier.
Liang Chen, Naoyuki Tokuda, Akira Nagai
Published in: Inf. Process. Lett. (2003)
Keyphrases
- document space
- latent semantic indexing
- vector space
- training data
- decision trees
- document images
- probabilistic model
- training set
- support vector machine
- latent semantic
- text classifiers
- vector space model
- document collections
- classification algorithm
- feature space
- support vector
- data sets
- classification process
- document classification
- uncertain data
- classification method
- svm classifier
- generative model
- information retrieval systems
- feature set
- text documents
- web documents
- document representation
- bayesian networks
- supervised learning
- high dimensional