A Probabilistic Model for Text Categorization: Based on a Single Random Variable with Multiple Values.
Makoto Iwayama, Takenobu Tokunaga
Published in: ANLP (1994)
Keyphrases
- text categorization
- random variables
- probabilistic model
- graphical models
- bayesian networks
- feature selection
- distribution function
- knn
- text classification
- probability distribution
- conditional probabilities
- document classification
- k nearest neighbor
- multi label
- semi supervised learning
- reuters corpus
- information gain
- naive bayes
- automated text categorization
- automatic text categorization
- language model
- feature selection for text categorization
- test statistic
- text classifiers
- continuous variables
- decision variables
- document frequency
- expectation maximization
- text documents
- learning algorithm
- feature space