Comparing SQL and MapReduce to compute Naive Bayes in a single table scan.
Sasi K. PitchaimalaiCarlos OrdonezCarlos Garcia-AlvaradoPublished in: CloudDB@CIKM (2010)
Keyphrases
- naive bayes
- decision trees
- text classification
- classification accuracy
- logistic regression
- naive bayes classifier
- text categorization
- classification algorithm
- averaged one dependence estimators
- training data
- uci data sets
- probability estimation
- naive bayesian classifier
- bayesian networks
- feature selection
- uci datasets
- base classifiers
- database
- databases
- cost sensitive
- bayesian classifier
- locally weighted
- bayesian network classifiers
- test instances
- text classifiers
- naive bayes models
- probabilistic classifiers
- conditional independence assumption
- augmented naive bayes
- information retrieval
- data sets
- model selection
- training set
- machine learning
- boosted decision trees
- neural network