Hybrid Random Forests: Advantages of Mixed Trees in Classifying Text Data.
Baoxun XuJoshua Zhexue HuangGraham J. WilliamsMark Junjie LiYunming YePublished in: PAKDD (1) (2012)
Keyphrases
- random forests
- text data
- tree ensembles
- decision trees
- decision tree ensembles
- randomized trees
- text classification
- text mining
- random forest
- logistic regression
- machine learning algorithms
- ensemble methods
- document collections
- high dimensional
- structured data
- text documents
- high dimensional data
- machine learning
- information retrieval
- web pages
- active learning
- information extraction
- learning algorithm
- training set
- data mining
- neural network
- databases
- naive bayes
- dimensionality reduction
- database
- natural language processing
- image features
- feature extraction