Text Bundling: Statistics Based Data-Reduction.
Lawrence ShihJason D. M. RennieYu-Han ChangDavid R. KargerPublished in: ICML (2003)
Keyphrases
- data reduction
- data compression
- instance selection
- classification rules
- knowledge discovery
- representative subset
- data mining
- preprocessing
- data analysis
- classification accuracy
- model selection
- text mining
- rough set theory
- feature selection
- database
- information retrieval
- density estimates
- feature extraction
- high dimensionality
- machine learning
- rough sets
- neural network
- training data
- databases