Scaling up text classification for large file systems.
George FormanShyamsundar RajaramPublished in: KDD (2008)
Keyphrases
- file system
- text classification
- instance selection
- text categorization
- bag of words
- access patterns
- text mining
- labeled data
- feature selection
- search tools
- data transfer
- machine learning
- multi label
- semantic features
- knn
- flash memory
- text classifiers
- data cleaning
- application specific
- n gram
- multi tiered
- scalable distributed
- information retrieval
- query processing
- storage devices