Text Classification for Data Loss Prevention.
Michael HartPratyusa K. ManadhataRob JohnsonPublished in: PETS (2011)
Keyphrases
- text classification
- data sets
- high quality
- experimental data
- database
- original data
- raw data
- data analysis
- synthetic data
- data collection
- text data
- training data
- image data
- data quality
- data sources
- bag of words
- sensor data
- spatial data
- feature selection
- data structure
- missing values
- data distribution
- missing data
- end users
- high dimensional data
- databases
- knowledge discovery
- input data
- text mining
- small number