Automatic Training Data Cleaning for Text Classification.
Hassan H. MalikVikas S. BhardwajPublished in: ICDM Workshops (2011)
Keyphrases
- data cleaning
- text classification
- data integration
- data quality
- record linkage
- outlier detection
- data warehouse
- data processing
- text categorization
- data warehousing
- feature selection
- missing values
- database
- knn
- data mining
- training set
- web usage mining
- machine learning
- neural network
- fraud detection
- text mining
- naive bayes
- data sources
- integrity constraints
- decision trees
- natural language
- decision support
- data extraction