Granular support vector machines with data cleaning for fast and accurate biomedical binary classification.
Yuchun Tang, Yan-Qing Zhang
Published in: GrC (2005)
Keyphrases
- binary classification
- data cleaning
- multi class
- cost sensitive
- support vector
- learning problems
- data integration
- text classification
- data quality
- multi label
- fraud detection
- generalization error
- outlier detection
- record linkage
- information extraction
- database
- support vector machine
- data processing
- prediction accuracy
- class imbalance
- data warehousing
- data warehouse
- text mining
- supervised learning
- integrity constraints
- web usage mining
- text categorization
- missing values
- cross validation
- semi supervised learning
- training set
- website
- feature selection
- learning algorithm
- neural network
- databases