Classification of Highly Unbalanced CYP450 Data of Drugs Using Cost Sensitive Machine Learning Techniques.
Tatjana EitrichAchim KlessClaudia DruskaWolfgang MeyerJohannes GrotendorstPublished in: J. Chem. Inf. Model. (2007)
Keyphrases
- cost sensitive
- data sets
- data analysis
- classification algorithm
- cost sensitive classification
- cost sensitive learning
- small number
- training samples
- statistical methods
- knowledge discovery
- training data
- text mining
- machine learning algorithms
- attribute values
- machine learning methods
- missing values
- class distribution
- class imbalance
- binary classification
- machine learning