A Continuous Information Gain Measure to Find the Most Discriminatory Problems for AI Benchmarking.
Matthew StephensonDamien AndersonAhmed KhalifaJohn LevineJochen RenzJulian TogeliusChristoph SalgePublished in: CoRR (2018)
Keyphrases
- information gain
- chi square
- text categorization
- occurrence frequency
- feature selection
- decision trees
- correlation coefficient
- chi squared
- information retrieval
- artificial intelligence
- machine learning
- similarity measure
- mutual information
- feature selection for text categorization
- gini index
- information theory
- logistic regression