Mining Statistically Significant Substrings Based on the Chi-Square Measure
Sourav DuttaArnab BhattacharyaPublished in: CoRR (2010)
Keyphrases
- statistically significant
- chi square
- logistic regression
- information gain
- correlation coefficient
- learning styles
- mutual information
- interestingness measures
- control group
- pearson correlation
- statistical tests
- statistical significance
- confidence intervals
- text mining
- data mining
- text categorization
- decision trees
- kl divergence
- itemsets
- training data
- kullback leibler divergence
- information retrieval