SPID4.7: Discretization Using Successive Pseudo Deletion at Maximum Information Gain Boundary Points.
Himika BiswasSomnath PalPublished in: SDM (2005)
Keyphrases
- information gain
- boundary points
- text categorization
- feature selection
- decision trees
- chi squared
- mutual information
- object boundaries
- naive bayes
- point sets
- data points
- correlation coefficient
- data mining
- text classification
- co occurrence
- nearest neighbor
- high dimensional
- training set
- neural network
- gain ratio
- occurrence probabilities