Text Sampling and Re-Sampling for Imbalanced Authorship Identification Cases.
Efstathios StamatatosPublished in: ECAI (2006)
Keyphrases
- information retrieval
- random sampling
- data sets
- sampling strategies
- sampling strategy
- imbalanced data
- parameter space
- text retrieval
- training data
- text mining
- free text
- sampling rate
- sampling methods
- imbalanced datasets
- database
- monte carlo
- co occurrence
- image retrieval
- class distribution
- importance sampling
- sampled data
- neural network