Sample selection for dictionary-based corpus compression.
Christopher HoobinSimon J. PuglisiJustin ZobelPublished in: SIGIR (2011)
Keyphrases
- sample selection
- active learning
- training data
- selection strategy
- compression ratio
- data compression
- image compression
- support vector machine
- compression algorithm
- query translation
- named entity recognition
- neural network
- cross language information retrieval
- nearest neighbor
- supervised learning
- knn
- training dataset
- meta learning
- training set
- string matching
- word segmentation
- learning algorithm