Fast nGram-Based String Search Over Data Encoded Using Algebraic Signatures.
Witold LitwinRiad MokademPhilippe RigauxThomas J. E. SchwarzPublished in: VLDB (2007)
Keyphrases
- data sets
- data collection
- data objects
- raw data
- data sources
- original data
- data distribution
- data points
- data processing
- database
- high quality
- data mining
- pattern matching
- synthetic data
- machine learning
- image data
- end users
- search algorithm
- data structure
- text classification
- information retrieval systems
- information extraction
- statistical analysis
- high dimensional data
- missing data
- probabilistic model
- training data