The Great Textual Hoax: Boosting Sampled String Matching with Fake Samples.
Simone FaroFrancesco Pio MarinoAntonino Andrea MoschettoArianna PavoneAntonio ScardacePublished in: FUN (2024)
Keyphrases
- string matching
- pattern matching
- approximate string matching
- edit distance
- suffix tree
- learning algorithm
- approximate matching
- training samples
- data sets
- keywords
- regular expressions
- metadata
- clone detection
- aho corasick
- training set
- decision trees
- pattern matching algorithm
- feature space
- pattern recognition
- search engine