Improving Hamming distance-based fuzzy join in MapReduce using Bloom Filters.
Thi-To-Quyen TranThuong-Cang PhanAnne LaurentLaurent d'OrazioPublished in: FUZZ-IEEE (2018)
Keyphrases
- bloom filter
- data structure
- distance measure
- fuzzy sets
- fuzzy logic
- membership queries
- record linkage
- join algorithms
- parallel processing
- query optimization
- fuzzy controller
- fuzzy rules
- fuzzy numbers
- membership functions
- outlier detection
- machine learning
- high performance data mining
- cartesian product
- hamming distance
- cost model
- fuzzy clustering
- index structure
- training data