JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes.
Erkang ZhuDong DengFatemeh NargesianRenée J. MillerPublished in: SIGMOD Conference (2019)
Keyphrases
- similarity search
- data sets
- high dimensional data
- database
- similarity measure
- user defined
- data structure
- data analysis
- databases
- metric space
- data sources
- multimodal data
- input data
- image data
- feature selection
- nearest neighbor
- multi dimensional
- data points
- high dimensional
- data distribution
- multimedia
- data objects
- machine learning