Seeping Semantics: Linking Datasets Using Word Embeddings for Data Discovery.
Raul Castro FernandezEssam MansourAbdulhakim Ali QahtanAhmed K. ElmagarmidIhab F. IlyasSamuel MaddenMourad OuzzaniMichael StonebrakerNan TangPublished in: ICDE (2018)
Keyphrases
- data sets
- database
- knowledge discovery
- raw data
- training data
- data analysis
- data points
- missing data
- high dimensional datasets
- original data
- computer systems
- data mining techniques
- image data
- data collection
- text classification
- synthetic data
- data mining algorithms
- test data
- co occurrence
- data objects
- data mining tasks
- synthetic datasets
- experimental conditions
- feature selection