Entity Matching in the Wild: A Consistent and Versatile Framework to Unify Data in Industrial Applications.
Yan YanStephen MeylesAria HaghighiDan SuciuPublished in: SIGMOD Conference (2020)
Keyphrases
- industrial applications
- data sets
- small number
- data sources
- database
- raw data
- experimental data
- input data
- high quality
- synthetic data
- data processing
- statistical analysis
- original data
- learning algorithm
- data collection
- computer systems
- computational modeling
- knowledge base
- consistency constraints
- data quality
- databases
- image data
- data points
- prior knowledge
- relational databases
- data analysis
- training data
- image sequences