Versatile optimization of UDF-heavy data flows with sofa.
Astrid RheinländerMartin BeckmannAnja KunkelArvid HeiseThomas StoltmannUlf LeserPublished in: SIGMOD Conference (2014)
Keyphrases
- data sets
- data analysis
- data quality
- high quality
- database
- training data
- data collection
- data sources
- optimization algorithm
- synthetic data
- data mining techniques
- application domains
- data structure
- spatial data
- experimental data
- noisy data
- big data
- statistical analysis
- data processing
- optimization problems
- small number
- knowledge discovery
- prior knowledge
- search engine
- databases