Crossing the finish line faster when paddling the Data Lake with Kayak.
Antonio MaccioniRiccardo TorlonePublished in: Proc. VLDB Endow. (2017)
Keyphrases
- data sets
- training data
- data quality
- image data
- synthetic data
- data processing
- data structure
- data analysis
- prior knowledge
- data sources
- probability distribution
- information retrieval
- knowledge discovery
- database
- small number
- original data
- raw data
- big data
- databases
- data objects
- experimental data
- microarray
- spatial data
- neural network
- statistical analysis
- data collection
- data mining
- data warehouse
- data points