On exploring data lakes by finding compact, isolated clusters.
Patricia JiménezJuan C. RoldánRafael CorchueloPublished in: Inf. Sci. (2022)
Keyphrases
- data sets
- input data
- high quality
- database
- data points
- data analysis
- data records
- raw data
- statistical methods
- missing data
- synthetic data
- image data
- statistical analysis
- data collection
- sensor data
- spatial data
- data distribution
- data sources
- prior knowledge
- data objects
- complex data
- computer systems
- attribute values
- spectral clustering
- training data