A clustering approach to extract data from HTML tables.
Patricia JiménezJuan C. RoldánRafael CorchueloPublished in: Inf. Process. Manag. (2021)
Keyphrases
- data sets
- data points
- spectral clustering
- raw data
- data analysis
- data sources
- synthetic data
- data collection
- high dimensional data
- database
- computer systems
- clustering method
- image data
- databases
- data processing
- input data
- statistical analysis
- probability distribution
- attribute values
- spatial data
- training data
- original data
- synthetic datasets