TOMATE: A heuristic-based approach to extract data from HTML tables.
Juan C. RoldánPatricia JiménezPedro A. SzekelyRafael CorchueloPublished in: Inf. Sci. (2021)
Keyphrases
- data sets
- image data
- database
- training data
- databases
- high quality
- data collection
- small number
- data points
- data quality
- synthetic data
- data processing
- data sources
- prior knowledge
- data mining techniques
- input data
- data analysis
- data structure
- original data
- neural network
- probability distribution
- search algorithm
- computer systems
- high dimensional data
- experimental data
- data distribution