Data extraction from web pages based on structural-semantic entropy.
Xiaoqing Zheng, Yiling Gu, Yinsheng Li
Published in: WWW (Companion Volume) (2012)
Keyphrases
- data sets
- data distribution
- data processing
- small number
- data sources
- neural network
- complex data
- database
- data points
- data analysis
- natural language
- training data
- input data
- semantically enriched
- feature selection
- high level
- sensor data
- missing data
- statistical analysis
- computer systems
- high dimensional
- natural language processing
- data structure
- image data
- probability distribution
- prior knowledge