Vertical Classification of Web Pages for Structured Data Extraction.
Long LiDandan SongLejian LiaoPublished in: AIRS (2012)
Keyphrases
- information retrieval systems
- data extraction
- web pages
- semi structured
- web data extraction
- web sources
- web page classification
- data integration
- structured data
- html pages
- data records
- machine learning
- information extraction
- web search
- search engine
- data sets
- mobile devices
- feature space
- similarity measure
- case study
- real world