Clustering Visually Similar Web Page Elements for Structured Web Data Extraction.
Tomas GrigalisLukas RadvilaviciusAntanas CenysJuozas GordeviciusPublished in: ICWE (2012)
Keyphrases
- web data extraction
- web pages
- visually similar
- data extraction
- semi structured
- web images
- clustering algorithm
- visual features
- k means
- clustering method
- automatically extracted
- search engine
- web search
- structured data
- semantically related
- document clustering
- web data
- web search engines
- natural language
- data mining
- web documents
- domain specific
- anchor text
- text mining
- visual similarity
- meta search engine
- information retrieval