Bottom-Up Discovery of Clusters of Maximal Ranges in HTML Trees for Search Engines Results Extraction
Dominik Flejter, Roman Hryniewiecki
Published in: BIS (2007)
Keyphrases
- search engine
- web pages
- information extraction
- clustering algorithm
- information retrieval
- web search
- decision trees
- web search engines
- information discovery
- hierarchical clustering
- semi-structured
- meta search engine
- data extraction
- web data extraction
- web browser
- internet search
- leaf nodes
- keyword search
- fuzzy clustering
- news pages
- data clustering
- image segmentation
- keywords
- website
- self-organizing maps
- automatic extraction
- machine learning
- data objects
- content extraction
- natural language
- data points
- tree structures
- web queries
- structured data
- search queries
- tree structure
- fuzzy c-means
- query logs
- visual attention