Clustering header categories extracted from web tables.
George NagyDavid W. EmbleyMukkai S. KrishnamoorthySharad C. SethPublished in: DRR (2015)
Keyphrases
- website
- clustering algorithm
- web applications
- k means
- web pages
- database
- hierarchical clustering
- cluster analysis
- categorical data
- automatically extracted
- web mining
- clustering method
- semantic web
- web snippets
- web sessions
- web people search
- user generated content
- web content
- classifying web pages
- fuzzy clustering
- data mining
- web users
- web resources
- web technologies
- information sources
- web communities
- tag information
- data points
- databases
- search engine
- end users
- content similarity
- relational information
- spectral clustering
- outlier detection
- object categories
- hierarchical structure