Semantic keywords-based duplicated web pages removing.
Yunhe Weng, Lei Li, Yixin Zhong
Published in: NLPKE (2008)
Keyphrases
- keywords
- web pages
- semantic information
- search engine
- semantic relationships
- semantic content
- web documents
- semantic categories
- website
- web search
- text representation
- keyword search
- semantic search
- semantic similarity
- relevant documents
- web page classification
- domain specific
- web information
- semantic features
- semantic context
- web content mining
- visual features
- web search engines
- domain ontology
- high level
- semantic web
- semi structured
- textual information
- web data
- web server
- link analysis
- conceptual graphs
- visual information
- low level features
- web users
- semantically related
- wordnet
- semantic annotation
- keyword extraction
- latent semantic
- text mining
- web information extraction
- web content