A Query-Dependent Duplicate Detection Approach for Large Scale Search Engines.
Shaozhi YeRuihua SongJi-Rong WenWei-Ying MaPublished in: APWeb (2004)
Keyphrases
- duplicate detection
- query dependent
- web search
- search engine
- query independent
- meta search
- web search engines
- ranking algorithm
- record linkage
- web scale
- multimedia retrieval
- web pages
- learning to rank
- information retrieval
- data cleaning
- search queries
- query logs
- databases
- ranking functions
- user queries
- retrieval systems
- web queries
- keywords
- database systems
- data sets
- data streams
- multimedia
- metadata
- machine learning