I/O-Conscious Data Preparation for Large-Scale Web Search Engines.
Maxim Lifantsev, Tzi-cker Chiueh
Published in: VLDB (2002)
Keyphrases
- web search engines
- data preparation
- web search
- search engine
- knowledge discovery in databases
- preprocessing
- data quality
- knowledge discovery
- search queries
- web documents
- query logs
- data analysis
- pattern discovery
- user queries
- web pages
- web usage mining
- data mining
- real world
- ranking functions
- web server
- feature selection
- process model
- data modeling
- knowledge discovery and data mining
- caching strategies
- text mining
- nearest neighbor
- machine learning
- knowledge driven