1 Billion Pages = 1 Million Dollars? Mining the Web to Play "Who Wants to be a Millionaire?"
Shyong K. Lam, David M. Pennock, Dan Cosley, Steve Lawrence
Published in: CoRR (2012)
Keyphrases
- website
- web pages
- web mining
- web users
- web access logs
- web documents
- web usage
- web information
- web logs
- web crawling
- web applications
- web content
- clickstream data
- web usage mining
- page content
- web access
- web structure mining
- web data
- web crawlers
- search engine
- web graph
- click stream
- data mining
- dynamically generated
- web objects
- traversal patterns
- page contents
- web communities
- focused crawling
- page layout
- text mining
- web news
- user sessions
- link structure
- content similarity
- content features
- web server
- home page
- web scale
- digital libraries
- topic specific
- dynamic content
- knowledge discovery
- access logs
- information extraction
- log analysis
- data mining techniques
- web search engines
- data extraction
- ranking algorithm
- log files
- link analysis