CWRCzech: 100M Query-Document Czech Click Dataset and Its Application to Web Relevance Ranking.
Josef VonásekMilan StrakaRostislav KrcLenka LasonováEkaterina EgorovaJana StrakováJakub NáplavaPublished in: SIGIR (2024)
Keyphrases
- relevance ranking
- web search
- search engine
- document content
- web pages
- click logs
- anchor text
- web documents
- document collections
- query result
- search queries
- web search engines
- web data
- query logs
- topic modeling
- user queries
- retrieval systems
- database
- ranked list
- foreign language
- deep web
- web mining
- information retrieval
- test collection
- query processing
- link analysis
- web databases
- keywords