SogouT-16: A New Web Corpus to Embrace IR Research.
Cheng Luo, Yukun Zheng, Yiqun Liu, Xiaochuan Wang, Jingfang Xu, Min Zhang, Shaoping Ma
Published in: SIGIR (2017)
Keyphrases
- information access
- website
- web applications
- information sources
- multilingual
- newspaper articles
- web pages
- web resources
- end users
- web documents
- web technologies
- web scale
- web information
- neural network
- open domain
- database
- textual features
- user generated content
- link analysis
- linked data
- user experience
- semantic web
- digital libraries
- knowledge base