Towards better entity resolution techniques for Web document collections.
Surender Reddy Yerva, Zoltán Miklós, Karl Aberer
Published in: ICDE Workshops (2010)
Keyphrases
- document collections
- entity resolution
- information retrieval
- information retrieval systems
- scatter gather
- website
- test collection
- record linkage
- web pages
- information extraction
- digital libraries
- web documents
- query processing
- data cleaning
- web scale
- web content
- data integration
- link prediction
- web mining
- semantic web
- network structure
- web data
- web search engines
- web users
- markov networks
- database
- active learning
- data model
- machine learning