Context-Aware Duplicate Detection in Semi-structured Data Streams.
Parijat ShuklaArun K. SomaniPublished in: SERVICES (2014)
Keyphrases
- semi structured
- context aware
- duplicate detection
- data streams
- structured data
- contextual information
- ubiquitous computing
- mobile devices
- information integration
- data model
- context awareness
- information extraction
- web documents
- web data
- data extraction
- data cleaning
- record linkage
- sensor networks
- uncertain data
- text mining
- xml databases
- outlier detection
- data sets
- current context
- itemsets
- active learning
- natural language
- data mining