Bayesian Data Cleaning for Web Data
Yuheng HuSushovan DeYi ChenSubbarao KambhampatiPublished in: CoRR (2012)
Keyphrases
- data cleaning
- web data
- web usage mining
- web mining
- data integration
- outlier detection
- semi structured
- data quality
- web content
- web pages
- record linkage
- text classification
- web documents
- data processing
- data model
- website
- fraud detection
- information retrieval
- database
- data warehouse
- data warehousing
- data mining techniques
- metadata
- feature selection
- data extraction
- real world