Group based Self Training for E-Commerce Product Record Linkage.
Xin Zhao, Yuexin Wu, Hongfei Yan, Xiaoming Li
Published in: COLING (2014)
Keyphrases
- record linkage
- group membership
- duplicate detection
- product information
- data cleaning
- privacy preserving
- entity resolution
- electronic commerce
- multiple databases
- life cycle
- linked data
- product recommendation
- product quality
- machine learning
- disclosure risk
- cost sensitive
- semi supervised
- semi supervised learning
- consumer behavior
- approximate matching
- data analysis
- metadata
- online stores
- feature selection
- census data
- product search
- data sets
- customer preferences