HoloCleanX: A Multi-source Heterogeneous Data Cleaning Solution Based on Lakehouse.
Qin Cui, Wenkui Zheng, Wei Hou, Ming Sheng, Peng Ren, Wang Chang, Xiangyang Li
Published in: HIS (2022)
Keyphrases
- multi source
- data cleaning
- data integration
- data fusion
- information fusion
- data sources
- multiple data sources
- data management
- databases
- data warehouse
- data model
- data processing
- data quality
- information integration
- multiple sources
- artificial intelligence
- business intelligence
- outlier detection
- data warehousing
- information extraction
- case study
- web usage mining
- fraud detection
- data mining
- record linkage
- text classification
- database