OB-Tree: Accelerating Data Cleaning in Out-of-Core Column-Store Databases.
Feng YuBrandon J. LatronicsTyler MatacicEric S. JonesPublished in: BigData Congress (2017)
Keyphrases
- data cleaning
- column oriented
- data integration
- databases
- database
- record linkage
- data quality
- outlier detection
- text classification
- data management
- data warehouse
- data processing
- knowledge discovery
- data warehousing
- data model
- database applications
- relational databases
- fraud detection
- missing values
- web usage mining
- data sources
- database systems
- case study
- linked data
- data mining
- database management systems
- privacy preserving
- multi class
- knn
- data analysis