A framework for detecting unnecessary industrial data in ETL processes.
Philip WoodallTorben JessMark HarrisonDuncan C. McFarlaneAmar ShahWilliam E. KrechelEric NicksPublished in: INDIN (2014)
Keyphrases
- data sets
- data collection
- statistical analysis
- data mining techniques
- experimental data
- database
- complex data
- data points
- computer systems
- original data
- data quality
- image data
- data acquisition
- data distribution
- missing data
- end users
- data sources
- historical data
- training data
- raw data
- multimedia data
- industrial applications
- network structure
- feature selection
- communication channels
- data mining algorithms
- social networks
- data processing
- information systems
- knowledge discovery
- probability distribution
- data analysis
- data structure
- high quality