Preliminary findings on the occurrence and causes of data smells in a real-world business travel data processing pipeline.
Valentina GolendukhinaHarald FoidlMichael FeldererRudolf RamlerPublished in: SEA4DQ@ESEC/SIGSOFT FSE (2022)
Keyphrases
- data sets
- data analysis
- real world
- complex data
- synthetic data
- statistical analysis
- data points
- sensor data
- missing data
- data processing
- high quality
- data sources
- end users
- image data
- small number
- raw data
- training data
- experimental data
- neural network
- big data
- data quality
- noisy data
- data objects
- database
- data acquisition
- information systems
- business processes
- data mining
- data mining techniques
- probability distribution
- social networks
- wide range
- decision trees
- decision making