Quality-Driven Machine Learning-based Data Science Pipeline Realization: a software engineering approach.
Giordano d'Aloisio
Published in: ICSE-Companion (2022)
Keyphrases
- data science
- machine learning
- software engineering
- big data
- statistical learning
- artificial intelligence
- software quality
- knowledge engineering
- high quality
- knowledge acquisition
- software systems
- software development
- supervised learning
- information extraction
- semi-supervised learning
- programming language
- active learning
- data model
- data analysis
- decision trees
- feature selection
- data mining
- object-oriented
- data sets
- natural language processing
- relational databases
- pairwise
- similarity measure
- feature extraction
- information theory
- learning algorithm