MLINSPECT: A Data Distribution Debugger for Machine Learning Pipelines.
Stefan Grafberger, Shubha Guha, Julia Stoyanovich, Sebastian Schelter
Published in: SIGMOD Conference (2021)
Keyphrases
- data distribution
- machine learning
- data streams
- concept drift
- decision trees
- pattern recognition
- high dimensional data
- streaming data
- index structure
- communication cost
- computer vision
- database systems
- active learning
- learning algorithm
- decision boundary
- skyline queries
- neural network
- data points
- data mining
- supervised learning
- high dimensional
- feature selection