Skluma: A Statistical Learning Pipeline for Taming Unkempt Data Repositories.
Paul G. BeckmanTyler J. SkluzacekKyle ChardIan T. FosterPublished in: SSDBM (2017)
Keyphrases
- statistical learning
- data repositories
- information theory
- back end
- data sources
- model selection
- metadata
- web data
- supervised learning
- data mining
- scientific data
- multi view face detection
- database
- text mining
- manifold learning
- user friendly
- web mining
- data analysis
- digital libraries
- training data
- feature selection
- machine learning
- databases