Disdat: Bundle Data Management for Machine Learning Pipelines.
Ken YocumSean RowanJonathan LuntTheodore M. WongPublished in: OpML (2019)
Keyphrases
- data management
- machine learning
- big data
- data warehousing
- query processing
- database management systems
- machine learning methods
- database systems
- data mining
- artificial intelligence
- data processing
- databases
- feature selection
- pattern recognition
- decision trees
- cloud computing
- data warehouse
- machine learning algorithms
- explanation based learning
- supervised learning
- database
- neural network
- learning algorithm
- active learning
- data management systems
- heterogeneous data
- computer vision
- natural language
- knowledge acquisition
- text classification
- computer science
- training data
- social networks
- learning problems
- search engine
- data analysis
- inductive learning
- distributed systems
- computational intelligence
- natural language processing