DataHub: Collaborative Data Science & Dataset Version Management at Scale.
Anant P. BhardwajSouvik BhattacherjeeAmit ChavanAmol DeshpandeAaron J. ElmoreSamuel MaddenAditya G. ParameswaranPublished in: CoRR (2014)
Keyphrases
- data science
- big data
- statistical learning
- management system
- data processing
- information management
- knowledge management
- information systems
- collaborative learning
- machine learning
- benchmark datasets
- data management
- information technology
- case study
- project management
- synthetic datasets
- decision makers
- decision support
- databases
- social media
- multiscale
- database systems
- decision making
- real world