Petabytes in Practice: Working with Collections as Data at Scale.
Will ThomasBenjamin GalewskySandeep Puthanveetil SatheesanGregory JansenRichard MarcianoShannon BradleyJong LeeLuigi MariniKenton McHenryPublished in: Data Inf. Manag. (2019)
Keyphrases
- social networks
- network structure
- data sets
- data sources
- data quality
- data collection
- raw data
- data analysis
- computer systems
- synthetic data
- image data
- spatial data
- sensor data
- high quality
- data processing
- original data
- experimental data
- neural network
- probability distribution
- prior knowledge
- training data
- multimedia
- document collections
- input data
- end users
- data acquisition
- temporal information
- knowledge base
- machine learning
- data collections