Building a scalable global data processing pipeline for large astronomical photometric datasets.
Paul DoylePublished in: CoRR (2015)
Keyphrases
- data sets
- database
- raw data
- high quality
- experimental conditions
- data analysis
- data sources
- data processing
- data collection
- processing pipeline
- global scale
- data mining tasks
- data quality
- original data
- spatial data
- missing data
- benchmark datasets
- synthetic data
- statistical analysis
- computer systems
- data points
- training data
- network structure
- xml documents
- training dataset
- global information
- data structure
- data mining