A performance comparison of Dask and Apache Spark for data-intensive neuroimaging pipelines.
Mathieu DugréValérie Hayot-SassonTristan GlatardPublished in: CoRR (2019)
Keyphrases
- data intensive
- big data
- data management
- web services
- globally distributed
- geographically distributed
- earth science
- open source
- data access
- grid computing
- data processing
- cloud computing
- knowledge discovery
- databases
- data warehousing
- response time
- object oriented
- human brain
- management system
- social media
- data analysis
- information retrieval