A performance comparison of Dask and Apache Spark for data-intensive neuroimaging pipelines.

Mathieu Dugré Valérie Hayot-Sasson Tristan Glatard

Published in: CoRR (2019)

Keyphrases

data intensive
big data
data management
web services
globally distributed
geographically distributed
earth science
open source
data access
grid computing
data processing
cloud computing
knowledge discovery
databases
data warehousing
response time
object oriented
human brain
management system
social media
data analysis
information retrieval