Scheduling independent tasks sharing large data distributed with BitTorrent.
Baohua WeiGilles FedakFranck CappelloPublished in: GRID (2005)
Keyphrases
- data sets
- distributed data
- database
- distributed systems
- original data
- data processing
- input data
- data points
- scheduling problem
- peer to peer
- data mining
- data sources
- knowledge discovery
- prior knowledge
- data mining techniques
- data analysis
- high quality
- synthetic data
- missing data
- training data
- grid computing
- heterogeneous data
- data transfer