Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O.
Gaurav Khanna, Nagavijayalakshmi Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, Sriram Krishnamoorthy, P. Sadayappan, Joel H. Saltz
Published in: HPDC (2006)
Keyphrases
- data intensive
- grid computing
- file system
- batch processing
- batch size
- data management
- peer to peer
- web services
- globally distributed
- data grid
- distributed computing
- data access
- load balancing
- big data
- fault tolerance
- geographically distributed
- earth science
- wafer fabrication
- grid environment
- hard disk
- fault tolerant
- scheduling algorithm
- distributed databases
- main memory
- resource management
- grid technology
- computing environments
- data transfer
- data sets
- database
- high performance computing
- knowledge discovery
- databases