An MPI-CUDA Implementation and Optimization for Parallel Sparse Equations and Least Squares (LSQR).
He HuangLiqiang WangEn-Jui LeePo ChenPublished in: ICCS (2012)
Keyphrases
- parallel implementation
- least squares
- shared memory
- message passing interface
- parallel programming
- parallel computing
- parallel computers
- distributed memory
- parallel computation
- compute unified device architecture
- general purpose
- nonlinear least squares
- sparse linear
- parallel architecture
- sparse representation
- massively parallel
- parallel algorithm
- parallel processing
- graphic processing unit
- optimization problems
- optimization algorithm
- scientific computing
- global optimization
- computer architecture
- sparse data
- differential equations
- efficient implementation
- joint optimization
- robust estimation
- hardware implementation
- parallel architectures
- high performance computing
- mathematical model
- moving objects
- parallelization strategy
- constrained optimization