Using Hardware Multithreading to Overcome Broadcast/Reduction Latency in an Associative SIMD Processor.
Kevin SchafferRobert A. WalkerPublished in: Parallel Process. Lett. (2008)
Keyphrases
- multithreading
- parallel computing
- massively parallel
- computational power
- power reduction
- highly efficient
- wireless broadcast
- parallel processing
- shared memory
- distributed memory
- parallel algorithm
- parallel architectures
- low latency
- multi core processors
- coarse grained
- data partitioning
- memory efficient
- single instruction multiple data
- parallel implementation
- associative memory
- message passing
- response time
- clock frequency
- low cost
- access control
- operating system
- computing systems
- fine grained
- processing elements
- prefetching
- neural network
- memory bandwidth
- real time