FreeLunch: Compression-based GPU Memory Management for Convolutional Neural Networks.
Shaurya PatelTongping LiuHui GuanPublished in: MCHPC@SC (2021)
Keyphrases
- memory management
- convolutional neural networks
- parallel computation
- operating system
- garbage collection
- convolutional network
- hardware implementation
- real time
- parallel algorithm
- parallel computing
- parallel implementation
- parallel processing
- java virtual machine
- computing environments
- pattern recognition
- graphics processing units
- flash memory
- parallel programming
- memory access
- distributed environment
- computer systems