Towards an Accurate Latency Model for Convolutional Neural Network Layers on GPUs.
Jinyang LiRunyu MaVikram Sharma MailthodyColin SamplawskiBenjamin M. MarlinSongqing ChenShuochao YaoTarek F. AbdelzaherPublished in: MILCOM (2021)
Keyphrases
- convolutional neural network
- experimental data
- objective function
- multi layer
- cost function
- probabilistic model
- probability distribution
- real time
- statistical model
- mathematical model
- general purpose
- formal model
- theoretical framework
- detection method
- theoretical analysis
- management system
- prior knowledge
- multi agent
- genetic algorithm