Designing a graphics processing unit accelerated petaflop capable lattice Boltzmann solver: Read aligned data layouts and asynchronous communication.
Fredrik RobertsenJan WesterholmKeijo MattilaPublished in: Int. J. High Perform. Comput. Appl. (2017)