​
Login / Signup
Akira Naruse
ORCID
Publication Activity (10 Years)
Years Active: 2002-2024
Publications (10 Years): 15
Top Topics
Deep Learning
Nelder Mead Simplex
Highly Parallel
Approximate Nearest Neighbor Search
Top Venues
CoRR
SC
WACCPD@SC
KDD
</>
Publications
</>
Hiroyuki Ootomo
,
Akira Naruse
,
Corey Nolet
,
Ray Wang
,
Tamas Feher
,
Yong Wang
CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs.
ICDE
(2024)
Hiroyuki Ootomo
,
Akira Naruse
,
Corey Nolet
,
Ray Wang
,
Tamas Feher
,
Yong Wang
CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs.
CoRR
(2023)
Hiroyuki Ootomo
,
Akira Naruse
Custom 8-bit floating point value format for reducing shared memory bank conflict in approximate nearest neighbor search.
CoRR
(2023)
Jingrong Zhang
,
Akira Naruse
,
Xipeng Li
,
Yong Wang
Parallel Top-K Algorithms on GPU: A Comprehensive Study and New Methods.
SC
(2023)
Kazuki Osawa
,
Yohei Tsuji
,
Yuichiro Ueno
,
Akira Naruse
,
Chuan-Sheng Foo
,
Rio Yokota
Scalable and Practical Natural Gradient for Large-Scale Deep Learning.
IEEE Trans. Pattern Anal. Mach. Intell.
44 (1) (2022)
Yuichiro Ueno
,
Kazuki Osawa
,
Yohei Tsuji
,
Akira Naruse
,
Rio Yokota
Rich Information is Affordable: A Systematic Performance Analysis of Second-order Optimization Using K-FAC.
KDD
(2020)
Kazuki Osawa
,
Yohei Tsuji
,
Yuichiro Ueno
,
Akira Naruse
,
Chuan-Sheng Foo
,
Rio Yokota
Scalable and Practical Natural Gradient for Large-Scale Deep Learning.
CoRR
(2020)
Takuma Yamaguchi
,
Kohei Fujita
,
Tsuyoshi Ichimura
,
Akira Naruse
,
Jack C. Wells
,
Christopher Zimmer
,
Tjerk P. Straatsma
,
Muneo Hori
,
Lalith Maddegedara
,
Naonori Ueda
Low-Order Finite Element Solver with Small Matrix-Matrix Multiplication Accelerated by AI-Specific Hardware for Crustal Deformation Computation.
PASC
(2020)
Kazuki Osawa
,
Yohei Tsuji
,
Yuichiro Ueno
,
Akira Naruse
,
Rio Yokota
,
Satoshi Matsuoka
Large-Scale Distributed Second-Order Optimization Using Kronecker-Factored Approximate Curvature for Deep Convolutional Neural Networks.
CVPR
(2019)
Yohei Tsuji
,
Kazuki Osawa
,
Yuichiro Ueno
,
Akira Naruse
,
Rio Yokota
,
Satoshi Matsuoka
Performance Optimizations and Analysis of Distributed Deep Learning with Approximated Second-Order Optimization Method.
ICPP Workshops
(2019)
Takuma Yamaguchi
,
Kohei Fujita
,
Tsuyoshi Ichimura
,
Akira Naruse
,
Lalith Maddegedara
,
Muneo Hori
GPU Implementation of a Sophisticated Implicit Low-Order Finite Element Solver with FP21-32-64 Computation Using OpenACC.
WACCPD@SC
(2019)
Tsuyoshi Ichimura
,
Kohei Fujita
,
Takuma Yamaguchi
,
Akira Naruse
,
Jack C. Wells
,
Thomas C. Schulthess
,
Tjerk P. Straatsma
,
Christopher Zimmer
,
Maxime Martinasso
,
Kengo Nakajima
,
Muneo Hori
,
Lalith Maddegedara
A fast scalable implicit solver for nonlinear time-evolution earthquake city problem on low-ordered unstructured finite elements with artificial intelligence and transprecision computing.
SC
(2018)
Kazuki Osawa
,
Yohei Tsuji
,
Yuichiro Ueno
,
Akira Naruse
,
Rio Yokota
,
Satoshi Matsuoka
Second-order Optimization Method for Large Mini-batch: Training ResNet-50 on ImageNet in 35 Epochs.
CoRR
(2018)
Tsuyoshi Ichimura
,
Kohei Fujita
,
Masashi Horikoshi
,
Larry Meadows
,
Kengo Nakajima
,
Takuma Yamaguchi
,
Kentaro Koyama
,
Hikaru Inoue
,
Akira Naruse
,
Keisuke Katsushima
,
Muneo Hori
,
Lalith Maddegedara
A Fast Scalable Implicit Solver with Concentrated Computation for Nonlinear Time-Evolution Problems on Low-Order Unstructured Finite Elements.
IPDPS
(2018)
Michio Katouda
,
Akira Naruse
,
Yukihiko Hirano
,
Takahito Nakajima
Massively parallel algorithm and implementation of RI-MP2 energy calculation for peta-scale many-core supercomputers.
J. Comput. Chem.
37 (30) (2016)
Masahiro Miwa
,
Kohta Nakashima
,
Akira Naruse
Interference-aware Incoming Message Detection for MPI Threaded Progression.
CCGRID
(2013)
Masaya Yoshikawa
,
Akira Naruse
Multiplexing aware arbiter physical unclonable function.
IRI
(2012)
Masaya Yoshikawa
,
Akira Naruse
,
Shinsuke Souboku
Adaptive Immune Algorithm Considering Intensification and Diversification.
IRI
(2009)
Shinji Sumimoto
,
Kohta Nakashima
,
Akira Naruse
,
Kouichi Kumon
,
Takashi Yasui
,
Yoshikazu Kamoshida
,
Hiroya Matsuba
,
Atsushi Hori
,
Yutaka Ishikawa
The Design of Seamless MPI Computing Environment for Commodity-Based Clusters.
PVM/MPI
(2009)
Shuji Yamamura
,
Akira Hirai
,
Mitsuru Sato
,
Masao Yamamoto
,
Akira Naruse
,
Kouichi Kumon
Speeding Up Kernel Scheduler by Reducing Cache Misses.
USENIX Annual Technical Conference, FREENIX Track
(2002)