Akira Naruse

Publication Activity (10 Years)

Years Active: 2002-2024
Publications (10 Years): 15

Top Topics

Nelder Mead Simplex

Highly Parallel

Approximate Nearest Neighbor Search

Top Venues

Publications

Hiroyuki Ootomo, Akira Naruse, Corey Nolet, Ray Wang, Tamas Feher, Yong Wang
CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs. ICDE (2024)
Hiroyuki Ootomo, Akira Naruse, Corey Nolet, Ray Wang, Tamas Feher, Yong Wang
CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs. CoRR (2023)
Hiroyuki Ootomo, Akira Naruse
Custom 8-bit floating point value format for reducing shared memory bank conflict in approximate nearest neighbor search. CoRR (2023)
Jingrong Zhang, Akira Naruse, Xipeng Li, Yong Wang
Parallel Top-K Algorithms on GPU: A Comprehensive Study and New Methods. SC (2023)
Kazuki Osawa, Yohei Tsuji, Yuichiro Ueno, Akira Naruse, Chuan-Sheng Foo, Rio Yokota
Scalable and Practical Natural Gradient for Large-Scale Deep Learning. IEEE Trans. Pattern Anal. Mach. Intell. 44 (1) (2022)
Yuichiro Ueno, Kazuki Osawa, Yohei Tsuji, Akira Naruse, Rio Yokota
Rich Information is Affordable: A Systematic Performance Analysis of Second-order Optimization Using K-FAC. KDD (2020)
Kazuki Osawa, Yohei Tsuji, Yuichiro Ueno, Akira Naruse, Chuan-Sheng Foo, Rio Yokota
Scalable and Practical Natural Gradient for Large-Scale Deep Learning. CoRR (2020)
Takuma Yamaguchi, Kohei Fujita, Tsuyoshi Ichimura, Akira Naruse, Jack C. Wells, Christopher Zimmer, Tjerk P. Straatsma, Muneo Hori, Lalith Maddegedara, Naonori Ueda
Low-Order Finite Element Solver with Small Matrix-Matrix Multiplication Accelerated by AI-Specific Hardware for Crustal Deformation Computation. PASC (2020)
Kazuki Osawa, Yohei Tsuji, Yuichiro Ueno, Akira Naruse, Rio Yokota, Satoshi Matsuoka
Large-Scale Distributed Second-Order Optimization Using Kronecker-Factored Approximate Curvature for Deep Convolutional Neural Networks. CVPR (2019)
Yohei Tsuji, Kazuki Osawa, Yuichiro Ueno, Akira Naruse, Rio Yokota, Satoshi Matsuoka
Performance Optimizations and Analysis of Distributed Deep Learning with Approximated Second-Order Optimization Method. ICPP Workshops (2019)
Takuma Yamaguchi, Kohei Fujita, Tsuyoshi Ichimura, Akira Naruse, Lalith Maddegedara, Muneo Hori
GPU Implementation of a Sophisticated Implicit Low-Order Finite Element Solver with FP21-32-64 Computation Using OpenACC. WACCPD@SC (2019)
Tsuyoshi Ichimura, Kohei Fujita, Takuma Yamaguchi, Akira Naruse, Jack C. Wells, Thomas C. Schulthess, Tjerk P. Straatsma, Christopher Zimmer, Maxime Martinasso, Kengo Nakajima, Muneo Hori, Lalith Maddegedara
A fast scalable implicit solver for nonlinear time-evolution earthquake city problem on low-ordered unstructured finite elements with artificial intelligence and transprecision computing. SC (2018)
Kazuki Osawa, Yohei Tsuji, Yuichiro Ueno, Akira Naruse, Rio Yokota, Satoshi Matsuoka
Second-order Optimization Method for Large Mini-batch: Training ResNet-50 on ImageNet in 35 Epochs. CoRR (2018)
Tsuyoshi Ichimura, Kohei Fujita, Masashi Horikoshi, Larry Meadows, Kengo Nakajima, Takuma Yamaguchi, Kentaro Koyama, Hikaru Inoue, Akira Naruse, Keisuke Katsushima, Muneo Hori, Lalith Maddegedara
A Fast Scalable Implicit Solver with Concentrated Computation for Nonlinear Time-Evolution Problems on Low-Order Unstructured Finite Elements. IPDPS (2018)
Michio Katouda, Akira Naruse, Yukihiko Hirano, Takahito Nakajima
Massively parallel algorithm and implementation of RI-MP2 energy calculation for peta-scale many-core supercomputers. J. Comput. Chem. 37 (30) (2016)
Masahiro Miwa, Kohta Nakashima, Akira Naruse
Interference-aware Incoming Message Detection for MPI Threaded Progression. CCGRID (2013)
Masaya Yoshikawa, Akira Naruse
Multiplexing aware arbiter physical unclonable function. IRI (2012)
Masaya Yoshikawa, Akira Naruse, Shinsuke Souboku
Adaptive Immune Algorithm Considering Intensification and Diversification. IRI (2009)
Shinji Sumimoto, Kohta Nakashima, Akira Naruse, Kouichi Kumon, Takashi Yasui, Yoshikazu Kamoshida, Hiroya Matsuba, Atsushi Hori, Yutaka Ishikawa
The Design of Seamless MPI Computing Environment for Commodity-Based Clusters. PVM/MPI (2009)
Shuji Yamamura, Akira Hirai, Mitsuru Sato, Masao Yamamoto, Akira Naruse, Kouichi Kumon
Speeding Up Kernel Scheduler by Reducing Cache Misses. USENIX Annual Technical Conference, FREENIX Track (2002)