Nan Wu
Publication Activity (10 Years)
Years Active: 2004-2019
Publications (10 Years): 1
Top Topics
Graphics Processors
Cpu Implementation
Memory Management
River Basin
Top Venues
ICPADS
IEICE Trans. Inf. Syst.
J. Supercomput.
ISC
Publications
Yang Shi, Jiawei Fei, Mei Wen, Qun Huang, Nan Wu
Metaflow: A Better Traffic Abstraction for Distributed Applications.
HPCC/SmartCity/DSS
(2019)
Dafei Huang, Changqing Xun, Nan Wu, Mei Wen, Chunyuan Zhang, Xing Cai, Qianming Yang
Enabling a Uniform OpenCL Device View for Heterogeneous Platforms.
IEICE Trans. Inf. Syst.
(4) (2015)
Jun Chai, Johan Hake, Nan Wu, Mei Wen, Xing Cai, Glenn Terje Lines, Jing Yang, Huayou Su, Chunyuan Zhang, Xiangke Liao
Towards simulation of subcellular calcium dynamics at nanometre resolution.
Int. J. High Perform. Comput. Appl.
29 (1) (2015)
Xinnan Dong, Jun Chai, Jing Yang, Mei Wen, Nan Wu, Xing Cai, Chunyuan Zhang, Zhaoyun Chen
Utilizing Multiple Xeon Phi Coprocessors on One Compute Node.
ICA3PP (2)
(2014)
Mei Wen, Huayou Su, Wenjie Wei, Nan Wu, Xing Cai, Chunyuan Zhang
High efficient sedimentary basin simulations on hybrid CPU-GPU clusters.
Clust. Comput.
17 (2) (2014)
Dafei Huang, Mei Wen, Changqing Xun, Dong Chen, Xing Cai, Yuran Qiao, Nan Wu, Chunyuan Zhang
Automated Transformation of GPU-Specific OpenCL Kernels Targeting Performance Portability on Multi-Core/Many-Core CPUs.
Euro-Par
(2014)
Jun Chai, Huayou Su, Mei Wen, Xing Cai, Nan Wu, Chunyuan Zhang
Resource-efficient utilization of CPU/GPU-based heterogeneous supercomputers for Bayesian phylogenetic inference.
J. Supercomput.
66 (1) (2013)
Huayou Su, Nan Wu, Mei Wen, Chunyuan Zhang, Xing Cai
On the GPU Performance of 3D Stencil Computations Implemented in OpenCL.
ISC
(2013)
Jun Chai, Mei Wen, Nan Wu, Dafei Huang, Jing Yang, Xing Cai, Chunyuan Zhang, Qianming Yang
Simulating Cardiac Electrophysiology in the Era of GPU-Cluster Computing.
IEICE Trans. Inf. Syst.
(12) (2013)
Huayou Su, Nan Wu, Mei Wen, Chunyuan Zhang, Xing Cai
On the GPU-CPU Performance Portability of OpenCL for 3D Stencil Computations.
ICPADS
(2013)
Qianming Yang, Mei Wen, Nan Wu, Chunyuan Zhang
Accelerating thread-intensive and explicit memory management programs with dynamic partial reconfiguration.
J. Supercomput.
63 (2) (2013)
Huayou Su, Nan Wu, Mei Wen, Chunyuan Zhang, Xing Cai
Performance of Sediment Transport Simulations on NVIDIA's Kepler Architecture.
ICCS
(2013)
Jing Yang, Jun Chai, Mei Wen, Nan Wu, Chunyuan Zhang
Solving the Cardiac Model Using Multi-core CPU and Many Integrated Cores (MIC).
HPCC/EUC
(2013)
Nan Wu, Yuran Qiao, Mei Wen, Chunyuan Zhang
ACF: Networks-on-Chip Deadlock Recovery with Accurate Detection and Elastic Credit.
APPT
(2013)
Changqing Xun, Mei Wen, Nan Wu, Chunyuan Zhang, Hayden Kwok-Hay So
Extending BORPH for shared memory reconfigurable computers.
FPL
(2012)
Nan Wu, Mei Wen, Huayou Su, Ju Ren, Chunyuan Zhang
A Parallel H.264 Encoder with CUDA: Mapping and Evaluation.
ICPADS
(2012)
Mei Wen, Nan Wu, Qianming Yang, Chunyuan Zhang, Liang Zhao
The masala machine: accelerating thread-intensive and explicit memory management programs with dynamically reconfigurable FPGAs (abstract only).
FPGA
(2012)
Mei Wen, Huayou Su, Wenjie Wei, Nan Wu, Xing Cai, Chunyuan Zhang
Using 1000+ GPUs and 10000+ CPUs for Sedimentary Basin Simulations.
CLUSTER
(2012)
Nan Wu, Qianming Yang, Mei Wen, Yi He, Ju Ren, Maolin Guan, Chunyuan Zhang
Tiled Multi-Core Stream Architecture.
Trans. High Perform. Embed. Archit. Compil.
4 (2011)
Huayou Su, Chunyuan Zhang, Jun Chai, Mei Wen, Nan Wu, Ju Ren
A high-efficient software parallel CAVLC encoder based on GPU.
TSP
(2011)
Huayou Su, Nan Wu, Chunyuan Zhang, Mei Wen, Ju Ren
A Multilevel Parallel Intra Coding for H.264/AVC Based on CUDA.
ICIG
(2011)
Huayou Su, Chunyuan Zhang, Jun Chai, Mei Wen, Nan Wu, Ju Ren
High-efficient software parallel CAVLC encoder based on programmable stream processor.
ACM Multimedia
(2011)
Yi He, Ju Ren, Mei Wen, Qianming Yang, Nan Wu, Chunyuan Zhang
Software Managed Instruction Scratchpad Memory Optimization in Stream Architecture Based on Hot Code Analysis of Kernels.
DSD
(2010)
Ju Ren, Mei Wen, Chunyuan Zhang, Huayou Su, Yi He, Nan Wu
A Parallel Streaming Motion Estimation for Real-Time HD H.264 Encoding on Programmable Processors.
FCST
(2010)
Qianming Yang, Nan Wu, Mei Wen, Yi He, Huayou Su, Chunyuan Zhang
SAT: A Stream Architecture Template for Embedded Applications.
CIT
(2010)
Ju Ren, Yi He, Wei Wu, Mei Wen, Nan Wu, Chunyuan Zhang
Software parallel CAVLC encoder based on stream processing.
ESTIMedia
(2009)
Nan Wu, Mei Wen, Ju Ren, Yi He, Changqing Xun, Wei Wu, Chunyuan Zhang
Cache streamization for high performance stream processor.
HiPC
(2009)
Nan Wu, Mei Wen, Wei Wu, Ju Ren, Huayou Su, Changqing Xun, Chunyuan Zhang
Streaming HD H.264 encoder on programmable processors.
ACM Multimedia
(2009)
Mei Wen, Nan Wu, Maolin Guan, Chunyuan Zhang
Load scheduling: Reducing pressure on distributed register files for free.
ASP-DAC
(2008)
Mei Wen, Nan Wu, Chunyuan Zhang, Qianming Yang, Ju Ren, Yi He, Wei Wu, Jun Chai, Maolin Guan, Changqing Xun
On-Chip Memory System Optimization Design for the FT64 Scientific Stream Accelerator.
IEEE Micro
28 (4) (2008)
Yi He, Ju Ren, Qianming Yang, Mei Wen, Nan Wu, Chunyuan Zhang
FPGA-based Equivalent Simulation Technology (FEST) for clustered stream architecture.
ACSAC
(2008)
Mei Wen, Nan Wu, Chunyuan Zhang, Wei Wu, Qianming Yang, Changqing Xun
FT64: Scientific Computing with Streams.
HiPC
(2007)
Nan Wu, Qianming Yang, Mei Wen, Yi He, Changqing Xun, Chunyuan Zhang
A Stream System-on-Chip Architecture for High Speed Target Recognition Based on Biologic Vision.
Asia-Pacific Computer Systems Architecture Conference
(2007)
Nan Wu, Mei Wen, Ju Ren, Yi He, Chunyuan Zhang
Register Allocation on Stream Processor with Local Register File.
Asia-Pacific Computer Systems Architecture Conference
(2006)
Mei Wen, Nan Wu, Changqing Xun, Wei Wu, Chunyuan Zhang
Analysis and Performance Results of a fluid dynamics Application on MASA Stream Processor.
ACIS-ICIS
(2006)
Mei Wen, Nan Wu, Changqing Xun, Wei Wu, Chunyuan Zhang
Optimization and Evaluating of StreamYGX2 on MASA Stream Processor.
Asia-Pacific Computer Systems Architecture Conference
(2006)
Haiyan Li, Mei Wen, Chunyuan Zhang, Nan Wu, Li Li, Changqing Xun
Accelerated Motion Estimation of H.264 on Imagine Stream Processor.
ICIAR
(2005)
Nan Wu, Mei Wen, Haiyan Li, Li Li, Chunyuan Zhang
A Stream Architecture Supporting Multiple Stream Execution Models.
Asia-Pacific Computer Systems Architecture Conference
(2005)
Mei Wen, Nan Wu, Haiyan Li, Chunyuan Zhang
Multiple-Morphs Adaptive Stream Architecture.
J. Comput. Sci. Technol.
20 (5) (2005)
Mei Wen, Chunyuan Zhang, Nan Wu, Haiyan Li, Li Li
A Parallel Reed-Solomon Decoder on the Imagine Stream Processor.
ISPA
(2004)
Mei Wen, Nan Wu, Haiyan Li, Chunyuan Zhang
Multiple-Dimension Scalable Adaptive Stream Architecture.
Asia-Pacific Computer Systems Architecture Conference
(2004)