Wenhao Guan
Publication Activity (10 Years)
Years Active: 2018-2024
Publications (10 Years): 15
Top Topics
Fusing Multiple
Frequency Domain
Speech Synthesis
Scene Understanding
Top Venues
CoRR
ICASSP
ICRA
INTERSPEECH
Publications
Jiawei Hou, Xiaoyan Li, Wenhao Guan, Gang Zhang, Di Feng, Yuheng Du, Xiangyang Xue, Jian Pu
FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View.
ICRA
(2024)
Tao Li, Feng Wang, Wenhao Guan, Lingyan Huang, Qingyang Hong, Lin Li
Improving Multi-Speaker ASR With Overlap-Aware Encoding And Monotonic Attention.
ICASSP
(2024)
Yishuang Li, Hukai Huang, Zhicong Chen, Wenhao Guan, Jiayan Lin, Lin Li, Qingyang Hong
SR-HuBERT: An Efficient Pre-Trained Model for Speaker Verification.
ICASSP
(2024)
Wenhao Guan, Kaidi Wang, Wangjin Zhou, Yang Wang, Feng Deng, Hui Wang, Lin Li, Qingyang Hong, Yong Qin
LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation.
CoRR
(2024)
Hukai Huang, Shenghui Lu, Yahui Shan, He Qu, Wenhao Guan, Qingyang Hong, Lin Li
Dynamic Language Group-Based MoE: Enhancing Efficiency and Flexibility for Code-Switching Speech Recognition.
CoRR
(2024)
Wenhao Guan, Qi Su, Haodong Zhou, Shiyu Miao, Xingjia Xie, Lin Li, Qingyang Hong
Reflow-TTS: A Rectified Flow Model for High-Fidelity Text-to-Speech.
ICASSP
(2024)
Jiawei Hou, Wenhao Guan, Xiangyang Xue, Taiping Zeng
LOP-Field: Brain-inspired Layout-Object-Position Fields for Robotic Scene Understanding.
CoRR
(2024)
Xianfeng Li, Weijie Chen, Shicai Yang, Yishuang Li, Wenhao Guan, Lin Li
Multivariate Fourier Distribution Perturbation: Domain Shifts with Uncertainty in Frequency Domain.
ICASSP
(2024)
Jiawei Hou, Xiaoyan Li, Wenhao Guan, Gang Zhang, Di Feng, Yuheng Du, Xiangyang Xue, Jian Pu
FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View.
CoRR
(2024)
Wenhao Guan, Yishuang Li, Tao Li, Hukai Huang, Feng Wang, Jiayan Lin, Lingyan Huang, Lin Li, Qingyang Hong
MM-TTS: Multi-Modal Prompt Based Style Transfer for Expressive Text-to-Speech Synthesis.
AAAI
(2024)
Wenhao Guan, Qi Su, Haodong Zhou, Shiyu Miao, Xingjia Xie, Lin Li, Qingyang Hong
ReFlow-TTS: A Rectified Flow Model for High-fidelity Text-to-Speech.
CoRR
(2023)
Wenhao Guan, Tao Li, Yishuang Li, Hukai Huang, Qingyang Hong, Lin Li
Interpretable Style Transfer for Text-to-Speech with ControlVAE and Diffusion Bridge.
CoRR
(2023)
Wenhao Guan, Tao Li, Yishuang Li, Hukai Huang, Qingyang Hong, Lin Li
Interpretable Style Transfer for Text-to-Speech with ControlVAE and Diffusion Bridge.
INTERSPEECH
(2023)
Wenhao Guan, Yishuang Li, Tao Li, Hukai Huang, Feng Wang, Jiayan Lin, Lingyan Huang, Lin Li, Qingyang Hong
MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis.
CoRR
(2023)
Wenhao Guan, Yunling Wang, Jianfeng Wang, Xiaotong Fu
Verifiable memory leakage-resilient dynamic searchable encryption.
J. High Speed Networks
24 (3) (2018)