Po-chun Hsu
Publication Activity (10 Years)
Years Active: 2018-2022
Publications (10 Years): 18
Top Topics
Random Field Models
Prosodic Features
Autoregressive
Visual Learning
Top Venues
CoRR
ICASSP
INTERSPEECH
SLT
Publications
Da-Rong Liu, Po-chun Hsu, Yi-Chen Chen, Sung-Feng Huang, Shun-Po Chuang, Da-Yi Wu, Hung-yi Lee: Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network. CoRR (2022)
Da-Rong Liu, Po-chun Hsu, Yi-Chen Chen, Sung-Feng Huang, Shun-Po Chuang, Da-Yi Wu, Hung-yi Lee: Learning Phone Recognition From Unpaired Audio and Phone Sequences Based on Generative Adversarial Network. IEEE ACM Trans. Audio Speech Lang. Process. 30 (2022)
Po-chun Hsu, Da-Rong Liu, Andy T. Liu, Hung-yi Lee: Parallel Synthesis for Autoregressive Speech Generation. CoRR (2022)
Haibin Wu, Po-chun Hsu, Ji Gao, Shanshan Zhang, Shen Huang, Jian Kang, Zhiyong Wu, Helen Meng, Hung-Yi Lee: Adversarial Sample Detection for Speaker Verification by Neural Vocoders. ICASSP (2022)
Chi-Luen Feng, Po-chun Hsu, Hung-yi Lee: Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information. CoRR (2022)
Fan-Lin Wang, Po-chun Hsu, Da-Rong Liu, Hung-yi Lee: Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis. CoRR (2022)
Haibin Wu, Po-chun Hsu, Ji Gao, Shanshan Zhang, Shen Huang, Jian Kang, Zhiyong Wu, Helen Meng, Hung-yi Lee: Spotting adversarial samples for speaker verification by neural vocoders. CoRR (2021)
Chung-Ming Chien, Jheng-Hao Lin, Chien-yu Huang, Po-chun Hsu, Hung-yi Lee: Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech. CoRR (2021)
Chung-Ming Chien, Jheng-Hao Lin, Chien-yu Huang, Po-chun Hsu, Hung-yi Lee: Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech. ICASSP (2021)
Andy T. Liu, Shu-Wen Yang, Po-Han Chi, Po-chun Hsu, Hung-yi Lee: Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders. ICASSP (2020)
Po-chun Hsu, Hung-yi Lee: WG-WaveNet: Real-Time High-Fidelity Speech Synthesis without GPU. CoRR (2020)
Po-chun Hsu, Hung-yi Lee: WG-WaveNet: Real-Time High-Fidelity Speech Synthesis Without GPU. INTERSPEECH (2020)
Andy T. Liu, Shu-Wen Yang, Po-Han Chi, Po-chun Hsu, Hung-yi Lee: Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders. CoRR (2019)
Po-chun Hsu, Chun-hsuan Wang, Andy T. Liu, Hung-yi Lee: Towards Robust Neural Vocoding for Speech Generation: A Survey. CoRR (2019)
Andy T. Liu, Po-chun Hsu, Hung-yi Lee: Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice Conversion. INTERSPEECH (2019)
Andy T. Liu, Po-chun Hsu, Hung-yi Lee: Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice Conversion. CoRR (2019)
Cheng-chieh Yeh, Po-chun Hsu, Ju-Chieh Chou, Hung-yi Lee, Lin-Shan Lee: Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences. CoRR (2018)
Cheng-chieh Yeh, Po-chun Hsu, Ju-Chieh Chou, Hung-yi Lee, Lin-Shan Lee: Rhythm-Flexible Voice Conversion Without Parallel Data Using Cycle-GAN Over Phoneme Posteriorgram Sequences. SLT (2018)