Po-chun Hsu

Publication Activity (10 Years)

Years Active: 2018-2022
Publications (10 Years): 18

Top Topics

Random Field Models

Prosodic Features

Visual Learning

Top Venues

Publications

Da-Rong Liu, Po-chun Hsu, Yi-Chen Chen, Sung-Feng Huang, Shun-Po Chuang, Da-Yi Wu, Hung-yi Lee
Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network. CoRR (2022)
Da-Rong Liu, Po-chun Hsu, Yi-Chen Chen, Sung-Feng Huang, Shun-Po Chuang, Da-Yi Wu, Hung-yi Lee
Learning Phone Recognition From Unpaired Audio and Phone Sequences Based on Generative Adversarial Network. IEEE ACM Trans. Audio Speech Lang. Process. 30 (2022)
Po-chun Hsu, Da-Rong Liu, Andy T. Liu, Hung-yi Lee
Parallel Synthesis for Autoregressive Speech Generation. CoRR (2022)
Haibin Wu, Po-chun Hsu, Ji Gao, Shanshan Zhang, Shen Huang, Jian Kang, Zhiyong Wu, Helen Meng, Hung-Yi Lee
Adversarial Sample Detection for Speaker Verification by Neural Vocoders. ICASSP (2022)
Chi-Luen Feng, Po-chun Hsu, Hung-yi Lee
Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information. CoRR (2022)
Fan-Lin Wang, Po-chun Hsu, Da-Rong Liu, Hung-yi Lee
Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis. CoRR (2022)
Haibin Wu, Po-chun Hsu, Ji Gao, Shanshan Zhang, Shen Huang, Jian Kang, Zhiyong Wu, Helen Meng, Hung-yi Lee
Spotting adversarial samples for speaker verification by neural vocoders. CoRR (2021)
Chung-Ming Chien, Jheng-Hao Lin, Chien-yu Huang, Po-chun Hsu, Hung-yi Lee
Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech. CoRR (2021)
Chung-Ming Chien, Jheng-Hao Lin, Chien-yu Huang, Po-chun Hsu, Hung-yi Lee
Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech. ICASSP (2021)
Andy T. Liu, Shu-Wen Yang, Po-Han Chi, Po-chun Hsu, Hung-yi Lee
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders. ICASSP (2020)
Po-chun Hsu, Hung-yi Lee
WG-WaveNet: Real-Time High-Fidelity Speech Synthesis without GPU. CoRR (2020)
Po-chun Hsu, Hung-yi Lee
WG-WaveNet: Real-Time High-Fidelity Speech Synthesis Without GPU. INTERSPEECH (2020)
Andy T. Liu, Shu-Wen Yang, Po-Han Chi, Po-chun Hsu, Hung-yi Lee
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders. CoRR (2019)
Po-chun Hsu, Chun-hsuan Wang, Andy T. Liu, Hung-yi Lee
Towards Robust Neural Vocoding for Speech Generation: A Survey. CoRR (2019)
Andy T. Liu, Po-chun Hsu, Hung-yi Lee
Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice Conversion. INTERSPEECH (2019)
Andy T. Liu, Po-chun Hsu, Hung-yi Lee
Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice Conversion. CoRR (2019)
Cheng-chieh Yeh, Po-chun Hsu, Ju-Chieh Chou, Hung-yi Lee, Lin-Shan Lee
Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences. CoRR (2018)
Cheng-chieh Yeh, Po-chun Hsu, Ju-Chieh Chou, Hung-yi Lee, Lin-Shan Lee
Rhythm-Flexible Voice Conversion Without Parallel Data Using Cycle-GAN Over Phoneme Posteriorgram Sequences. SLT (2018)