Chun-Yi Kuan
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 9
Top Topics
Passage Retrieval
Speech Recognition
Dirichlet Prior
Language Model
Top Venues
CoRR
ICASSP
ASRU
Publications
Chun-Yi Kuan, Wei-Ping Huang, Hung-yi Lee
Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models.
CoRR (2024)
Yi-Cheng Lin, Tzu-Quan Lin, Chih-Kai Yang, Ke-Han Lu, Wei-Chih Chen, Chun-Yi Kuan, Hung-yi Lee
Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models.
CoRR (2024)
Chih-Kai Yang, Kuan-Po Huang, Ke-Han Lu, Chun-Yi Kuan, Chi-Yuan Hsiao, Hung-yi Lee
Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision.
CoRR (2024)
Chien-yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan S. Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-yi Lee
Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech.
ICASSP (2024)
Cheng-Han Chiang, Wei-Chih Chen, Chun-Yi Kuan, Chienchou Yang, Hung-yi Lee
Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course.
CoRR (2024)
Chun-Yi Kuan, Chih-Kai Yang, Wei-Ping Huang, Ke-Han Lu, Hung-yi Lee
Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation.
CoRR (2024)
Chun-Yi Kuan, Chen-An Li, Tsu-Yuan Hsu, Tse-Yang Lin, Ho-Lam Chung, Kai-Wei Chang, Shuo-Yiin Chang, Hung-yi Lee
Towards General-Purpose Text-Instruction-Guided Voice Conversion.
CoRR (2023)
Chun-Yi Kuan, Chen-An Li, Tsu-Yuan Hsu, Tse-Yang Lin, Ho-Lam Chung, Kai-Wei Chang, Shuo-Yiin Chang, Hung-yi Lee
Towards General-Purpose Text-Instruction-Guided Voice Conversion.
ASRU (2023)
Chien-yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan S. Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-yi Lee
Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech.
CoRR (2023)