​
Login / Signup
Xiang Yin
ORCID
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 23
Top Topics
Speech Synthesis
Top Venues
CoRR
INTERSPEECH
ICASSP
ICLR
</>
Publications
</>
Zhenhui Ye
,
Tianyun Zhong
,
Yi Ren
,
Jiaqi Yang
,
Weichuang Li
,
Jiawei Huang
,
Ziyue Jiang
,
Jinzheng He
,
Rongjie Huang
,
Jinglin Liu
,
Chen Zhang
,
Xiang Yin
,
Zejun Ma
,
Zhou Zhao
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis.
ICLR
(2024)
Yilei Qiu
,
Zhou He
,
Wenyu Zhang
,
Xiang Yin
,
Chengjie Ni
MSGCN-ISTL: A multi-scaled self-attention-enhanced graph convolutional network with improved STL decomposition for probabilistic load forecasting.
Expert Syst. Appl.
238 (Part A) (2024)
Rui Liu
,
Yifan Hu
,
Yi Ren
,
Xiang Yin
,
Haizhou Li
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling.
AAAI
(2024)
Ziyue Jiang
,
Jinglin Liu
,
Yi Ren
,
Jinzheng He
,
Zhenhui Ye
,
Shengpeng Ji
,
Qian Yang
,
Chen Zhang
,
Pengfei Wei
,
Chunfeng Wang
,
Xiang Yin
,
Zejun Ma
,
Zhou Zhao
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis.
ICLR
(2024)
Rui Liu
,
Yifan Hu
,
Yi Ren
,
Xiang Yin
,
Haizhou Li
Generative Expressive Conversational Speech Synthesis.
CoRR
(2024)
Chunfeng Wang
,
Peisong Huang
,
Yuxiang Zou
,
Haoyu Zhang
,
Shichao Liu
,
Xiang Yin
,
Zejun Ma
LiteG2P: A fast, light and high accuracy model for grapheme-to-phoneme conversion.
CoRR
(2023)
Pengfei Wei
,
Lingdong Kong
,
Xinghua Qu
,
Yi Ren
,
Zhiqiang Xu
,
Jing Jiang
,
Xiang Yin
Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective.
NeurIPS
(2023)
Xinghua Qu
,
Hongyang Liu
,
Zhu Sun
,
Xiang Yin
,
Yew Soon Ong
,
Lu Lu
,
Zejun Ma
Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions, and Prospects.
CoRR
(2023)
Yahuan Cong
,
Haoyu Zhang
,
Haopeng Lin
,
Shichao Liu
,
Chunfeng Wang
,
Yi Ren
,
Xiang Yin
,
Zejun Ma
GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech.
CoRR
(2023)
Kun Song
,
Yi Ren
,
Yi Lei
,
Chunfeng Wang
,
Kun Wei
,
Lei Xie
,
Xiang Yin
,
Zejun Ma
StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation.
CoRR
(2023)
Kun Song
,
Yi Ren
,
Yi Lei
,
Chunfeng Wang
,
Kun Wei
,
Lei Xie
,
Xiang Yin
,
Zejun Ma
StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation.
INTERSPEECH
(2023)
Zhi Li
,
Pengfei Wei
,
Xiang Yin
,
Zejun Ma
,
Alex C. Kot
Virtual Try-On with Pose-Garment Keypoints Guided Inpainting.
ICCV
(2023)
Pengfei Wei
,
Xiang Yin
,
Chunfeng Wang
,
Zhonghao Li
,
Xinghua Qu
,
Zhiqiang Xu
,
Zejun Ma
S2CD: Self-heuristic Speaker Content Disentanglement for Any-to-Any Voice Conversion.
INTERSPEECH
(2023)
Yahuan Cong
,
Haoyu Zhang
,
Haopeng Lin
,
Shichao Liu
,
Chunfeng Wang
,
Yi Ren
,
Xiang Yin
,
Zejun Ma
GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech.
INTERSPEECH
(2023)
Xinghua Qu
,
Hongyang Liu
,
Zhu Sun
,
Xiang Yin
,
Yew Soon Ong
,
Lu Lu
,
Zejun Ma
Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions and Prospects.
SIGIR
(2023)
Chunfeng Wang
,
Peisong Huang
,
Yuxiang Zou
,
Haoyu Zhang
,
Shichao Liu
,
Xiang Yin
,
Zejun Ma
LiteG2P: A Fast, Light and High Accuracy Model for Grapheme-to-Phoneme Conversion.
ICASSP
(2023)
Zikai Chen
,
Lin Wu
,
Junjie Pan
,
Xiang Yin
An Automatic Soundtracking System for Text-to-Speech Audiobooks.
INTERSPEECH
(2022)
Jingning Xu
,
Benlai Tang
,
Mingjie Wang
,
Siyuan Bian
,
Wenyi Guo
,
Xiang Yin
,
Zejun Ma
Towards Using Clothes Style Transfer for Scenario-Aware Person Video Generation.
ICASSP
(2022)
Chao Wang
,
Zhonghao Li
,
Benlai Tang
,
Xiang Yin
,
Yuan Wan
,
Yibiao Yu
,
Zejun Ma
Towards high-fidelity singing voice conversion with acoustic reference and contrastive predictive coding.
INTERSPEECH
(2022)
Wudi Bao
,
Junhui Zhang
,
Junjie Pan
,
Xiang Yin
,
Zejun Ma
A Novel Chinese Dialect TTS Frontend with Non-Autoregressive Neural Machine Translation.
CoRR
(2022)
Junhui Zhang
,
Junjie Pan
,
Xiang Yin
,
Zejun Ma
Direct Speech-to-speech Translation without Textual Annotation using Bottleneck Features.
CoRR
(2022)
Pengfei Wei
,
Lingdong Kong
,
Xinghua Qu
,
Xiang Yin
,
Zhiqiang Xu
,
Jing Jiang
,
Zejun Ma
Unsupervised Video Domain Adaptation: A Disentanglement Perspective.
CoRR
(2022)
Yuxiang Zou
,
Shichao Liu
,
Xiang Yin
,
Haopeng Lin
,
Chunfeng Wang
,
Haoyu Zhang
,
Zejun Ma
Fine-Grained Prosody Modeling in Neural Speech Synthesis Using ToBI Representation.
Interspeech
(2021)