Jixun Yao
Publication Activity (10 Years)
Years Active: 2020-2024
Publications (10 Years): 28
Top Topics
Broadcast News
Acoustic Features
Speech Emotion Recognition
Latent Space
Top Venues
CoRR
ICASSP
INTERSPEECH
CSAI
Publications
Jixun Yao, Yuguang Yang, Yi Lei, Ziqian Ning, Yanni Hu, Yu Pan, Jingjing Yin, Hongbin Zhou, Heng Lu, Lei Xie
PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts.
ICASSP
(2024)
Jixun Yao, Qing Wang, Pengcheng Guo, Ziqian Ning, Lei Xie
Distinctive and Natural Speaker Anonymization via Singular Value Transformation-Assisted Matrix.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Yu Pan, Yanni Hu, Yuguang Yang, Wen Fei, Jixun Yao, Heng Lu, Lei Ma, Jianjun Zhao
GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Accurate Speech Emotion Recognition.
ICASSP
(2024)
Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Shuai Wang, Jixun Yao, Lei Xie, Mengxiao Bi
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion.
ICASSP
(2024)
Jixun Yao, Yi Lei, Qing Wang, Pengcheng Guo, Ziqian Ning, Lei Xie, Hai Li, Junhui Liu, Danming Xie
Preserving Background Sound in Noise-Robust Voice Conversion Via Multi-Task Learning.
ICASSP
(2023)
Jixun Yao, Qing Wang, Yi Lei, Pengcheng Guo, Lei Xie, Namin Wang, Jie Liu
Distinguishable Speaker Anonymization Based on Formant and Fundamental Frequency Scaling.
ICASSP
(2023)
Qing Wang, Jixun Yao, Ziqian Wang, Pengcheng Guo, Lei Xie
Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification.
CoRR
(2023)
Yu Pan, Yanni Hu, Yuguang Yang, Jixun Yao, Wen Fei, Lei Ma, Heng Lu
GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Speech Emotion Recognition.
CoRR
(2023)
Qing Wang, Jixun Yao, Ziqian Wang, Pengcheng Guo, Lei Xie
Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification.
INTERSPEECH
(2023)
Qing Wang, Jixun Yao, Li Zhang, Pengcheng Guo, Lei Xie
Timbre-Reserved Adversarial Attack in Speaker Identification.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Qing Wang, Jixun Yao, Li Zhang, Pengcheng Guo, Lei Xie
Timbre-reserved Adversarial Attack in Speaker Identification.
CoRR
(2023)
Yi Lei, Shan Yang, Xinsheng Wang, Qicong Xie, Jixun Yao, Lei Xie, Dan Su
UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis.
AAAI
(2023)
Ziqian Wang, Qing Wang, Jixun Yao, Lei Xie
The NPU-ASLP System for Deepfake Algorithm Recognition in ADD 2023 Challenge.
DADA@IJCAI
(2023)
Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding.
CoRR
(2023)
Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Shuai Wang, Jixun Yao, Lei Xie, Mengxiao Bi
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion.
CoRR
(2023)
Jixun Yao, Yuguang Yang, Yi Lei, Ziqian Ning, Yanni Hu, Yu Pan, Jingjing Yin, Hongbin Zhou, Heng Lu, Lei Xie
PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts.
CoRR
(2023)
Guofeng Yi, Yuguang Yang, Yu Pan, Yuhang Cao, Jixun Yao, Xiang Lv, Cunhang Fan, Zhao Lv, Jianhua Tao, Shan Liang, Heng Lu
Exploring the Power of Cross-Contextual Large Language Model in Mimic Emotion Prediction.
MuSe@ACM Multimedia
(2023)
Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding.
INTERSPEECH
(2023)
Yuanjun Lv, Jixun Yao, Peikun Chen, Hongbin Zhou, Heng Lu, Lei Xie
SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation.
ASRU
(2023)
Yuanjun Lv, Jixun Yao, Peikun Chen, Hongbin Zhou, Heng Lu, Lei Xie
SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation.
CoRR
(2023)
Ziqian Ning, Qicong Xie, Pengcheng Zhu, Zhichao Wang, Liumeng Xue, Jixun Yao, Lei Xie, Mengxiao Bi
Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features.
ICASSP
(2023)
Jixun Yao, Yi Lei, Qing Wang, Pengcheng Guo, Ziqian Ning, Lei Xie, Hai Li, Junhui Liu, Danming Xie
Preserving background sound in noise-robust voice conversion via multi-task learning.
CoRR
(2022)
Jixun Yao, Qing Wang, Yi Lei, Pengcheng Guo, Lei Xie, Namin Wang, Jie Liu
Distinguishable Speaker Anonymization based on Formant and Fundamental Frequency Scaling.
CoRR
(2022)
Renmingyue Du, Jixun Yao
High Quality and Similarity One-Shot Voice Conversion Using End-to-End Model.
CSAI
(2022)
Ziqian Ning, Qicong Xie, Pengcheng Zhu, Zhichao Wang, Liumeng Xue, Jixun Yao, Lei Xie, Mengxiao Bi
Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features.
CoRR
(2022)
Jixun Yao, Qing Wang, Li Zhang, Pengcheng Guo, Yuhao Liang, Lei Xie
NWPU-ASLP System for the VoicePrivacy 2022 Challenge.
CoRR
(2022)
Yi Lei, Shan Yang, Xinsheng Wang, Qicong Xie, Jixun Yao, Lei Xie, Dan Su
UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis.
CoRR
(2022)
Jixun Yao, Xiaoan Li, Dengshan Huang
A Reward Shaping Method based on Meta-LSTM for Continuous Control of Robot.
CSAI
(2020)