​
Login / Signup
Midia Yousefi
Publication Activity (10 Years)
Years Active: 2016-2024
Publications (10 Years): 18
Top Topics
Speech Recognition
Convolutional Neural Network
Acoustic Models
Speaker Diarization
Top Venues
CoRR
INTERSPEECH
ICASSP
IST
</>
Publications
</>
Leying Zhang
,
Yao Qian
,
Long Zhou
,
Shujie Liu
,
Dongmei Wang
,
Xiaofei Wang
,
Midia Yousefi
,
Yanmin Qian
,
Jinyu Li
,
Lei He
,
Sheng Zhao
,
Michael Zeng
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations.
CoRR
(2024)
Dongmei Wang
,
Xiong Xiao
,
Naoyuki Kanda
,
Midia Yousefi
,
Takuya Yoshioka
,
Jian Wu
Profile-Error-Tolerant Target-Speaker Voice Activity Detection.
ICASSP
(2024)
Chenyang Le
,
Yao Qian
,
Dongmei Wang
,
Long Zhou
,
Shujie Liu
,
Xiaofei Wang
,
Midia Yousefi
,
Yanmin Qian
,
Jinyu Li
,
Sheng Zhao
,
Michael Zeng
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation.
CoRR
(2024)
Midia Yousefi
,
Naoyuki Kanda
,
Dongmei Wang
,
Zhuo Chen
,
Xiaofei Wang
,
Takuya Yoshioka
Speaker Diarization for ASR Output with T-vectors: A Sequence Classification Approach.
INTERSPEECH
(2023)
Midia Yousefi
,
John H. L. Hansen
Single-channel speech separation using soft-minimum permutation invariant training.
Speech Commun.
151 (2023)
Dongmei Wang
,
Xiong Xiao
,
Naoyuki Kanda
,
Midia Yousefi
,
Takuya Yoshioka
,
Jian Wu
Profile-Error-Tolerant Target-Speaker Voice Activity Detection.
CoRR
(2023)
Midia Yousefi
,
John H. L. Hansen
Speaker Conditioning of Acoustic Models Using Affine Transformation for Multi-Speaker Speech Recognition.
ASRU
(2021)
Midia Yousefi
,
Dimitra Emmanouilidou
Audio-based Toxic Language Classification using Self-attentive Convolutional Neural Network.
EUSIPCO
(2021)
Midia Yousefi
,
John H. L. Hansen
Real-time Speaker counting in a cocktail party scenario using Attention-guided Convolutional Neural Network.
CoRR
(2021)
Midia Yousefi
,
John H. L. Hansen
Real-Time Speaker Counting in a Cocktail Party Scenario Using Attention-Guided Convolutional Neural Network.
Interspeech
(2021)
Midia Yousefi
,
John H. L. Hansen
Single-channel speech separation using Soft-minimum Permutation Invariant Training.
CoRR
(2021)
Midia Yousefi
,
John H. L. Hansen
Block-Based High Performance CNN Architectures for Frame-Level Overlapping Speech Detection.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Midia Yousefi
,
John H. L. Hanse
Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition.
CoRR
(2021)
Midia Yousefi
,
John H. L. Hansen
Frame-Based Overlapping Speech Detection Using Convolutional Neural Networks.
ICASSP
(2020)
Midia Yousefi
,
Soheil Khorram
,
John H. L. Hansen
Probabilistic Permutation Invariant Training for Speech Separation.
CoRR
(2019)
Midia Yousefi
,
Soheil Khorram
,
John H. L. Hansen
Probabilistic Permutation Invariant Training for Speech Separation.
INTERSPEECH
(2019)
Midia Yousefi
,
Navid Shokouhi
,
John H. L. Hansen
Assessing Speaker Engagement in 2-Person Debates: Overlap Detection in United States Presidential Debates.
INTERSPEECH
(2018)
Midia Yousefi
,
Mohammad Hassan Savoji
Supervised speech enhancement using online Group-Sparse Convolutive NMF.
IST
(2016)