IEEE ACM Trans. Audio Speech Lang. Process.

Keyphrases

Publications

volume 32, 2024

Tiantian Zhu, Yang Qin, Ming Feng, Qingcai Chen, Baotian Hu, Yang Xiang
BioPRO: Context-Infused Prompt Learning for Biomedical Entity Linking. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Ernesto Accolti, Javier Gimenez, Michael Vorländer
Uncertainties of Room Acoustics Simulation Due to Directivity Data of Musical Instruments. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Yang Xiang, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Qijie Shao, Pengcheng Guo, Jinghao Yan, Pengfei Hu, Lei Xie
Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Yongwei Zhou, Junwei Bao, Youzheng Wu, Xiaodong He, Tiejun Zhao
Operation-Augmented Numerical Reasoning for Question Answering. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Penghui Ma, Jianfeng Li, Jingjing Pan, Xiaofei Zhang, Roberto Gil-Pita
Coherent Signal DOA Estimation With Coprime Array: Exploiting Signal Subspace Reconstructing Strategy. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Xuenan Xu, Zeyu Xie, Mengyue Wu, Kai Yu
Beyond the Status Quo: A Contemporary Survey of Advances and Challenges in Audio Captioning. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Khandokar Md. Nayem, Donald S. Williamson
Attention-Based Speech Enhancement Using Human Quality Perception Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Mathias Bach Pedersen, Søren Holdt Jensen, Zheng-Hua Tan, Jesper Jensen
Data-Driven Non-Intrusive Speech Intelligibility Prediction Using Speech Presence Probability. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Xiaobo Liang, Runze Mao, Lijun Wu, Juntao Li, Min Zhang, Qing Li
Enhancing Low-Resource NLP by Consistency Training With Data and Model Perturbations. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Bing Han, Zhengyang Chen, Yanmin Qian
Self-Supervised Learning With Cluster-Aware-DINO for High-Performance Robust Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Thomas Haubner, Andreas Brendel, Walter Kellermann
End-to-End Deep Learning-Based Adaptation Control for Linear Acoustic Echo Cancellation. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Chin-Po Chen, Ho-Hsien Pan, Susan Shur-Fen Gau, Chi-Chun Lee
Using Measures of Vowel Space for Autistic Traits Characterization. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Yile Wang, Yue Zhang, Peng Li, Yang Liu
Gradual Syntactic Label Replacement for Language Model Pre-Training. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Emma Hamel, Nickvash Kani
Factors That Influence Automatic Recognition of African-American Vernacular English in Machine-Learning Models. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Qiangqiang Zhang, Dongyuan Lin, Yingying Xiao, Yunfei Zheng, Shiyuan Wang
Error Reused Filtered-X Least Mean Square Algorithm for Active Noise Control. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Shiwen Ni, Jiawen Li, Min Yang, Hung-Yu Kao
DropAttack: A Random Dropped Weight Attack Adversarial Training for Natural Language Understanding. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Yuanbo Hou, Bo Kang, Andrew Mitchell, Wenwu Wang, Jian Kang, Dick Botteldooren
Cooperative Scene-Event Modelling for Acoustic Scene Classification. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Rohit Prabhavalkar, Takaaki Hori, Tara N. Sainath, Ralf Schlüter, Shinji Watanabe
End-to-End Speech Recognition: A Survey. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Szymon Drgas, Lars Bramsløw, Archontis Politis, Gaurav Naithani, Tuomas Virtanen
Dynamic Processing Neural Network Architecture for Hearing Loss Compensation. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Femke B. Gelderblom, Tron V. Tronstad, Torbjørn Svendsen, Tor André Myrvoll
On the Predictive Power of Objective Intelligibility Metrics for the Subjective Performance of Deep Complex Convolutional Recurrent Speech Enhancement Networks. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Federico Miotello, Mirco Pezzoli, Luca Comanducci, Fabio Antonacci, Augusto Sarti
Deep Prior-Based Audio Inpainting Using Multi-Resolution Harmonic Convolutional Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Georgios Paraskevopoulos, Theodoros Kouzelis, Georgios Rouvalis, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos
Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems: A Case Study for Modern Greek. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Xiao Li, Ruirui Liu, Huichou Huang, Qingyao Wu
Contrastive Learning for Target Speaker Extraction With Attention-Based Fusion. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Jingsong Yan, Piji Li, Haibin Chen, Junhao Zheng, Qianli Ma
Does the Order Matter? A Random Generative Way to Learn Label Hierarchy for Hierarchical Text Classification. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Zhibo Man, Zengcheng Huang, Yujie Zhang, Yu Li, Yuanmeng Chen, Yufeng Chen, Jinan Xu
WDSRL: Multi-Domain Neural Machine Translation With Word-Level Domain-Sensitive Representation Learning. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Haisheng Lu, Jiangnan Liang, Chuang Shi
Comments on "Primary-Ambient Extraction Using Ambient Spectrum Estimation for Immersive Spatial Audio Reproduction". IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Kristina Tesch, Timo Gerkmann
Multi-Channel Speech Separation Using Spatially Selective Deep Non-Linear Filters. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Hao-Chen Pei, Hao Fang, Xin Luo, Xin-Shun Xu
Gradformer: A Framework for Multi-Aspect Multi-Granularity Pronunciation Assessment. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Jingbei Li, Sipan Li, Ping Chen, Luwen Zhang, Yi Meng, Zhiyong Wu, Helen Meng, Qiao Tian, Yuping Wang, Yuxuan Wang
Joint Multiscale Cross-Lingual Speaking Style Transfer With Bidirectional Attention Mechanism for Automatic Dubbing. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Han Zhu, Gaofeng Cheng, Jindong Wang, Wenxin Hou, Pengyuan Zhang, Yonghong Yan
Boosting Cross-Domain Speech Recognition With Self-Supervision. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Srdan Kitic, Jérôme Daniel
Blind Identification of Ambisonic Reduced Room Impulse Response. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Zengrui Jin, Mengzhe Geng, Jiajun Deng, Tianzi Wang, Shujie Hu, Guinan Li, Xunying Liu
Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Garima Sharma, Karthikeyan Umapathy, Sridhar Krishnan
Time-Frequency Scattergrams for Biomedical Audio Signal Representation and Classification. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Jiapu Wang, Boyue Wang, Junbin Gao, Simin Hu, Yongli Hu, Baocai Yin
Multi-Level Interaction Based Knowledge Graph Completion. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Yun Zhao, Dexi Liu, Changxuan Wan, Xiping Liu, Jian-Yun Nie, Jiaming Liu
JMS-QA: A Joint Hierarchical Architecture for Mental Health Question Answering. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Yoshiki Masuyama, Kouei Yamaoka, Yuma Kinoshita, Taishi Nakashima, Nobutaka Ono
Causal and Relaxed-Distortionless Response Beamforming for Online Target Source Extraction. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Jun Kong, Jin Wang, Xuejie Zhang
Adaptive Ensemble Self-Distillation With Consistent Gradients for Fast Inference of Pretrained Language Models. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Jin Chu Wu, Raghu N. Kacker
Statistical Analysis for Speaker Recognition Evaluation With Data Dependence and Three Score Distributions. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Ying Zhang, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou
Complex Question Enhanced Transfer Learning for Zero-Shot Joint Information Extraction. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Huiyao Chen, Yueheng Sun, Meishan Zhang, Min Zhang
Automatic Noise Generation and Reduction for Text Classification. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Alejandro Santorum Varela, Svetlana Stoyanchev, Simon Keizer, Rama Doddipatla, Kate Knill
Entity Resolution in Situated Dialog With Unimodal and Multimodal Transformers. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Anurenjan Purushothaman, Debottam Dutta, Rohit Kumar, Sriram Ganapathy
Speech Dereverberation With Frequency Domain Autoregressive Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Cristian Lucian Stanciu, Jacob Benesty, Constantin Paleologu, Ruxandra-Liana Costea, Laura-Maria Dogariu, Silviu Ciochina
Decomposition-Based Wiener Filter Using the Kronecker Product and Conjugate Gradient Method. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Han Wu, Kun Xu, Linqi Song
Structure-Aware Dialogue Modeling Methods for Conversational Semantic Role Labeling. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Shekhar Kumar Yadav, Nithin V. George
Joint Dereverberation and Beamforming With Blind Estimation of the Shape Parameter of the Desired Source Prior. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Jiaming Xu, Jian Cui, Yunzhe Hao, Bo Xu
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Leyuan Qu, Taihao Li, Cornelius Weber, Theresa Pekarek-Rosin, Fuji Ren, Stefan Wermter
Disentangling Prosody Representations With Unsupervised Speech Reconstruction. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Xiaotong Jiang, Peiwen You, Chen Chen, Zhongqing Wang, Guodong Zhou
Exploring Scope Detection for Aspect-Based Sentiment Analysis. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Congcong Jiang, Tieyun Qian, Bing Liu
One General Teacher for Multi-Data Multi-Task: A New Knowledge Distillation Framework for Discourse Relation Analysis. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)