Tobias May

Publication Activity (10 Years)

Years Active: 2011-2024
Publications (10 Years): 16

Top Topics

Matrix Factorization

Diffusion Models

Speech Enhancement

Top Venues

IEEE ACM Trans. Audio Speech Lang. Process.

Publications

Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler. ICASSP (2024)
Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May
Investigating the Design Space of Diffusion Models for Speech Enhancement. CoRR (2023)
Philippe Gonzalez, Tommy Sonne Alstrøm, Tobias May
Assessing the Generalization Gap of Learning-Based Speech Enhancement Systems in Noisy and Reverberant Environments. CoRR (2023)
Philippe Gonzalez, Tommy Sonne Alstrøm, Tobias May
On Batching Variable Size Inputs for Training End-to-End Speech Enhancement Systems. ICASSP (2023)
Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler. CoRR (2023)
Philippe Gonzalez, Tommy Sonne Alstrøm, Tobias May
On Batching Variable Size Inputs for Training End-to-End Speech Enhancement Systems. CoRR (2023)
Philippe Gonzalez, Tommy Sonne Alstrøm, Tobias May
Assessing the Generalization Gap of Learning-Based Speech Enhancement Systems in Noisy and Reverberant Environments. IEEE ACM Trans. Audio Speech Lang. Process. 31 (2023)
Sarinah Sutojo, Tobias May, Steven van de Par
Segmentation of Multitalker Mixtures Based on Local Feature Contrasts and Auditory Glimpses. IEEE ACM Trans. Audio Speech Lang. Process. 30 (2022)
Ingvi Örnolfsson, Torsten Dau, Ning Ma, Tobias May
Exploiting Non-Negative Matrix Factorization for Binaural Sound Localization in the Presence of Directional Interference. ICASSP (2021)
Peter Mølgaard Sørensen, Bastian Epp, Tobias May
A depthwise separable convolutional neural network for keyword spotting on an embedded system. EURASIP J. Audio Speech Music. Process. 2020 (1) (2020)
Ning Ma, Tobias May, Guy J. Brown
Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environments. CoRR (2019)
Tobias May
Robust Speech Dereverberation With a Neural Network-Based Post-Filter That Exploits Multi-Conditional Training of Binaural Cues. IEEE ACM Trans. Audio Speech Lang. Process. 26 (2) (2018)
Ning Ma, Tobias May, Guy J. Brown
Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments. IEEE ACM Trans. Audio Speech Lang. Process. 25 (12) (2017)
Tobias May
Influence of binary mask estimation errors on robust speaker identification. Speech Commun. 87 (2017)
Tobias May, Borys Kowalewski, Michal Fereczkowski, Ewen N. MacDonald
Assessment of broadband SNR estimation for hearing aid applications. ICASSP (2017)
Thomas Bentsen, Tobias May, Abigail A. Kressner, Torsten Dau
Comparing the Influence of Spectro-Temporal Integration in Computational Speech Segregation. INTERSPEECH (2016)
Ning Ma, Tobias May, Hagen Wierstorf, Guy J. Brown
A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions. ICASSP (2015)
Tobias May, Ning Ma, Guy J. Brown
Robust localisation of multiple speakers exploiting head movements and multi-conditional training of binaural cues. ICASSP (2015)
Ning Ma, Guy J. Brown, Tobias May
Exploiting deep neural networks and head movements for binaural localisation of multiple speakers in reverberant conditions. INTERSPEECH (2015)
Tobias May, Thomas Bentsen, Torsten Dau
The role of temporal resolution in modulation-based speech segregation. INTERSPEECH (2015)
Tobias May, Timo Gerkmann
Generalization of supervised learning for binary mask estimation. IWAENC (2014)
Tobias May, Torsten Dau
Environment-aware ideal binary mask estimation using monaural cues. WASPAA (2013)
Eleftheria Georganti, Tobias May, Steven van de Par, John Mourjopoulos
Sound Source Distance Estimation in Rooms based on Statistical Properties of Binaural Signals. IEEE Trans. Speech Audio Process. 21 (8) (2013)
Tobias May, Steven van de Par, Armin Kohlrausch
Noise-Robust Speaker Recognition Combining Missing Data Techniques and Universal Background Modeling. IEEE Trans. Speech Audio Process. 20 (1) (2012)
Tobias May, Steven van de Par
Blind Estimation of the Number of Speech Sources in Reverberant Multisource Scenarios Based on Binaural Signals. IWAENC (2012)
Tobias May, Steven van de Par, Armin Kohlrausch
A Binaural Scene Analyzer for Joint Localization and Recognition of Speakers in the Presence of Interfering Noise Sources and Reverberation. IEEE Trans. Speech Audio Process. 20 (7) (2012)
Eleftheria Georganti, Tobias May, Steven van de Par, Aki Härmä, John Mourjopoulos
Speaker Distance Detection Using a Single Microphone. IEEE Trans. Speech Audio Process. 19 (7) (2011)
Tobias May, Steven van de Par, Armin Kohlrausch
A Probabilistic Model for Robust Localization Based on a Binaural Auditory Front-End. IEEE Trans. Speech Audio Process. 19 (1) (2011)
Tobias May, Steven van de Par, Armin Kohlrausch
Binaural detection of speech sources in complex acoustic scenes. WASPAA (2011)