Sign in
SAPA@INTERSPEECH
2004
2007
2009
2012
2004
2012
Keyphrases
Publications
2012
Liang Lu
,
Arnab Ghoshal
,
Steve Renals
Joint uncertainty decoding with unscented transform for noise robust subspace Gaussian mixture models.
SAPA@INTERSPEECH
(2012)
Kamal Sahni
,
Pranay Dighe
,
Rita Singh
,
Bhiksha Raj
Language identification using spectro-temporal patch features.
SAPA@INTERSPEECH
(2012)
Heyun Huang
,
Louis ten Bosch
,
Bert Cranen
,
Lou Boves
Smoothing speech trajectories by regularization.
SAPA@INTERSPEECH
(2012)
Takuya Yoshioka
,
Daichi Sakaue
Log-normal matrix factorization with application to speech-music separation.
SAPA@INTERSPEECH
(2012)
Josh H. McDermott
,
Daniel P. W. Ellis
,
Hideki Kawahara
Inharmonic speech: a tool for the study of speech perception and separation.
SAPA@INTERSPEECH
(2012)
Tomoyasu Nakano
,
Masataka Goto
A spectral envelope estimation method based on F0-adaptive multi-frame integration analysis.
SAPA@INTERSPEECH
(2012)
Sunder Ram Krishnan
,
Chandra Sekhar Seelamantula
A generalized Stein's estimation approach for speech enhancement based on perceptual criteria.
SAPA@INTERSPEECH
(2012)
Majid Mirbagheri
,
Yanbo Xu
,
Shihab A. Shamma
Pitch estimation using mutual information.
SAPA@INTERSPEECH
(2012)
Tuomas Virtanen
Human sound perception - what can we learn from it when developing audio analysis algorithms?
SAPA@INTERSPEECH
(2012)
Deepu Vijayasenan
,
Fabio Valente
Dimensionality reduction of large TDOA vectors for speaker diarization.
SAPA@INTERSPEECH
(2012)
Joris Driesen
,
Jort F. Gemmeke
,
Hugo Van hamme
Data-driven speech representations for NMF-based word learning.
SAPA@INTERSPEECH
(2012)
Zoltán Tüske
,
Friedhelm R. Drepper
,
Ralf Schlüter
Non-stationary signal processing and its application in speech recognition.
SAPA@INTERSPEECH
(2012)
ISCA Workshop on Statistical And Perceptual Audition, SAPA 2012, Portland, OR, USA, September 7-8, 2012
SAPA@INTERSPEECH
(2012)
Kalu U. Ogbureke
,
João P. Cabral
,
Julie Carson-Berndsen
Explicit duration modelling in HMM-based speech synthesis using a hybrid hidden Markov model-multilayer perceptron.
SAPA@INTERSPEECH
(2012)
Afsaneh Asaei
,
Bhiksha Raj
,
Hervé Bourlard
,
Volkan Cevher
Structured sparse coding for microphone array location calibration.
SAPA@INTERSPEECH
(2012)
Serena Soldo
,
Mathew Magimai-Doss
,
Hervé Bourlard
Template-based ASR using posterior features and synthetic references: comparing different TTS systems.
SAPA@INTERSPEECH
(2012)
M. Ali Basha Shaik
,
David Rybach
,
Stefan Hahn
,
Ralf Schlüter
,
Hermann Ney
Hierarchical hybrid language models for open vocabulary continuous speech recognition using WFST.
SAPA@INTERSPEECH
(2012)
Cassia Valentini-Botinhao
,
Junichi Yamagishi
,
Simon King
Evaluating speech intelligibility enhancement for HMM-based synthetic speech in noise.
SAPA@INTERSPEECH
(2012)
Cong-Thanh Do
,
Claude Barras
Cochlear implant-like processing of speech signal for speaker verification.
SAPA@INTERSPEECH
(2012)
Samuel K. Ngouoko M
,
Martin Heckmann
,
Britta Wrede
Spectro-temporal features with distribution equalization.
SAPA@INTERSPEECH
(2012)
Mauro Nicolao
,
Roger K. Moore
Establishing some principles of human speech production through two-dimensional computational models.
SAPA@INTERSPEECH
(2012)
Youssef Oualil
,
Mathew Magimai-Doss
,
Friedrich Faubel
,
Dietrich Klakow
Joint detection and localization of multiple speakers using a probabilistic interpretation of the steered response power.
SAPA@INTERSPEECH
(2012)
Rahil Mahdian Toroghi
,
Friedrich Faubel
,
Dietrich Klakow
Multi-channel speech separation with soft time-frequency masking.
SAPA@INTERSPEECH
(2012)
2010
Janet M. Baker
,
Alexander M. Chan
,
Ksenija Marinkovic
,
Eric Halgren
,
Sydney S. Cash
Machine learning for learning how the brain recognizes speech and language.
SAPA@INTERSPEECH
(2010)
Ning Ma
,
Jon Barker
,
Heidi Christensen
,
Phil D. Green
Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding.
SAPA@INTERSPEECH
(2010)
Jun Wu
,
Yu Kitano
,
Stanislaw Andrzej Raczynski
,
Shigeki Miyabe
,
Takuya Nishimoto
,
Nobutaka Ono
,
Shigeki Sagayama
Musical instrument identification based on harmonic temporal timbre features.
SAPA@INTERSPEECH
(2010)
Martin Heckmann
Supervised vs. unsupervised learning of spectro temporal speech features.
SAPA@INTERSPEECH
(2010)
Yushen Han
,
Christopher Raphael
Informed source separation of orchestra and soloist using masking and unmasking.
SAPA@INTERSPEECH
(2010)
ISCA Workshop on Statistical And Perceptual Audition, SAPA 2010, Makuhari, Japan, September 25, 2010
SAPA@INTERSPEECH
(2010)
Masahito Togami
,
Koichi Hori
Online speech source separation in meeting scene with time-varying weights of noise covariance matrices.
SAPA@INTERSPEECH
(2010)
Piotr Holonowicz
,
Perfecto Herrera
Detection of polyphonic music note onsets by application of the Bayesian theory of surprise.
SAPA@INTERSPEECH
(2010)
Hirokazu Kameoka
,
Jonathan Le Roux
,
Yasunori Ohishi
A statistical model of speech F0 contours.
SAPA@INTERSPEECH
(2010)
Emmanouil Benetos
,
Simon Dixon
Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution.
SAPA@INTERSPEECH
(2010)
2008
Tony Ezzat
,
Tomaso A. Poggio
Discriminative word-spotting using ordered spectro-temporal patch features.
SAPA@INTERSPEECH
(2008)
Tuomas Virtanen
,
Annamaria Mesaros
,
Matti Ryynänen
Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music.
SAPA@INTERSPEECH
(2008)
Jonathan Le Roux
,
Hirokazu Kameoka
,
Nobutaka Ono
,
Alain de Cheveigné
,
Shigeki Sagayama
Computational auditory induction by missing-data non-negative matrix factorization.
SAPA@INTERSPEECH
(2008)
Adam C. Lammert
,
Daniel P. W. Ellis
,
Pierre L. Divenyi
Data-driven articulatory inversion incorporating articulator priors.
SAPA@INTERSPEECH
(2008)
Ke Hu
,
Pierre L. Divenyi
,
Daniel P. W. Ellis
,
Zhaozhang Jin
,
Barbara G. Shinn-Cunningham
,
DeLiang Wang
Preliminary intelligibility tests of a monaural speech segregation system.
SAPA@INTERSPEECH
(2008)
ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, SAPA 2008, Brisbane, Australia, September 21, 2008
SAPA@INTERSPEECH
(2008)
Maria E. Markaki
,
Andre Holzapfel
,
Yannis Stylianou
Singing voice detection using modulation frequency feature.
SAPA@INTERSPEECH
(2008)
Jonathan Le Roux
,
Nobutaka Ono
,
Shigeki Sagayama
Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction.
SAPA@INTERSPEECH
(2008)
2006
Steven J. Rennie
,
Peder A. Olsen
,
John R. Hershey
,
Trausti T. Kristjansson
The Iroquois model: using temporal dynamics to separate speakers.
SAPA@INTERSPEECH
(2006)
Michael I. Mandel
,
Daniel P. W. Ellis
A probability model for interaural phase difference.
SAPA@INTERSPEECH
(2006)
Yoshitaka Nishimura
,
Mikio Nakano
,
Kazuhiro Nakadai
,
Hiroshi Tsujino
,
Mitsuru Ishizuka
Speech recognition for a robot under its motor noises by selective application of missing feature theory and MLLR.
SAPA@INTERSPEECH
(2006)
Tomonori Izumitani
,
Kunio Kashino
Frequency component restoration for music sounds using local probabilistic models with maximum entropy learning.
SAPA@INTERSPEECH
(2006)
Hiroko Terasawa
,
Malcolm Slaney
,
Jonathan Berger
A statistical model of timbre perception.
SAPA@INTERSPEECH
(2006)
Guoping Li
,
Mark E. Lutman
Sparseness and speech perception in noise.
SAPA@INTERSPEECH
(2006)
Kentaro Ishizuka
,
Tomohiro Nakatani
Study of noise robust voice activity detection based on periodic component to aperiodic component ratio.
SAPA@INTERSPEECH
(2006)
Jerome R. Bellegarda
LSM-based feature extraction for concatenative speech synthesis.
SAPA@INTERSPEECH
(2006)
Sourabh Ravindran
,
David V. Anderson
,
Malcolm Slaney
Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing.
SAPA@INTERSPEECH
(2006)