Éva Székely
ORCID
Publication Activity (10 Years)
Years Active: 2011-2023
Publications (10 Years): 34
Top Topics
Fundamental Frequency
Facial Gestures
Denoising
Speech Synthesis
Top Venues
CoRR
INTERSPEECH
ICASSP
RO-MAN
Publications
Shivam Mehta
,
Ruibo Tu
,
Jonas Beskow
,
Éva Székely
,
Gustav Eje Henter
Matcha-TTS: A fast TTS architecture with conditional flow matching.
CoRR
(2023)
Harm Lameris
,
Shivam Mehta
,
Gustav Eje Henter
,
Joakim Gustafson
,
Éva Székely
Prosody-Controllable Spontaneous TTS with Neural HMMs.
ICASSP
(2023)
Shivam Mehta
,
Siyang Wang
,
Simon Alexanderson
,
Jonas Beskow
,
Éva Székely
,
Gustav Eje Henter
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis.
CoRR
(2023)
Siyang Wang
,
Gustav Eje Henter
,
Joakim Gustafson
,
Éva Székely
A Comparative Study of Self-Supervised Speech Representations in Read and Spontaneous TTS.
ICASSP Workshops
(2023)
Shivam Mehta
,
Ruibo Tu
,
Simon Alexanderson
,
Jonas Beskow
,
Éva Székely
,
Gustav Eje Henter
Unified speech and gesture synthesis using flow matching.
CoRR
(2023)
Jura Miniota
,
Siyang Wang
,
Jonas Beskow
,
Joakim Gustafson
,
Éva Székely
,
André Pereira
Hi robot, it's not what you say, it's how you say it.
RO-MAN
(2023)
Matthew Peter Aylett
,
Éva Székely
,
Donald McMillan
,
Gabriel Skantze
,
Marta Romeo
,
Joel E. Fischer
,
Gisela Reyes-Cruz
Why is my Agent so Slow? Deploying Human-Like Conversational Turn-Taking.
HAI
(2023)
Ilaria Torre
,
Erik Lagerstedt
,
Nathaniel Dennler
,
Katie Seaborn
,
Iolanda Leite
,
Éva Székely
Can a gender-ambiguous voice reduce gender stereotypes in human-robot interactions?
RO-MAN
(2023)
Siyang Wang
,
Gustav Eje Henter
,
Joakim Gustafson
,
Éva Székely
A Comparative Study of Self-Supervised Speech Representations in Read and Spontaneous TTS.
CoRR
(2023)
Joakim Gustafson
,
Éva Székely
,
Jonas Beskow
Generation of speech and facial animation with controllable articulatory effort for amusing conversational characters.
IVA
(2023)
Siyang Wang
,
Gustav Eje Henter
,
Joakim Gustafson
,
Éva Székely
On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis.
CoRR
(2023)
Joakim Gustafson
,
Éva Székely
,
Simon Alexanderson
,
Jonas Beskow
Casual chatter or speaking up? Adjusting articulatory effort in generation of speech and animation for conversational characters.
FG
(2023)
Erik Ekstedt
,
Siyang Wang
,
Éva Székely
,
Joakim Gustafson
,
Gabriel Skantze
Automatic Evaluation of Turn-taking Cues in Conversational Speech Synthesis.
CoRR
(2023)
Shivam Mehta
,
Ambika Kirkland
,
Harm Lameris
,
Jonas Beskow
,
Éva Székely
,
Gustav Eje Henter
OverFlow: Putting flows on top of neural transducers for better TTS.
CoRR
(2022)
Siyang Wang
,
Joakim Gustafson
,
Éva Székely
Evaluating Sampling-based Filler Insertion with Spontaneous TTS.
LREC
(2022)
Ambika Kirkland
,
Harm Lameris
,
Éva Székely
,
Joakim Gustafson
Where's the uh, hesitation? The interplay between filled pause location, speech rate and fundamental frequency in perception of confidence.
INTERSPEECH
(2022)
Harm Lameris
,
Shivam Mehta
,
Gustav Eje Henter
,
Joakim Gustafson
,
Éva Székely
Prosody-controllable spontaneous TTS with neural HMMs.
CoRR
(2022)
Shivam Mehta
,
Éva Székely
,
Jonas Beskow
,
Gustav Eje Henter
Neural HMMs Are All You Need (For High-Quality Attention-Free TTS).
ICASSP
(2022)
Siyang Wang
,
Simon Alexanderson
,
Joakim Gustafson
,
Jonas Beskow
,
Gustav Eje Henter
,
Éva Székely
Integrated Speech and Gesture Synthesis.
ICMI
(2021)
Simon Alexanderson
,
Éva Székely
,
Gustav Eje Henter
,
Taras Kucherenko
,
Jonas Beskow
Generating coherent spontaneous speech and gesture from text.
CoRR
(2021)
Siyang Wang
,
Simon Alexanderson
,
Joakim Gustafson
,
Jonas Beskow
,
Gustav Eje Henter
,
Éva Székely
Integrated Speech and Gesture Synthesis.
CoRR
(2021)
Shivam Mehta
,
Éva Székely
,
Jonas Beskow
,
Gustav Eje Henter
Neural HMMs are all you need (for high-quality attention-free TTS).
CoRR
(2021)
Éva Székely
,
Gustav Eje Henter
,
Jonas Beskow
,
Joakim Gustafson
Breathing and Speech Planning in Spontaneous Speech Synthesis.
ICASSP
(2020)
Éva Székely
,
Jens Edlund
,
Joakim Gustafson
Augmented Prompt Selection for Evaluation of Spontaneous Speech Synthesis.
LREC
(2020)
Simon Alexanderson
,
Éva Székely
,
Gustav Eje Henter
,
Taras Kucherenko
,
Jonas Beskow
Generating coherent spontaneous speech and gesture from text.
IVA
(2020)
Éva Székely
,
Gustav Eje Henter
,
Jonas Beskow
,
Joakim Gustafson
Spontaneous Conversational Speech Synthesis from Found Data.
INTERSPEECH
(2019)
Éva Székely
,
Gustav Eje Henter
,
Jonas Beskow
,
Joakim Gustafson
Off the Cuff: Exploring Extemporaneous Speech Delivery with TTS.
INTERSPEECH
(2019)
Éva Székely
,
Gustav Eje Henter
,
Joakim Gustafson
Casting to Corpus: Segmenting and Selecting Spontaneous Dialogue for TTS with a CNN-LSTM Speaker-Dependent Breath Detector.
ICASSP
(2019)
Leigh Clark
,
Benjamin R. Cowan
,
Justin Edwards
,
Cosmin Munteanu
,
Christine Murad
,
Matthew P. Aylett
,
Roger K. Moore
,
Jens Edlund
,
Éva Székely
,
Patrick Healey
,
Naomi Harte
,
Ilaria Torre
,
Philip R. Doyle
Mapping Theoretical and Methodological Perspectives for Understanding Speech Interface Interactions.
CHI Extended Abstracts
(2019)
Simon Betz
,
Sina Zarrieß
,
Éva Székely
,
Petra Wagner
The Greennn Tree - Lengthening Position Influences Uncertainty Perception.
INTERSPEECH
(2019)
Benjamin R. Cowan
,
Holly P. Branigan
,
Habiba Begum
,
Lucy McKenna
,
Éva Székely
They Know as Much as We Do: Knowledge Estimation and Partner Modelling of Artificial Partners.
CogSci
(2017)
Catharine Oertel
,
Patrik Jonell
,
Kevin El Haddad
,
Éva Székely
,
Joakim Gustafson
Using crowd-sourcing for the design of listening agents: challenges and opportunities.
ISIAA@ICMI
(2017)
Éva Székely
,
Joseph Mendelson
,
Joakim Gustafson
Synthesising Uncertainty: The Interplay of Vocal Effort and Hesitation Disfluencies.
INTERSPEECH
(2017)
Éva Székely
,
Mark T. Keane
,
Julie Carson-Berndsen
The effect of soft, modal and loud voice levels on entrainment in noisy conditions.
INTERSPEECH
(2015)
Éva Székely
,
Ingmar Steiner
,
Zeeshan Ahmed
,
Julie Carson-Berndsen
Facial expression-based affective speech translation.
J. Multimodal User Interfaces
8 (1) (2014)
Éva Székely
,
Zeeshan Ahmed
,
Shannon Hennig
,
João P. Cabral
,
Julie Carson-Berndsen
Predicting synthetic voice style from facial expressions. An application for augmented conversations.
Speech Commun.
57 (2014)
Zeeshan Ahmed
,
Ingmar Steiner
,
Éva Székely
,
Julie Carson-Berndsen
A system for facial expression-based affective speech translation.
IUI Companion
(2013)
Éva Székely
,
John Kane
,
Stefan Scherer
,
Christer Gobl
,
Julie Carson-Berndsen
Detecting a targeted voice style in an audiobook using voice quality features.
ICASSP
(2012)
João P. Cabral
,
Mark Kane
,
Zeeshan Ahmed
,
Mohamed Abou-Zleikha
,
Éva Székely
,
Amalia Zahra
,
Kalu U. Ogbureke
,
Peter Cahill
,
Julie Carson-Berndsen
,
Stephan Schlögl
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz.
LREC
(2012)
Éva Székely
,
Tamás Gábor Csapó
,
Bálint Tóth
,
Péter Mihajlik
,
Julie Carson-Berndsen
Synthesizing expressive speech from amateur audiobook recordings.
SLT
(2012)
Éva Székely
,
João P. Cabral
,
Mohamed Abou-Zleikha
,
Peter Cahill
,
Julie Carson-Berndsen
Evaluating expressive speech synthesis from audiobook corpora for conversational phrases.
LREC
(2012)
Éva Székely
,
Zeeshan Ahmed
,
João P. Cabral
,
Julie Carson-Berndsen
WinkTalk: a demonstration of a multimodal speech synthesis platform linking facial expressions to expressive synthetic voices.
SLPAT@HLT-NAACL
(2012)
Éva Székely
,
João P. Cabral
,
Peter Cahill
,
Julie Carson-Berndsen
Clustering Expressive Speech Styles in Audiobooks Using Glottal Source Parameters.
INTERSPEECH
(2011)