Penny Karanasou
Publication Activity (10 Years)
Years Active: 2013-2023
Publications (10 Years): 26
Top Topics
Transfer Learning
Neural Network
Acoustic Models
Text To Speech
Top Venues
CoRR
INTERSPEECH
ICASSP
SSW
Publications
Marcel Granero Moya
,
Penny Karanasou
,
Sri Karlapati
,
Bastian Schnell
,
Nicole Peinelt
,
Alexis Moinet
,
Thomas Drugman
A Comparative Analysis of Pretrained Language Models for Text-to-Speech.
CoRR
(2023)
Arnaud Joly
,
Marco Nicolis
,
Ekaterina Peterova
,
Alessandro Lombardi
,
Ammar Abbas
,
Arent van Korlaar
,
Aman Hussain
,
Parul Sharma
,
Alexis Moinet
,
Mateusz Lajszczak
,
Penny Karanasou
,
Antonio Bonafonte
,
Thomas Drugman
,
Elena Sokolova
Controllable Emphasis with zero data for text-to-speech.
CoRR
(2023)
Marcel Granero Moya
,
Penny Karanasou
,
Sri Karlapati
,
Bastian Schnell
,
Nicole Peinelt
,
Alexis Moinet
,
Thomas Drugman
A Comparative Analysis of Pretrained Language Models for Text-to-Speech.
SSW
(2023)
Ammar Abbas
,
Sri Karlapati
,
Bastian Schnell
,
Penny Karanasou
,
Marcel Granero Moya
,
Amith Nagaraj
,
Ayman Boustati
,
Nicole Peinelt
,
Alexis Moinet
,
Thomas Drugman
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer.
CoRR
(2023)
Arnaud Joly
,
Marco Nicolis
,
Ekaterina Peterova
,
Alessandro Lombardi
,
Ammar Abbas
,
Arent van Korlaar
,
Aman Hussain
,
Parul Sharma
,
Alexis Moinet
,
Mateusz Lajszczak
,
Penny Karanasou
,
Antonio Bonafonte
,
Thomas Drugman
,
Elena Sokolova
Controllable Emphasis with zero data for text-to-speech.
SSW
(2023)
Ammar Abbas
,
Sri Karlapati
,
Bastian Schnell
,
Penny Karanasou
,
Marcel Granero Moya
,
Amith Nagaraj
,
Ayman Boustati
,
Nicole Peinelt
,
Alexis Moinet
,
Thomas Drugman
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer.
INTERSPEECH
(2023)
Sri Karlapati
,
Penny Karanasou
,
Mateusz Lajszczak
,
Ammar Abbas
,
Alexis Moinet
,
Peter Makarov
,
Ray Li
,
Arent van Korlaar
,
Simon Slangen
,
Thomas Drugman
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.
CoRR
(2022)
Sri Karlapati
,
Penny Karanasou
,
Mateusz Lajszczak
,
Syed Ammar Abbas
,
Alexis Moinet
,
Peter Makarov
,
Ray Li
,
Arent van Korlaar
,
Simon Slangen
,
Thomas Drugman
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.
INTERSPEECH
(2022)
Peter Makarov
,
Ammar Abbas
,
Mateusz Lajszczak
,
Arnaud Joly
,
Sri Karlapati
,
Alexis Moinet
,
Thomas Drugman
,
Penny Karanasou
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody.
CoRR
(2022)
Peter Makarov
,
Syed Ammar Abbas
,
Mateusz Lajszczak
,
Arnaud Joly
,
Sri Karlapati
,
Alexis Moinet
,
Thomas Drugman
,
Penny Karanasou
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody.
INTERSPEECH
(2022)
Dino Ratcliffe
,
You Wang
,
Alex Mansbridge
,
Penny Karanasou
,
Alexis Moinet
,
Marius Cotescu
Cross-lingual Style Transfer with Conditional Prior VAE and Style Loss.
INTERSPEECH
(2022)
Penny Karanasou
,
Sri Karlapati
,
Alexis Moinet
,
Arnaud Joly
,
Ammar Abbas
,
Simon Slangen
,
Jaime Lorenzo-Trueba
,
Thomas Drugman
A learned conditional prior for the VAE acoustic space of a TTS system.
CoRR
(2021)
Ammar Abbas
,
Bajibabu Bollepalli
,
Alexis Moinet
,
Arnaud Joly
,
Penny Karanasou
,
Peter Makarov
,
Simon Slangen
,
Sri Karlapati
,
Thomas Drugman
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech.
SSW
(2021)
Sri Karlapati
,
Ammar Abbas
,
Zack Hodari
,
Alexis Moinet
,
Arnaud Joly
,
Penny Karanasou
,
Thomas Drugman
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech.
ICASSP
(2021)
Penny Karanasou
,
Sri Karlapati
,
Alexis Moinet
,
Arnaud Joly
,
Ammar Abbas
,
Simon Slangen
,
Jaime Lorenzo-Trueba
,
Thomas Drugman
A Learned Conditional Prior for the VAE Acoustic Space of a TTS System.
INTERSPEECH
(2021)
Zack Hodari
,
Alexis Moinet
,
Sri Karlapati
,
Jaime Lorenzo-Trueba
,
Thomas Merritt
,
Arnaud Joly
,
Ammar Abbas
,
Penny Karanasou
,
Thomas Drugman
Camp: A Two-Stage Approach to Modelling Prosody in Context.
ICASSP
(2021)
Ammar Abbas
,
Bajibabu Bollepalli
,
Alexis Moinet
,
Arnaud Joly
,
Penny Karanasou
,
Peter Makarov
,
Simon Slangen
,
Sri Karlapati
,
Thomas Drugman
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech.
CoRR
(2021)
Sri Karlapati
,
Ammar Abbas
,
Zack Hodari
,
Alexis Moinet
,
Arnaud Joly
,
Penny Karanasou
,
Thomas Drugman
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech.
CoRR
(2020)
Andrew Johnson
,
Penny Karanasou
,
Judith Gaspers
,
Dietrich Klakow
Cross-lingual Transfer Learning for Japanese Named Entity Recognition.
NAACL-HLT (2)
(2019)
Chunyang Wu
,
Mark J. F. Gales
,
Anton Ragni
,
Penny Karanasou
,
Khe Chai Sim
Improving Interpretability and Regularization in Deep Learning.
IEEE ACM Trans. Audio Speech Lang. Process.
26 (2) (2018)
Judith Gaspers
,
Penny Karanasou
,
Rajen Chatterjee
Selecting Machine-Translated Data for Quick Bootstrapping of a Natural Language Understanding System.
CoRR
(2018)
Judith Gaspers
,
Penny Karanasou
,
Rajen Chatterjee
Selecting Machine-Translated Data for Quick Bootstrapping of a Natural Language Understanding System.
NAACL-HLT (3)
(2018)
Penny Karanasou
,
Chunyang Wu
,
Mark J. F. Gales
,
Philip C. Woodland
I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models.
IEEE ACM Trans. Audio Speech Lang. Process.
25 (4) (2017)
Pierre Lanchantin
,
Mark J. F. Gales
,
Penny Karanasou
,
Xunying Liu
,
Yanmin Qian
,
Linlin Wang
,
Philip C. Woodland
,
Chao Zhang
Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems.
INTERSPEECH
(2016)
Chunyang Wu
,
Penny Karanasou
,
Mark J. F. Gales
,
Khe Chai Sim
Stimulated Deep Neural Network for Speech Recognition.
INTERSPEECH
(2016)
Chunyang Wu
,
Penny Karanasou
,
Mark J. F. Gales
Combining i-vector representation and structured neural networks for rapid adaptation.
ICASSP
(2016)
Pierre Lanchantin
,
Mark J. F. Gales
,
Penny Karanasou
,
Xunying Liu
,
Yanmin Qian
,
Linlin Wang
,
Philip C. Woodland
,
Chao Zhang
The development of the Cambridge University alignment systems for the multi-genre broadcast challenge.
ASRU
(2015)
Philip C. Woodland
,
Xunying Liu
,
Yanmin Qian
,
Chao Zhang
,
Mark J. F. Gales
,
Penny Karanasou
,
Pierre Lanchantin
,
Linlin Wang
Cambridge University transcription systems for the multi-genre broadcast challenge.
ASRU
(2015)
Penny Karanasou
,
Mark J. F. Gales
,
Pierre Lanchantin
,
Xunying Liu
,
Yanmin Qian
,
Linlin Wang
,
Philip C. Woodland
,
Chao Zhang
Speaker diarisation and longitudinal linking in multi-genre broadcast data.
ASRU
(2015)
Yulan Liu
,
Penny Karanasou
,
Thomas Hain
An investigation into speaker informed DNN front-end for LVCSR.
ICASSP
(2015)
Penny Karanasou
,
Mark J. F. Gales
,
Philip C. Woodland
I-vector estimation using informative priors for adaptation of deep neural networks.
INTERSPEECH
(2015)
Penny Karanasou
,
Yongqiang Wang
,
Mark J. F. Gales
,
Philip C. Woodland
Adaptation of deep neural network acoustic models using factorised i-vectors.
INTERSPEECH
(2014)
Penny Karanasou
,
François Yvon
,
Thomas Lavergne
,
Lori Lamel
Discriminative training of a phoneme confusion model for a dynamic lexicon in ASR.
INTERSPEECH
(2013)