Penny Karanasou

Publication Activity (10 Years)

Years Active: 2013-2023
Publications (10 Years): 26

Top Topics

Transfer Learning

Acoustic Models

Top Venues

Publications

Marcel Granero Moya, Penny Karanasou, Sri Karlapati, Bastian Schnell, Nicole Peinelt, Alexis Moinet, Thomas Drugman
A Comparative Analysis of Pretrained Language Models for Text-to-Speech. CoRR (2023)
Arnaud Joly, Marco Nicolis, Ekaterina Peterova, Alessandro Lombardi, Ammar Abbas, Arent van Korlaar, Aman Hussain, Parul Sharma, Alexis Moinet, Mateusz Lajszczak, Penny Karanasou, Antonio Bonafonte, Thomas Drugman, Elena Sokolova
Controllable Emphasis with zero data for text-to-speech. CoRR (2023)
Marcel Granero Moya, Penny Karanasou, Sri Karlapati, Bastian Schnell, Nicole Peinelt, Alexis Moinet, Thomas Drugman
A Comparative Analysis of Pretrained Language Models for Text-to-Speech. SSW (2023)
Ammar Abbas, Sri Karlapati, Bastian Schnell, Penny Karanasou, Marcel Granero Moya, Amith Nagaraj, Ayman Boustati, Nicole Peinelt, Alexis Moinet, Thomas Drugman
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer. CoRR (2023)
Arnaud Joly, Marco Nicolis, Ekaterina Peterova, Alessandro Lombardi, Ammar Abbas, Arent van Korlaar, Aman Hussain, Parul Sharma, Alexis Moinet, Mateusz Lajszczak, Penny Karanasou, Antonio Bonafonte, Thomas Drugman, Elena Sokolova
Controllable Emphasis with zero data for text-to-speech. SSW (2023)
Ammar Abbas, Sri Karlapati, Bastian Schnell, Penny Karanasou, Marcel Granero Moya, Amith Nagaraj, Ayman Boustati, Nicole Peinelt, Alexis Moinet, Thomas Drugman
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer. INTERSPEECH (2023)
Sri Karlapati, Penny Karanasou, Mateusz Lajszczak, Ammar Abbas, Alexis Moinet, Peter Makarov, Ray Li, Arent van Korlaar, Simon Slangen, Thomas Drugman
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer. CoRR (2022)
Sri Karlapati, Penny Karanasou, Mateusz Lajszczak, Syed Ammar Abbas, Alexis Moinet, Peter Makarov, Ray Li, Arent van Korlaar, Simon Slangen, Thomas Drugman
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer. INTERSPEECH (2022)
Peter Makarov, Ammar Abbas, Mateusz Lajszczak, Arnaud Joly, Sri Karlapati, Alexis Moinet, Thomas Drugman, Penny Karanasou
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody. CoRR (2022)
Peter Makarov, Syed Ammar Abbas, Mateusz Lajszczak, Arnaud Joly, Sri Karlapati, Alexis Moinet, Thomas Drugman, Penny Karanasou
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody. INTERSPEECH (2022)
Dino Rattcliffe, You Wang, Alex Mansbridge, Penny Karanasou, Alexis Moinet, Marius Cotescu
Cross-lingual Style Transfer with Conditional Prior VAE and Style Loss. INTERSPEECH (2022)
Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo-Trueba, Thomas Drugman
A learned conditional prior for the VAE acoustic space of a TTS system. CoRR (2021)
Ammar Abbas, Bajibabu Bollepalli, Alexis Moinet, Arnaud Joly, Penny Karanasou, Peter Makarov, Simon Slangens, Sri Karlapati, Thomas Drugman
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech. SSW (2021)
Sri Karlapati, Ammar Abbas, Zack Hodari, Alexis Moinet, Arnaud Joly, Penny Karanasou, Thomas Drugman
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech. ICASSP (2021)
Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo-Trueba, Thomas Drugman
A Learned Conditional Prior for the VAE Acoustic Space of a TTS System. Interspeech (2021)
Zack Hodari, Alexis Moinet, Sri Karlapati, Jaime Lorenzo-Trueba, Thomas Merritt, Arnaud Joly, Ammar Abbas, Penny Karanasou, Thomas Drugman
Camp: A Two-Stage Approach to Modelling Prosody in Context. ICASSP (2021)
Ammar Abbas, Bajibabu Bollepalli, Alexis Moinet, Arnaud Joly, Penny Karanasou, Peter Makarov, Simon Slangen, Sri Karlapati, Thomas Drugman
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech. CoRR (2021)
Sri Karlapati, Ammar Abbas, Zack Hodari, Alexis Moinet, Arnaud Joly, Penny Karanasou, Thomas Drugman
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech. CoRR (2020)
Andrew Johnson, Penny Karanasou, Judith Gaspers, Dietrich Klakow
Cross-lingual Transfer Learning for Japanese Named Entity Recognition. NAACL-HLT (2) (2019)
Chunyang Wu, Mark J. F. Gales, Anton Ragni, Penny Karanasou, Khe Chai Sim
Improving Interpretability and Regularization in Deep Learning. IEEE ACM Trans. Audio Speech Lang. Process. 26 (2) (2018)
Judith Gaspers, Penny Karanasou, Rajen Chatterjee
Selecting Machine-Translated Data for Quick Bootstrapping of a Natural Language Understanding System. CoRR (2018)
Judith Gaspers, Penny Karanasou, Rajen Chatterjee
Selecting Machine-Translated Data for Quick Bootstrapping of a Natural Language Understanding System. NAACL-HLT (3) (2018)
Penny Karanasou, Chunyang Wu, Mark J. F. Gales, Philip C. Woodland
I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models. IEEE ACM Trans. Audio Speech Lang. Process. 25 (4) (2017)
Pierre Lanchantin, Mark J. F. Gales, Penny Karanasou, Xunying Liu, Yanman Qian, Linlin Wang, Philip C. Woodland, Chao Zhang
Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems. INTERSPEECH (2016)
Chunyang Wu, Penny Karanasou, Mark J. F. Gales, Khe Chai Sim
Stimulated Deep Neural Network for Speech Recognition. INTERSPEECH (2016)
Chunyang Wu, Penny Karanasou, Mark J. F. Gales
Combining i-vector representation and structured neural networks for rapid adaptation. ICASSP (2016)
Pierre Lanchantin, Mark J. F. Gales, Penny Karanasou, Xunying Liu, Yanmin Qian, Linlin Wang, Philip C. Woodland, Chao Zhang
The development of the cambridge university alignment systems for the multi-genre broadcast challenge. ASRU (2015)
Philip C. Woodland, Xunying Liu, Yanmin Qian, Chao Zhang, Mark J. F. Gales, Penny Karanasou, Pierre Lanchantin, Linlin Wang
Cambridge university transcription systems for the multi-genre broadcast challenge. ASRU (2015)
Penny Karanasou, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Yanmin Qian, Linlin Wang, Philip C. Woodland, Chao Zhang
Speaker diarisation and longitudinal linking in multi-genre broadcast data. ASRU (2015)
Yulan Liu, Penny Karanasou, Thomas Hain
An investigation into speaker informed DNN front-end for LVCSR. ICASSP (2015)
Penny Karanasou, Mark J. F. Gales, Philip C. Woodland
I-vector estimation using informative priors for adaptation of deep neural networks. INTERSPEECH (2015)
Penny Karanasou, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland
Adaptation of deep neural network acoustic models using factorised i-vectors. INTERSPEECH (2014)
Penny Karanasou, François Yvon, Thomas Lavergne, Lori Lamel
Discriminative training of a phoneme confusion model for a dynamic lexicon in ASR. INTERSPEECH (2013)