Login / Signup
Ammar Abbas
ORCID
Publication Activity (10 Years)
Years Active: 2007-2024
Publications (10 Years): 21
Top Topics
Diffusion Models
Haptic Feedback
Top Venues
CoRR
ICASSP
INTERSPEECH
ROBIO
</>
Publications
</>
Mateusz Lajszczak
,
Guillermo Cámbara
,
Yang Li
,
Fatih Beyhan
,
Arent van Korlaar
,
Fan Yang
,
Arnaud Joly
,
Álvaro Martín-Cortinas
,
Ammar Abbas
,
Adam Michalski
,
Alexis Moinet
,
Sri Karlapati
,
Ewa Muszynska
,
Haohan Guo
,
Bartosz Putrycz
,
Soledad López Gambino
,
Kayeon Yoo
,
Elena Sokolova
,
Thomas Drugman
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data.
CoRR
(2024)
Arnaud Joly
,
Marco Nicolis
,
Ekaterina Peterova
,
Alessandro Lombardi
,
Ammar Abbas
,
Arent van Korlaar
,
Aman Hussain
,
Parul Sharma
,
Alexis Moinet
,
Mateusz Lajszczak
,
Penny Karanasou
,
Antonio Bonafonte
,
Thomas Drugman
,
Elena Sokolova
Controllable Emphasis with zero data for text-to-speech.
CoRR
(2023)
Guangyan Zhang
,
Thomas Merritt
,
Manuel Sam Ribeiro
,
Biel Tura Vecino
,
Kayoko Yanagisawa
,
Kamil Pokora
,
Abdelhamid Ezzerg
,
Sebastian Cygert
,
Ammar Abbas
,
Piotr Bilinski
,
Roberto Barra-Chicote
,
Daniel Korzekwa
,
Jaime Lorenzo-Trueba
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech.
INTERSPEECH
(2023)
Ammar Abbas
,
Sri Karlapati
,
Bastian Schnell
,
Penny Karanasou
,
Marcel Granero Moya
,
Amith Nagaraj
,
Ayman Boustati
,
Nicole Peinelt
,
Alexis Moinet
,
Thomas Drugman
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer.
CoRR
(2023)
Arnaud Joly
,
Marco Nicolis
,
Ekaterina Peterova
,
Alessandro Lombardi
,
Ammar Abbas
,
Arent van Korlaar
,
Aman Hussain
,
Parul Sharma
,
Alexis Moinet
,
Mateusz Lajszczak
,
Penny Karanasou
,
Antonio Bonafonte
,
Thomas Drugman
,
Elena Sokolova
Controllable Emphasis with zero data for text-to-speech.
SSW
(2023)
George A. Baky
,
Krolus S. Hebeish
,
Mohamed H. Mohamed
,
Ammar Abbas
,
Kirolos E. Awni
,
Sotir Usama
,
Mohamed M. Gad
,
Mai O. Sallam
CPW-Fed Bow-Tie Antenna for Ambient RF Energy Harvesting Applications.
NILES
(2023)
Ammar Abbas
,
Sri Karlapati
,
Bastian Schnell
,
Penny Karanasou
,
Marcel Granero Moya
,
Amith Nagaraj
,
Ayman Boustati
,
Nicole Peinelt
,
Alexis Moinet
,
Thomas Drugman
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer.
INTERSPEECH
(2023)
Guangyan Zhang
,
Thomas Merritt
,
Manuel Sam Ribeiro
,
Biel Tura Vecino
,
Kayoko Yanagisawa
,
Kamil Pokora
,
Abdelhamid Ezzerg
,
Sebastian Cygert
,
Ammar Abbas
,
Piotr Bilinski
,
Roberto Barra-Chicote
,
Daniel Korzekwa
,
Jaime Lorenzo-Trueba
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech.
CoRR
(2023)
Sri Karlapati
,
Penny Karanasou
,
Mateusz Lajszczak
,
Ammar Abbas
,
Alexis Moinet
,
Peter Makarov
,
Ray Li
,
Arent van Korlaar
,
Simon Slangen
,
Thomas Drugman
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.
CoRR
(2022)
Ammar Abbas
,
Thomas Merritt
,
Alexis Moinet
,
Sri Karlapati
,
Ewa Muszynska
,
Simon Slangen
,
Elia Gatti
,
Thomas Drugman
Expressive, Variable, and Controllable Duration Modelling in TTS.
CoRR
(2022)
Peter Makarov
,
Ammar Abbas
,
Mateusz Lajszczak
,
Arnaud Joly
,
Sri Karlapati
,
Alexis Moinet
,
Thomas Drugman
,
Penny Karanasou
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody.
CoRR
(2022)
Penny Karanasou
,
Sri Karlapati
,
Alexis Moinet
,
Arnaud Joly
,
Ammar Abbas
,
Simon Slangen
,
Jaime Lorenzo-Trueba
,
Thomas Drugman
A learned conditional prior for the VAE acoustic space of a TTS system.
CoRR
(2021)
Ammar Abbas
,
Bajibabu Bollepalli
,
Alexis Moinet
,
Arnaud Joly
,
Penny Karanasou
,
Peter Makarov
,
Simon Slangens
,
Sri Karlapati
,
Thomas Drugman
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech.
SSW
(2021)
Sri Karlapati
,
Ammar Abbas
,
Zack Hodari
,
Alexis Moinet
,
Arnaud Joly
,
Penny Karanasou
,
Thomas Drugman
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech.
ICASSP
(2021)
Penny Karanasou
,
Sri Karlapati
,
Alexis Moinet
,
Arnaud Joly
,
Ammar Abbas
,
Simon Slangen
,
Jaime Lorenzo-Trueba
,
Thomas Drugman
A Learned Conditional Prior for the VAE Acoustic Space of a TTS System.
Interspeech
(2021)
Zack Hodari
,
Alexis Moinet
,
Sri Karlapati
,
Jaime Lorenzo-Trueba
,
Thomas Merritt
,
Arnaud Joly
,
Ammar Abbas
,
Penny Karanasou
,
Thomas Drugman
Camp: A Two-Stage Approach to Modelling Prosody in Context.
ICASSP
(2021)
Ammar Abbas
,
Bajibabu Bollepalli
,
Alexis Moinet
,
Arnaud Joly
,
Penny Karanasou
,
Peter Makarov
,
Simon Slangen
,
Sri Karlapati
,
Thomas Drugman
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech.
CoRR
(2021)
Sri Karlapati
,
Ammar Abbas
,
Zack Hodari
,
Alexis Moinet
,
Arnaud Joly
,
Penny Karanasou
,
Thomas Drugman
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech.
CoRR
(2020)
Ammar Abbas
,
Andrew Zisserman
A Geometric Approach to Obtain a Bird's Eye View from an Image.
CoRR
(2019)
Hiba Ovais Latifee
,
Ammar Abbas
,
Taha Ahmed Siddiqui
,
Muhammad Nabeel
,
Muhammad Khurram
Assistive mobility cane exploiting skin-stroke tactile haptic feedback mechanism for visually impaired persons.
ROBIO
(2017)
Kashan Aqeel
,
Urooj Naveed
,
Faarah Fatima
,
Farah Haq
,
M. Arshad
,
Ammar Abbas
,
Muhammad Nabeel
,
Muhammad Khurram
Skin stroking haptic feedback glove for assisting blinds in navigation.
ROBIO
(2017)
Atiya Azmi
,
Nadia Ishaque
,
Ammar Abbas
,
Safeeullah Soomro
VCAN-Controller Area Network Based Human Vital Sign Data Transmission Protocol.
CSEE (1)
(2011)
Ammar Abbas
,
Sven Hoefler
,
Basel Fardi
,
Gerd Wanielik
Stereo vision based pedestrian detection using B-spline modeling.
ICVES
(2008)
Basel Fardi
,
Ammar Abbas
,
Gerd Wanielik
Enhanced Disparity Computation for ADAS Applications.
GI Jahrestagung (2)
(2007)