Ammar Abbas

Publication Activity (10 Years)

Years Active: 2007-2024
Publications (10 Years): 21

Top Topics

Diffusion Models

Haptic Feedback

Top Venues

Publications

Mateusz Lajszczak, Guillermo Cámbara, Yang Li, Fatih Beyhan, Arent van Korlaar, Fan Yang, Arnaud Joly, Álvaro Martín-Cortinas, Ammar Abbas, Adam Michalski, Alexis Moinet, Sri Karlapati, Ewa Muszynska, Haohan Guo, Bartosz Putrycz, Soledad López Gambino, Kayeon Yoo, Elena Sokolova, Thomas Drugman
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data. CoRR (2024)
Arnaud Joly, Marco Nicolis, Ekaterina Peterova, Alessandro Lombardi, Ammar Abbas, Arent van Korlaar, Aman Hussain, Parul Sharma, Alexis Moinet, Mateusz Lajszczak, Penny Karanasou, Antonio Bonafonte, Thomas Drugman, Elena Sokolova
Controllable Emphasis with zero data for text-to-speech. CoRR (2023)
Guangyan Zhang, Thomas Merritt, Manuel Sam Ribeiro, Biel Tura Vecino, Kayoko Yanagisawa, Kamil Pokora, Abdelhamid Ezzerg, Sebastian Cygert, Ammar Abbas, Piotr Bilinski, Roberto Barra-Chicote, Daniel Korzekwa, Jaime Lorenzo-Trueba
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech. INTERSPEECH (2023)
Ammar Abbas, Sri Karlapati, Bastian Schnell, Penny Karanasou, Marcel Granero Moya, Amith Nagaraj, Ayman Boustati, Nicole Peinelt, Alexis Moinet, Thomas Drugman
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer. CoRR (2023)
Arnaud Joly, Marco Nicolis, Ekaterina Peterova, Alessandro Lombardi, Ammar Abbas, Arent van Korlaar, Aman Hussain, Parul Sharma, Alexis Moinet, Mateusz Lajszczak, Penny Karanasou, Antonio Bonafonte, Thomas Drugman, Elena Sokolova
Controllable Emphasis with zero data for text-to-speech. SSW (2023)
George A. Baky, Krolus S. Hebeish, Mohamed H. Mohamed, Ammar Abbas, Kirolos E. Awni, Sotir Usama, Mohamed M. Gad, Mai O. Sallam
CPW-Fed Bow-Tie Antenna for Ambient RF Energy Harvesting Applications. NILES (2023)
Ammar Abbas, Sri Karlapati, Bastian Schnell, Penny Karanasou, Marcel Granero Moya, Amith Nagaraj, Ayman Boustati, Nicole Peinelt, Alexis Moinet, Thomas Drugman
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer. INTERSPEECH (2023)
Guangyan Zhang, Thomas Merritt, Manuel Sam Ribeiro, Biel Tura Vecino, Kayoko Yanagisawa, Kamil Pokora, Abdelhamid Ezzerg, Sebastian Cygert, Ammar Abbas, Piotr Bilinski, Roberto Barra-Chicote, Daniel Korzekwa, Jaime Lorenzo-Trueba
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech. CoRR (2023)
Sri Karlapati, Penny Karanasou, Mateusz Lajszczak, Ammar Abbas, Alexis Moinet, Peter Makarov, Ray Li, Arent van Korlaar, Simon Slangen, Thomas Drugman
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer. CoRR (2022)
Ammar Abbas, Thomas Merritt, Alexis Moinet, Sri Karlapati, Ewa Muszynska, Simon Slangen, Elia Gatti, Thomas Drugman
Expressive, Variable, and Controllable Duration Modelling in TTS. CoRR (2022)
Peter Makarov, Ammar Abbas, Mateusz Lajszczak, Arnaud Joly, Sri Karlapati, Alexis Moinet, Thomas Drugman, Penny Karanasou
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody. CoRR (2022)
Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo-Trueba, Thomas Drugman
A learned conditional prior for the VAE acoustic space of a TTS system. CoRR (2021)
Ammar Abbas, Bajibabu Bollepalli, Alexis Moinet, Arnaud Joly, Penny Karanasou, Peter Makarov, Simon Slangens, Sri Karlapati, Thomas Drugman
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech. SSW (2021)
Sri Karlapati, Ammar Abbas, Zack Hodari, Alexis Moinet, Arnaud Joly, Penny Karanasou, Thomas Drugman
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech. ICASSP (2021)
Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo-Trueba, Thomas Drugman
A Learned Conditional Prior for the VAE Acoustic Space of a TTS System. Interspeech (2021)
Zack Hodari, Alexis Moinet, Sri Karlapati, Jaime Lorenzo-Trueba, Thomas Merritt, Arnaud Joly, Ammar Abbas, Penny Karanasou, Thomas Drugman
Camp: A Two-Stage Approach to Modelling Prosody in Context. ICASSP (2021)
Ammar Abbas, Bajibabu Bollepalli, Alexis Moinet, Arnaud Joly, Penny Karanasou, Peter Makarov, Simon Slangen, Sri Karlapati, Thomas Drugman
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech. CoRR (2021)
Sri Karlapati, Ammar Abbas, Zack Hodari, Alexis Moinet, Arnaud Joly, Penny Karanasou, Thomas Drugman
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech. CoRR (2020)
Ammar Abbas, Andrew Zisserman
A Geometric Approach to Obtain a Bird's Eye View from an Image. CoRR (2019)
Hiba Ovais Latifee, Ammar Abbas, Taha Ahmed Siddiqui, Muhammad Nabeel, Muhammad Khurram
Assistive mobility cane exploiting skin-stroke tactile haptic feedback mechanism for visually impaired persons. ROBIO (2017)
Kashan Aqeel, Urooj Naveed, Faarah Fatima, Farah Haq, M. Arshad, Ammar Abbas, Muhammad Nabeel, Muhammad Khurram
Skin stroking haptic feedback glove for assisting blinds in navigation. ROBIO (2017)
Atiya Azmi, Nadia Ishaque, Ammar Abbas, Safeeullah Soomro
VCAN-Controller Area Network Based Human Vital Sign Data Transmission Protocol. CSEE (1) (2011)
Ammar Abbas, Sven Hoefler, Basel Fardi, Gerd Wanielik
Stereo vision based pedestrian detection using B-spline modeling. ICVES (2008)
Basel Fardi, Ammar Abbas, Gerd Wanielik
Enhanced Disparity Computation for ADAS Applications. GI Jahrestagung (2) (2007)