Quoc Truong Do

Publication Activity (10 Years)

Years Active: 2014-2023
Publications (10 Years): 22

Top Topics

Speech Recognition

Linear Regression

Translation Model

Top Venues

IEEE ACM Trans. Audio Speech Lang. Process.

Publications

Thai Binh Nguyen, Le Duc Minh Nhat, Quang Minh Nguyen, Quoc Truong Do, Chi Mai Luong, Alexander Waibel
AdapITN: A Fast, Reliable, and Dynamic Adaptive Inverse Text Normalization. ICASSP (2023)
Thu Hien Nguyen, Thai Binh Nguyen, Quoc Truong Do, Tuan-Linh Nguyen
End-to-end named entity recognition for Vietnamese speech. O-COCOSDA 2022 (2022)
Chung Tran Quang, Quang Minh Nguyen, Pham Ngoc Phuong, Quoc Truong Do
Improving Speaker Verification in Noisy Environment Using DNN Classifier. RIVF (2021)
Thi Thu Hien Nguyen, Thai Binh Nguyen, Ngoc Phuong Pham, Quoc Truong Do, Tu Luc Le, Chi Mai Luong
Toward Human-Friendly ASR Systems: Recovering Capitalization and Punctuation for Vietnamese Text. IEICE Trans. Inf. Syst. (8) (2021)
Pham Ngoc Phuong, Chung Tran Quang, Quoc Truong Do, Mai Chi Luong
A Study on Neural-Network-Based Text-to-Speech Adaptation Techniques for Vietnamese. O-COCOSDA (2021)
Thai Binh Nguyen, Quang Minh Nguyen, Hien Nguyen Thi Thu, Quoc Truong Do, Luong Chi Mai
Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models. INTERSPEECH (2020)
Thai Binh Nguyen, Quang Minh Nguyen, Hien Nguyen Thi Thu, Quoc Truong Do, Luong Chi Mai
Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models. CoRR (2020)
Binh Nguyen, Vu Bao Hung Nguyen, Hien Nguyen, Pham Ngoc Phuong, The-Loc Nguyen, Quoc Truong Do, Luong Chi Mai
Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging. CoRR (2019)
Binh Nguyen, Vu Bao Hung Nguyen, Hien Nguyen, Pham Ngoc Phuong, The-Loc Nguyen, Quoc Truong Do, Luong Chi Mai
Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging. O-COCOSDA (2019)
Pham Ngoc Phuong, Quoc Truong Do, Luong Chi Mai
A high quality and phonetic balanced speech corpus for Vietnamese. CoRR (2019)
Hien Nguyen Thi Thu, Binh Nguyen Thai, Vu Bao Hung Nguyen, Quoc Truong Do, Luong Chi Mai, Huyen Nguyen Thi Minh
Recovering Capitalization for Automatic Speech Recognition of Vietnamese using Transformer and Chunk Merging. KSE (2019)
Thai Binh Nguyen, Quang Minh Nguyen, Thu Hien Nguyen, Pham Ngoc Phuong, The-Loc Nguyen, Quoc Truong Do
VAIS Hate Speech Detection System: A Deep Learning based Approach for System Combination. CoRR (2019)
Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Puji Lestari, Satoshi Nakamura
Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas. LREC (2018)
Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura
Toward Multi-Features Emphasis Speech Translation: Assessment of Human Emphasis Production and Perception with Speech and Text Clues. SLT (2018)
Sahoko Nakayama, Takatomo Kano, Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura
Japanese-English Code-Switching Speech Data Construction. O-COCOSDA (2018)
Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Puji Lestari, Satoshi Nakamura
Multi-Modal Multi-Task Deep Learning For Speaker And Emotion Recognition Of TV-Series Data. O-COCOSDA (2018)
Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura
Sequence-to-Sequence Models for Emphasis Speech Translation. IEEE ACM Trans. Audio Speech Lang. Process. 26 (10) (2018)
Quoc Truong Do, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura
Preserving Word-Level Emphasis in Speech-to-Speech Translation. IEEE ACM Trans. Audio Speech Lang. Process. 25 (3) (2017)
Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura
Toward Expressive Speech Translation: A Unified Sequence-to-Sequence LSTMs Approach for Translating Words and Emphasis. INTERSPEECH (2017)
Quoc Truong Do, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura
A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training. INTERSPEECH (2016)
Oliver Adams, Graham Neubig, Trevor Cohn, Steven Bird, Quoc Truong Do, Satoshi Nakamura
Learning a Lexicon and Translation Model from Phoneme Lattices. EMNLP (2016)
Quoc Truong Do, Sakriani Sakti, Graham Neubig, Satoshi Nakamura
Transferring Emphasis in Speech Translation Using Hard-Attentional Neural Network Models. INTERSPEECH (2016)
Michael Heck, Quoc Truong Do, Sakriani Sakti, Graham Neubig, Satoshi Nakamura
The NAIST English speech recognition system for IWSLT 2015. IWSLT (Evaluation Campaign) (2015)
Quoc Truong Do, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura
Improving translation of emphasis with pause prediction in speech-to-speech translation systems. IWSLT (2015)
Quoc Truong Do, Satoshi Nakamura, Marc Delcroix, Takaaki Hori
WFST-based structural classification integrating dnn acoustic features and RNN language features for speech recognition. ICASSP (2015)
Quoc Truong Do, Michael Heck, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura
The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function. ASRU (2015)
Quoc Truong Do, Shinnosuke Takamichi, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura
Preserving word-level emphasis in speech-to-speech translation using linear regression HSMMs. INTERSPEECH (2015)
Quoc Truong Do, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura
Collection and analysis of a Japanese-English emphasized speech corpora. O-COCOSDA (2014)