​
Login / Signup
Quoc Truong Do
ORCID
Publication Activity (10 Years)
Years Active: 2014-2023
Publications (10 Years): 22
Top Topics
Speech Recognition
Linear Regression
Deep Learning
Translation Model
Top Venues
INTERSPEECH
O-COCOSDA
CoRR
IEEE ACM Trans. Audio Speech Lang. Process.
</>
Publications
</>
Thai Binh Nguyen
,
Le Duc Minh Nhat
,
Quang Minh Nguyen
,
Quoc Truong Do
,
Chi Mai Luong
,
Alexander Waibel
AdapITN: A Fast, Reliable, and Dynamic Adaptive Inverse Text Normalization.
ICASSP
(2023)
Thu Hien Nguyen
,
Thai Binh Nguyen
,
Quoc Truong Do
,
Tuan-Linh Nguyen
End-to-end named entity recognition for Vietnamese speech.
O-COCOSDA 2022
(2022)
Chung Tran Quang
,
Quang Minh Nguyen
,
Pham Ngoc Phuong
,
Quoc Truong Do
Improving Speaker Verification in Noisy Environment Using DNN Classifier.
RIVF
(2021)
Thi Thu Hien Nguyen
,
Thai Binh Nguyen
,
Ngoc Phuong Pham
,
Quoc Truong Do
,
Tu Luc Le
,
Chi Mai Luong
Toward Human-Friendly ASR Systems: Recovering Capitalization and Punctuation for Vietnamese Text.
IEICE Trans. Inf. Syst.
(8) (2021)
Pham Ngoc Phuong
,
Chung Tran Quang
,
Quoc Truong Do
,
Mai Chi Luong
A Study on Neural-Network-Based Text-to-Speech Adaptation Techniques for Vietnamese.
O-COCOSDA
(2021)
Thai Binh Nguyen
,
Quang Minh Nguyen
,
Hien Nguyen Thi Thu
,
Quoc Truong Do
,
Luong Chi Mai
Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models.
INTERSPEECH
(2020)
Thai Binh Nguyen
,
Quang Minh Nguyen
,
Hien Nguyen Thi Thu
,
Quoc Truong Do
,
Luong Chi Mai
Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models.
CoRR
(2020)
Binh Nguyen
,
Vu Bao Hung Nguyen
,
Hien Nguyen
,
Pham Ngoc Phuong
,
The-Loc Nguyen
,
Quoc Truong Do
,
Luong Chi Mai
Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging.
CoRR
(2019)
Binh Nguyen
,
Vu Bao Hung Nguyen
,
Hien Nguyen
,
Pham Ngoc Phuong
,
The-Loc Nguyen
,
Quoc Truong Do
,
Luong Chi Mai
Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging.
O-COCOSDA
(2019)
Pham Ngoc Phuong
,
Quoc Truong Do
,
Luong Chi Mai
A high quality and phonetic balanced speech corpus for Vietnamese.
CoRR
(2019)
Hien Nguyen Thi Thu
,
Binh Nguyen Thai
,
Vu Bao Hung Nguyen
,
Quoc Truong Do
,
Luong Chi Mai
,
Huyen Nguyen Thi Minh
Recovering Capitalization for Automatic Speech Recognition of Vietnamese using Transformer and Chunk Merging.
KSE
(2019)
Thai Binh Nguyen
,
Quang Minh Nguyen
,
Thu Hien Nguyen
,
Pham Ngoc Phuong
,
The-Loc Nguyen
,
Quoc Truong Do
VAIS Hate Speech Detection System: A Deep Learning based Approach for System Combination.
CoRR
(2019)
Sashi Novitasari
,
Quoc Truong Do
,
Sakriani Sakti
,
Dessi Puji Lestari
,
Satoshi Nakamura
Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas.
LREC
(2018)
Quoc Truong Do
,
Sakriani Sakti
,
Satoshi Nakamura
Toward Multi-Features Emphasis Speech Translation: Assessment of Human Emphasis Production and Perception with Speech and Text Clues.
SLT
(2018)
Sahoko Nakayama
,
Takatomo Kano
,
Quoc Truong Do
,
Sakriani Sakti
,
Satoshi Nakamura
Japanese-English Code-Switching Speech Data Construction.
O-COCOSDA
(2018)
Sashi Novitasari
,
Quoc Truong Do
,
Sakriani Sakti
,
Dessi Puji Lestari
,
Satoshi Nakamura
Multi-Modal Multi-Task Deep Learning For Speaker And Emotion Recognition Of TV-Series Data.
O-COCOSDA
(2018)
Quoc Truong Do
,
Sakriani Sakti
,
Satoshi Nakamura
Sequence-to-Sequence Models for Emphasis Speech Translation.
IEEE ACM Trans. Audio Speech Lang. Process.
26 (10) (2018)
Quoc Truong Do
,
Tomoki Toda
,
Graham Neubig
,
Sakriani Sakti
,
Satoshi Nakamura
Preserving Word-Level Emphasis in Speech-to-Speech Translation.
IEEE ACM Trans. Audio Speech Lang. Process.
25 (3) (2017)
Quoc Truong Do
,
Sakriani Sakti
,
Satoshi Nakamura
Toward Expressive Speech Translation: A Unified Sequence-to-Sequence LSTMs Approach for Translating Words and Emphasis.
INTERSPEECH
(2017)
Quoc Truong Do
,
Tomoki Toda
,
Graham Neubig
,
Sakriani Sakti
,
Satoshi Nakamura
A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training.
INTERSPEECH
(2016)
Oliver Adams
,
Graham Neubig
,
Trevor Cohn
,
Steven Bird
,
Quoc Truong Do
,
Satoshi Nakamura
Learning a Lexicon and Translation Model from Phoneme Lattices.
EMNLP
(2016)
Quoc Truong Do
,
Sakriani Sakti
,
Graham Neubig
,
Satoshi Nakamura
Transferring Emphasis in Speech Translation Using Hard-Attentional Neural Network Models.
INTERSPEECH
(2016)
Michael Heck
,
Quoc Truong Do
,
Sakriani Sakti
,
Graham Neubig
,
Satoshi Nakamura
The NAIST English speech recognition system for IWSLT 2015.
IWSLT (Evaluation Campaign)
(2015)
Quoc Truong Do
,
Sakriani Sakti
,
Graham Neubig
,
Tomoki Toda
,
Satoshi Nakamura
Improving translation of emphasis with pause prediction in speech-to-speech translation systems.
IWSLT
(2015)
Quoc Truong Do
,
Satoshi Nakamura
,
Marc Delcroix
,
Takaaki Hori
WFST-based structural classification integrating dnn acoustic features and RNN language features for speech recognition.
ICASSP
(2015)
Quoc Truong Do
,
Michael Heck
,
Sakriani Sakti
,
Graham Neubig
,
Tomoki Toda
,
Satoshi Nakamura
The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function.
ASRU
(2015)
Quoc Truong Do
,
Shinnosuke Takamichi
,
Sakriani Sakti
,
Graham Neubig
,
Tomoki Toda
,
Satoshi Nakamura
Preserving word-level emphasis in speech-to-speech translation using linear regression HSMMs.
INTERSPEECH
(2015)
Quoc Truong Do
,
Graham Neubig
,
Sakriani Sakti
,
Tomoki Toda
,
Satoshi Nakamura
Collection and analysis of a Japanese-English emphasized speech corpora.
O-COCOSDA
(2014)