Combining Acoustic Embeddings and Decoding Features for End-of-Utterance Detection in Real-Time Far-Field Speech Recognition Systems.
Roland Maas, Ariya Rastrow, Chengyuan Ma, Guitang Lan, Kyle Goehner, Gautam Tiwari, Shaun Joseph, Björn Hoffmeister
Published in: ICASSP (2018)