Login / Signup
Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech.
Quan Wang
Yang Yu
Jason Pelecanos
Yiling Huang
Ignacio Lopez-Moreno
Published in:
CoRR (2022)
Keyphrases
</>
language identification
speaker identification
english text
multi lingual
speaker verification
speech recognition
speech signal
feature extraction
data streams
broadcast news
pattern recognition
document images
automatic speech recognition
hidden markov models
temporal information
visual attention