Language Recognition for Telephone and Video Speech: The JHU HLTCOE Submission for NIST LRE17.
Alan McCreeDavid SnyderGregory SellDaniel Garcia-RomeroPublished in: Odyssey (2018)
Keyphrases
- recognition engine
- human activities
- text to speech
- language acquisition
- text to speech synthesis
- recognition rate
- recognition accuracy
- speech recognition
- video sequences
- video data
- programming language
- speech corpus
- activity recognition
- object recognition
- speaker diarization
- english text
- multimedia
- visual speech
- broadcast news
- spoken language
- content based video retrieval
- spoken words
- speech synthesis
- automatic speech recognition systems
- pattern recognition
- video streams
- language learning
- video frames
- natural language
- audio stream
- language generation
- speech recognition systems
- action recognition
- video database
- video retrieval
- speech signal
- noisy environments
- speech sounds
- video content
- video clips
- digital audio
- audio visual
- national institute of standards and technology
- human actions
- gesture recognition
- hidden markov models
- automatic transcription
- language processing