An automatic caption alignment mechanism for off-the-shelf speech recognition technologies.
Maria Federico, Marco Furini
Published in: Multim. Tools Appl. (2014)
Keyphrases
- speech recognition
- language model
- hidden markov models
- speech synthesis
- automatic speech recognition
- speech processing
- speech signal
- pattern recognition
- speech recognition systems
- keyword spotting
- speech recognizer
- speech recognition technology
- speech understanding
- data mining
- noisy environments
- visual features
- speech recognizers
- speech retrieval
- speaker recognition
- computer vision
- neural network
- digital video library
- cepstral coefficients
- speaker adaptation
- multimedia systems
- video retrieval
- signal processing
- image processing