Speech balloon and speaker association for comics and manga understanding.
Christophe RigaudNam Le ThanhJean-Christophe BurieJean-Marc OgierMotoi IwataEiki ImazuKoichi KisePublished in: ICDAR (2015)
Keyphrases
- speech recognition
- audio visual
- speaker recognition
- automatic speech recognition
- speaker identification
- speaker verification
- prosodic features
- speaker dependent
- automatic speech recognition systems
- speech synthesis
- speech signal
- broadcast news
- speaker diarization
- vocal tract
- speaker adaptation
- synthesized speech
- multi modal
- noisy environments
- language model
- text to speech
- audio stream
- speech sounds
- automatic transcription
- vector quantization