Alignment of Lyrics With Accompanied Singing Audio Based on Acoustic-Phonetic Vowel Likelihood Modeling.
Yu-Ren Chien, Hsin-Min Wang, Shyh-Kang Jeng
Published in: IEEE ACM Trans. Audio Speech Lang. Process. (2016)
Keyphrases
- acoustic features
- music information retrieval
- audio features
- speech signal
- prosodic features
- audio signal
- speech recognition
- speaker verification
- visual features
- automatic speech recognition
- music retrieval
- audio signals
- audio recordings
- audio visual
- speech synthesis
- audio stream
- music emotion classification
- multimedia
- noisy environments
- information retrieval systems
- word level
- speaker identification
- sound source
- genre classification
- vocal tract
- speaker independent
- maximum likelihood
- multi modal
- speech recognition systems
- environmental sounds
- information retrieval