Phone-to-Audio Alignment without Text: A Semi-Supervised Approach.
Jian ZhuCong ZhangDavid JurgensPublished in: ICASSP (2022)
Keyphrases
- semi supervised
- text graphics
- unlabeled data
- semi supervised learning
- labeled data
- information retrieval
- visual information
- text retrieval
- multi view
- mobile phone
- word level
- text mining
- image alignment
- semi supervised classification
- text to speech
- human language
- cross media retrieval
- semantic information
- document analysis
- active learning
- keywords
- multimedia
- database
- audio visual
- information retrieval systems
- pairwise constraints
- speaker identification
- pairwise
- spoken documents