Phone-to-audio alignment without text: A Semi-supervised Approach.
Jian Zhu, Cong Zhang, David Jurgens
Published in: CoRR (2021)
Keyphrases
- semi-supervised
- text graphics
- text retrieval
- information retrieval
- semi-supervised learning
- signal processing
- cross media retrieval
- multimedia
- keywords
- text to speech
- text mining
- text data
- unlabeled data
- pairwise
- human language
- image alignment
- labeled data
- text documents
- visual information
- audio visual
- multi-view
- sequence alignment
- pairwise constraints
- word level
- database
- learning algorithm
- visual features
- audio features
- natural language processing
- training data
- spoken documents
- active learning