A Multimodality Framework for Creating Speaker/Non-Speaker Profile Databases for Real-World Video.
Jehanzeb Abbas, Charlie K. Dagli, Thomas S. Huang
Published in: CVPR (2007)
Keyphrases
- databases
- real world
- main contribution
- speech recognition
- real time
- speaker verification
- wide range
- audio visual
- video classification
- automatic speech recognition
- video content
- theoretical framework
- knowledge discovery
- video sequences
- metadata
- medical images
- synthetic data
- probabilistic model
- data sources
- video streams
- low level
- case study
- database