Multimodal Unsupervised Domain Adaptation for Predicting Speaker Characteristics from Video.
Chinchu ThomasPrateksha UdhayananAyush YadavSeethamraju PurvajDinesh Babu JayagopiPublished in: SN Comput. Sci. (2024)
Keyphrases
- audio visual
- video streams
- multimedia
- video data
- video sequences
- video database
- video frames
- video clips
- video content
- multimodal information
- video segmentation
- spatial and temporal
- online video
- video retrieval
- video surveillance
- real time
- speech recognition
- multi modal
- action recognition
- speaker verification
- real time video
- spatio temporal
- data sets