Attention Fusion for Audio-Visual Person Verification Using Multi-Scale Features.
Stefan Hörmann, Abdul Moiz, Martin Knoche, Gerhard Rigoll
Published in: FG (2020)
Keyphrases
- audio-visual
- person authentication
- multimodal fusion
- multiscale
- multi-modal
- multimodal biometrics
- audio features
- low-level
- feature vectors
- image features
- computer vision
- visual information
- domain knowledge
- multimedia
- image representation
- co-occurrence
- multi-stream
- feature extraction
- data sets
- audio-visual speech recognition