SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams.
Madina Abdrakhmanova, Askat Kuzdeuov, Sheikh Jarju, Yerbolat Khassanov, Michael Lewis, Huseyin Atakan Varol
Published in: Sensors (2021)
Keyphrases
- video streams
- video data
- multimodal information
- thermal images
- video content
- compressed video
- infrared
- visual information
- video frames
- multimedia
- eye contact
- visual features
- video material
- video clips
- video analysis
- visual data
- audio visual
- real time
- database
- multimodal interaction
- appearance model
- bit rate
- multi modal
- detecting moving objects
- infrared camera
- semantic video analysis