Multimodal audio-visual robot fusing 3D CNN and CRNN for player behavior recognition and prediction in basketball matches.
Haiyan Wang
Published in: Frontiers Neurorobotics (2024)
Keyphrases
- audio visual
- behavior recognition
- multi modal
- emotion recognition
- sports video
- sound source
- visual information
- multi stream
- mobile robot
- multimodal fusion
- visual data
- multimedia
- vision system
- humanoid robot
- audio features
- selective attention
- audio visual speech recognition
- neural network
- biologically inspired
- visual features
- image data
- image retrieval