A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection).
Lam Pham, Phat Lam, Tin Nguyen, Hieu Tang, Alexander Schindler
Published in: CoRR (2024)
Keyphrases
- video analysis
- deep learning
- soccer video
- video content analysis
- event detection
- video scene
- shot boundary detection
- video data
- video indexing
- video annotation
- video indexing and retrieval
- unsupervised learning
- multimedia
- multimodal
- audio visual
- sports video
- weakly supervised
- object detection
- visual information
- signal processing
- object recognition
- pattern recognition
- multiple modalities
- machine learning
- transfer learning
- text mining
- image processing