E-Talk: Accelerating Active Speaker Detection with Audio-Visual Fusion and Edge-Cloud Computing.
Xiaojing Yu, Lan Zhang, Xiang-Yang Li
Published in: SECON (2023)
Keyphrases
- audio visual
- cloud computing
- person authentication
- multi modal
- multimodal fusion
- computing resources
- cloud computing environment
- data center
- speaker verification
- data management
- visual information
- service providers
- multimedia
- multi stream
- visual data
- cloud computing platform
- cloud services
- cloud storage
- data model
- feature selection