Rule-embedded network for audio-visual voice activity detection in live musical video streams.
Yuanbo Hou
Yi Deng
Bilei Zhu
Zejun Ma
Dick Botteldooren
Published in:
CoRR (2020)
Keyphrases
video streams
audio visual
video data
multi modal
voice activity detection
visual information
visual data
multimedia
sports video
multi stream
video content
video frames
audio visual speech recognition
hidden Markov models
high dimensional
neural network
genre classification