The VVAD-LRS3 Dataset for Visual Voice Activity Detection.
Adrian Lubitz
Matias Valdenegro-Toro
Frank Kirchner
Published in:
VISIGRAPP (2: HUCAPP) (2023)
Keyphrases
voice activity detection
visual features
visual search
visual information
noisy environments
database
feature set
speech recognition
search engine
web pages
low level
non-stationary
bag of words
Gaussian mixture model
synthetic datasets
visual appearance