Login / Signup
Is my Automatic Audio Captioning System so Bad? SPIDEr-max: A Metric to Consider Several Caption Candidates.
Etienne Labbé
Thomas Pellegrini
Julien Pinquier
Published in:
DCASE (2022)
Keyphrases
</>
multimedia
semi automatic
data sets
fully automatic
distance measure
speech music discrimination
visual features
visual information
web mining
metric space
evaluation metrics
audio visual
similarity metric
audio signals
text extraction
caption text