AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies.
José-M. Acosta-Triana, David Gimeno-Gómez, Carlos D. Martínez-Hinarejos
Published in: LREC/COLING (2024)
Keyphrases
- automatic annotation
- visual speech
- visual information
- hidden Markov models
- semantic annotation
- content based retrieval
- image annotation
- speaker identification
- audio signals
- broadcast news
- video signals
- noisy environments
- low level features
- speech signal
- multimedia
- acoustic features
- machine learning
- audio visual
- visual data
- audio signal
- speech recognition
- object detection
- high dimensional
- low level
- image data
- text to speech
- eye movements
- information retrieval systems