PEANUT: A Human-AI Collaborative Tool for Annotating Audio-Visual Data.
Zheng ZhangZheng NingChenliang XuYapeng TianToby Jia-Jun LiPublished in: CoRR (2023)
Keyphrases
- visual data
- visual information
- audio visual
- multimedia data
- high dimensional
- contextual information
- video data
- multimodal information
- image data
- visual features
- image sequences
- image content
- video sequences
- human motion
- high dimensional data
- metadata
- information extraction
- data sets
- low level
- human actions
- visual content
- machine learning