Login / Signup
Leveraging Text Representation and Face-head Tracking for Long-form Multimodal Semantic Relation Understanding.
Raksha Ramesh
Vishal Anand
Zifan Chen
Yifei Dong
Yun Chen
Ching-Yung Lin
Published in:
ACM Multimedia (2022)
Keyphrases
</>
head tracking
semantic relations
text representation
head motion
particle filter
face detector
bag of words
head movements
multiscale
document representation
human faces
wordnet
multi modal
text classification
eye movements
machine learning
text retrieval
co occurrence
text mining