HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model.

Khoa Vo, Thinh Phan, Kashu Yamazaki, Minh Tran, Ngan Le
Published in: CoRR (2024)
Keyphrases
  • language model
  • video sequences
  • n-gram
  • language modeling
  • information retrieval
  • active learning
  • generative model
  • unsupervised learning
  • context sensitive