Login / Signup

DialCLIP: Empowering Clip As Multi-Modal Dialog Retriever.

Zhichao YinBinyuan HuiMin YangFei HuangYongbin Li
Published in: ICASSP (2024)
Keyphrases
  • multi modal
  • multi modality
  • video clips
  • audio visual
  • high dimensional
  • image annotation
  • video search
  • cross modal
  • smart room
  • machine learning
  • feature selection
  • humanoid robot
  • semantic concepts