TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real World.
Hongpeng Lin
Ludan Ruan
Wenke Xia
Peiyu Liu
Jingyuan Wen
Yixin Xu
Di Hu
Ruihua Song
Wayne Xin Zhao
Qin Jin
Zhiwu Lu
Published in:
ACM Multimedia (2023)
Keyphrases
multi modal
real world
synthetic datasets
multi modality
high dimensional
audio visual
dialogue system
cross modal
feature set
semantic concepts
video search
image annotation
computer vision
feature extraction
image analysis