Login / Signup
m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks.
Zixian Ma
Weikai Huang
Jieyu Zhang
Tanmay Gupta
Ranjay Krishna
Published in:
CoRR (2024)
Keyphrases
</>
multi modal
multi step
single step
high dimensional
audio visual
cross modal
multi modality
semantic concepts
video search
neural network
support vector
knn
semi supervised
k nearest neighbor