OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog.

Adnen Abdessaied, Manuel von Hochmeister, Andreas Bulling
Published in: CoRR (2024)
Keyphrases