AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding.

Published in: CoRR (2024)

Keyphrases