Login / Signup
AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding.
Alessandro Suglia
Claudio Greco
Katie Baker
Jose L. Part
Ioannis Papaioannou
Arash Eshghi
Ioannis Konstas
Oliver Lemon
Published in:
CoRR (2024)
Keyphrases
</>
computational model
probabilistic model
formal model
video analysis
mathematical model
theoretical framework
multi modal
intelligent systems
artificial intelligence
video sequences
objective function
high level
statistical model
bayesian framework
theoretical foundation
video clips
machine learning
neural network