TraveLER: A Multi-LMM Agent Framework for Video Question-Answering.
Chuyi ShangAmos YouSanjay SubramanianTrevor DarrellRoei HerzigPublished in: CoRR (2024)
Keyphrases
- question answering
- natural language processing
- open domain question answering
- qa clef
- natural language questions
- natural language
- question classification
- named entities
- passage retrieval
- cross language
- probabilistic model
- information extraction
- information retrieval
- syntactic information
- relation extraction
- video sequences
- multimedia
- video content
- video frames
- multi modal
- textual entailment recognition
- data mining