Graphical user interface agents optimization for visual instruction grounding using multi-modal artificial intelligence systems.
Tassnim DardouriLaura MinkovaJessica López EspejelWalid DahhaneEl Hassane EttifouriPublished in: CoRR (2024)
Keyphrases
- multi modal
- graphical user interface
- artificial intelligence
- cross modal
- intelligent agents
- intelligent behavior
- video search
- multi agent
- graphical user interfaces
- intelligent systems
- expert systems
- audio visual
- multi agent systems
- multi modality
- user friendly
- image annotation
- high dimensional
- visual information
- machine learning
- visualization tool
- case study
- humanoid robot
- visual features
- knowledge base
- face recognition
- data analysis
- multiple modalities