Shikra: Unleashing Multimodal LLM's Referential Dialogue Magic.
Keqin ChenZhao ZhangWeili ZengRichong ZhangFeng ZhuRui ZhaoPublished in: CoRR (2023)
Keyphrases
- mixed initiative
- multi modal
- dialogue management
- dialogue system
- human machine
- affect detection
- spoken dialogue systems
- turn taking
- natural language interfaces
- data sets
- natural language
- spoken language
- man machine
- multimodal interaction
- interactive question answering
- multimodal data
- conversational agents
- conversational agent
- mutual information
- multimedia
- genetic algorithm