Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring.
Yufei Zhan, Yousong Zhu, Hongyin Zhao, Fan Yang, Ming Tang, Jinqiao Wang
Published in: CoRR (2024)
Keyphrases
- high resolution
- low resolution
- super resolution
- high frequency
- multi modal
- high resolution images
- image processing
- field of view
- sonar images
- remote sensing
- high quality
- spatial resolution
- image generation
- visual perception
- low resolution depth
- multimodal interaction
- audio visual
- multi stream
- human perception
- multi party
- machine intelligence
- cross modal
- laser scanner
- super resolution reconstruction
- magnetic resonance images
- multimodal data
- color vision
- high resolution color
- face images