CLIPPO: Image-and-Language Understanding from Pixels Only.

Michael Tschannen, Basil Mustafa, Neil Houlsby
Published in: CVPR (2023)
Keyphrases
  • input image
  • image pixels
  • language understanding
  • pixel values
  • neighboring pixels
  • image regions
  • low level
  • image segmentation
  • language processing
  • expert systems
  • user interface
  • knowledge based systems