Login / Signup

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding.

Kenton LeeMandar JoshiIulia TurcHexiang HuFangyu LiuJulian EisenschlosUrvashi KhandelwalPeter ShawMing-Wei ChangKristina Toutanova
Published in: CoRR (2022)
Keyphrases