Login / Signup
Turning a CLIP Model Into a Scene Text Spotter.
Wenwen Yu
Yuliang Liu
Xingkui Zhu
Haoyu Cao
Xing Sun
Xiang Bai
Published in:
IEEE Trans. Pattern Anal. Mach. Intell. (2024)
Keyphrases
</>
high level
natural images
visual features