Turning a CLIP Model Into a Scene Text Spotter.

Wenwen Yu, Yuliang Liu, Xingkui Zhu, Haoyu Cao, Xing Sun, Xiang Bai
Published in: IEEE Trans. Pattern Anal. Mach. Intell. (2024)
Keyphrases
  • high level
  • natural images
  • visual features