Login / Signup

WordScape: a Pipeline to extract multilingual, visually rich Documents with Layout Annotations from Web Crawl Data.

Maurice WeberCarlo SiebenschuhRory ButlerAnton AlexandrovValdemar ThannerGeorgios TsolakisHaris JabbarIan T. FosterBo LiRick StevensCe Zhang
Published in: CoRR (2023)
Keyphrases