Login / Signup
Valdemar Thanner
Publication Activity (10 Years)
Years Active: 2023-2023
Publications (10 Years): 2
Top Topics
Semantic Annotation
Meta Information
Web Data
Data Points
Top Venues
CoRR
NeurIPS
</>
Publications
</>
Maurice Weber
,
Carlo Siebenschuh
,
Rory Butler
,
Anton Alexandrov
,
Valdemar Thanner
,
Georgios Tsolakis
,
Haris Jabbar
,
Ian T. Foster
,
Bo Li
,
Rick Stevens
,
Ce Zhang
WordScape: a Pipeline to extract multilingual, visually rich Documents with Layout Annotations from Web Crawl Data.
NeurIPS
(2023)
Maurice Weber
,
Carlo Siebenschuh
,
Rory Butler
,
Anton Alexandrov
,
Valdemar Thanner
,
Georgios Tsolakis
,
Haris Jabbar
,
Ian T. Foster
,
Bo Li
,
Rick Stevens
,
Ce Zhang
WordScape: a Pipeline to extract multilingual, visually rich Documents with Layout Annotations from Web Crawl Data.
CoRR
(2023)