Login / Signup
Karol Kaczmarek
Publication Activity (10 Years)
Years Active: 2019-2023
Publications (10 Years): 6
Top Topics
High Quality
Deep Web
Semantic Retrieval
Autoregressive
Top Venues
CoRR
NAACL-HLT (Findings)
ICDAR (3)
FedCSIS
</>
Publications
</>
Michal Turski
,
Tomasz Stanislawek
,
Karol Kaczmarek
,
Pawel Dyda
,
Filip Gralinski
CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl Data.
CoRR
(2023)
Michal Turski
,
Tomasz Stanislawek
,
Karol Kaczmarek
,
Pawel Dyda
,
Filip Gralinski
CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl Data.
ICDAR (3)
(2023)
Karol Kaczmarek
,
Jakub Pokrywka
,
Filip Gralinski
Using Transformer models for gender attribution in Polish.
FedCSIS
(2022)
Jakub Pokrywka
,
Filip Gralinski
,
Krzysztof Jassem
,
Karol Kaczmarek
,
Krzysztof Jurkiewicz
,
Piotr Wierzchon
Challenging America: Modeling language in longer time scales.
NAACL-HLT (Findings)
(2022)
Lukasz Borchmann
,
Dawid Wisniewski
,
Andrzej Gretkowski
,
Izabela Kosmala
,
Dawid Jurkiewicz
,
Lukasz Szalkiewicz
,
Gabriela Palka
,
Karol Kaczmarek
,
Agnieszka Kaliska
,
Filip Gralinski
Contract Discovery: Dataset and a Few-shot Semantic Retrieval Challenge with Competitive Baselines.
EMNLP (Findings)
(2020)
Lukasz Borchmann
,
Dawid Wisniewski
,
Andrzej Gretkowski
,
Izabela Kosmala
,
Dawid Jurkiewicz
,
Lukasz Szalkiewicz
,
Gabriela Palka
,
Karol Kaczmarek
,
Agnieszka Kaliska
,
Filip Gralinski
Searching for Legal Clauses by Analogy. Few-shot Semantic Retrieval Shared Task.
CoRR
(2019)