Lukas Edman
Publication Activity (10 Years)
Years Active: 2019-2024
Publications (10 Years): 17
Top Topics
Machine Translation
Cross Lingual
Source Language
Morphological Segmentation
Top Venues
CoRR
WMT@EMNLP
WMT (2)
SemEval@NAACL
Publications
Lukas Edman, Gabriele Sarti, Antonio Toral, Gertjan van Noord, Arianna Bisazza
Are Character-level Translations Worth the Wait? Comparing ByT5 and mT5 for Machine Translation.
Trans. Assoc. Comput. Linguistics
12 (2024)
Lukas Edman, Antonio Toral, Gertjan van Noord
Are Character-level Translations Worth the Wait? An Extensive Comparison of Character- and Subword-level Models for Machine Translation.
CoRR
(2023)
Konstantin Chernyshev, Ekaterina Garanina, Duygu Bayram, Qiankun Zheng, Lukas Edman
LCT-1 at SemEval-2023 Task 10: Pre-training and Multi-task Learning for Sexism Detection and Classification.
SemEval@ACL
(2023)
Konstantin Chernyshev, Ekaterina Garanina, Duygu Bayram, Qiankun Zheng, Lukas Edman
LCT-1 at SemEval-2023 Task 10: Pre-training and Multi-task Learning for Sexism Detection and Classification.
CoRR
(2023)
Lukas Edman, Lisa Bylinina
Too Much Information: Keeping Training Simple for BabyLMs.
CoRR
(2023)
Lukas Edman, Antonio Toral, Gertjan van Noord
Subword-Delimited Downsampling for Better Character-Level Translation.
EMNLP (Findings)
(2022)
Lukas Edman, Antonio Toral, Gertjan van Noord
Patching Leaks in the Charformer for Efficient Character-Level Generation.
CoRR
(2022)
Lukas Edman, Antonio Toral, Gertjan van Noord
The Importance of Context in Very Low Resource Language Modeling.
CoRR
(2022)
Wessel Poelman, Gijs Danoe, Esther Ploeger, Frank van den Berg, Tommaso Caselli, Lukas Edman
RUG-1-Pegasussers at SemEval-2022 Task 3: Data Generation Methods to Improve Recognizing Appropriate Taxonomic Word Relations.
SemEval@NAACL
(2022)
Lukas Edman, Antonio Toral, Gertjan van Noord
Subword-Delimited Downsampling for Better Character-Level Translation.
CoRR
(2022)
Lukas Edman, Antonio Toral, Gertjan van Noord
The Importance of Context in Very Low Resource Language Modeling.
ICON
(2021)
Lukas Edman, Ahmet Üstün, Antonio Toral, Gertjan van Noord
Unsupervised Translation of German-Lower Sorbian: Exploring Training and Novel Transfer Methods on a Low-Resource Language.
WMT@EMNLP
(2021)
Lukas Edman, Ahmet Üstün, Antonio Toral, Gertjan van Noord
Unsupervised Translation of German-Lower Sorbian: Exploring Training and Novel Transfer Methods on a Low-Resource Language.
CoRR
(2021)
Christian Roest, Lukas Edman, Gosse Minnema, Kevin Kelly, Jennifer Spenader, Antonio Toral
Machine Translation for English-Inuktitut with Segmentation, Data Acquisition and Pre-Training.
WMT@EMNLP
(2020)
Lukas Edman, Antonio Toral, Gertjan van Noord
Data Selection for Unsupervised Translation of German-Upper Sorbian.
WMT@EMNLP
(2020)
Lukas Edman, Antonio Toral, Gertjan van Noord
Low-Resource Unsupervised NMT: Diagnosing the Problem and Providing a Linguistically Motivated Solution.
EAMT
(2020)
Antonio Toral, Lukas Edman, Galiya Yeshmagambetova, Jennifer Spenader
Neural Machine Translation for English-Kazakh with Morphological Segmentation and Synthetic Data.
WMT (2)
(2019)