Predicting lexical complexity in English texts: the CompLex 2.0 dataset.

Matthew Shardlow, Richard Evans, Marcos Zampieri

Published in: Lang. Resour. Evaluation (2022)

Keyphrases

linguistic analysis
natural language
real world
context sensitive
Brazilian Portuguese
lexical information
linguistic information
linguistic features
word sense
natural language text
domain specific
machine translation
benchmark datasets
worst case
training corpus
high level
feature set
natural language processing
English text
keywords
manually constructed
machine learning