Predicting lexical complexity in English texts: the Complex 2.0 dataset.
Matthew ShardlowRichard EvansMarcos ZampieriPublished in: Lang. Resour. Evaluation (2022)
Keyphrases
- linguistic analysis
- natural language
- real world
- context sensitive
- brazilian portuguese
- lexical information
- linguistic information
- linguistic features
- word sense
- natural language text
- domain specific
- machine translation
- benchmark datasets
- worst case
- training corpus
- high level
- feature set
- natural language processing
- english text
- keywords
- manually constructed
- machine learning