BengaliLCP: A Dataset for Lexical Complexity Prediction in the Bengali Texts.
Nabila Ayman, Md. Akram Hossain, Abdul Aziz, Rokan Uddin Faruqui, Abu Nowshed Chy
Published in: LREC/COLING (2024)
Keyphrases
- prediction accuracy
- keywords
- linguistic information
- natural language text
- benchmark datasets
- prediction model
- feature set
- wordnet
- linguistic analysis
- space complexity
- prediction error
- probabilistic model
- text categorization
- domain specific
- synthetic datasets
- prediction algorithm
- statistical machine translation
- feature selection
- database