BanLemma: A Word Formation Dependent Rule and Dictionary Based Bangla Lemmatizer.
Sadia AfrinMd. Shahad Mahmud ChowdhuryMd. Ekramul IslamFaisal Ahamed KhanLabib Imam ChowdhuryMd. Motahar MahtabNazifa Nuha ChowdhuryMassud ForkanNeelima KunduHakim ArifMohammad Mamun Or RashidMohammad Ruhul AminNabeel MohammedPublished in: CoRR (2023)
Keyphrases
- word segmentation
- statistical machine translation
- handwritten documents
- string matching
- cross language information retrieval
- indian languages
- association rules
- test set
- co occurrence
- n gram
- keywords
- scene images
- character segmentation
- translation model
- word recognition
- rule sets
- query translation
- word pairs
- word sense disambiguation
- printed documents
- machine translation
- handwritten numerals
- information retrieval systems
- face recognition
- outdoor images