Word Level Language Identification in Assamese-Bengali-Hindi-English Code-Mixed Social Media Text.
Neelakshi Sarma, Sanasam Ranbir Singh, Diganta Goswami
Published in: IALP (2018)
Keyphrases
- language identification
- word level
- indian languages
- document images
- english text
- document analysis
- word segmentation
- machine translation
- language independent
- speaker identification
- text lines
- optical character recognition
- character recognition
- n gram
- named entity recognition
- sentence level
- machine learning
- cross lingual
- text classification