Word Level Language Identification in Code-Mixed Data using Word Embedding Methods for Indian Languages.
Inumella Chaitanya, Indeevar Madapakula, Subham Kumar Gupta, S. Thara
Published in: ICACCI (2018)
Keyphrases
- word level
- language identification
- document images
- indian languages
- english text
- document analysis
- word segmentation
- document image analysis
- language independent
- n gram
- word recognition
- character recognition
- machine translation
- optical character recognition
- neural network
- sentence level
- speaker identification
- text lines
- similarity search
- natural language processing
- image processing