An Effective Bi-LSTM Word Embedding System for Analysis and Identification of Language in Code-Mixed social Media Text in English and Roman Hindi.
Shashi ShekharDilip Kumar SharmaM. M. Sufyan BegPublished in: Computación y Sistemas (2020)
Keyphrases
- indian languages
- english text
- social media
- source language
- target language
- machine translation
- english language
- language identification
- character n grams
- language specific
- document images
- word level
- natural language
- lexical information
- cross lingual
- language learning
- n gram
- document analysis
- native language
- machine translation system
- source code
- text to speech
- parallel corpus
- data analysis
- word order
- contextual features
- social networks
- syntactic categories
- statistical machine translation
- word segmentation
- word sense disambiguation
- text retrieval
- co occurrence