Leveraging Language Identification to Enhance Code-Mixed Text Classification.
Gauri Takawane, Abhishek Phaltankar, Varad Patwardhan, Aryan Patil, Raviraj Joshi, Mukta S. Takalikar
Published in: CoRR (2023)
Keyphrases
- language identification
- text classification
- bag of words
- text mining
- text categorization
- speaker identification
- document images
- text data
- Indian languages
- n-gram
- text documents
- feature selection
- machine learning
- multi-label
- kNN
- semantic features
- unsupervised learning
- gaussian mixture model
- cross-lingual
- image processing