Text data augmentation and pre-trained Language Model for enhancing text classification of low-resource languages.
Atabay A. A. Ziyaden, Amir Yelenov, Fuad Hajiyev, Samir Rustamov, Alexandr A. Pak
Published in: PeerJ Comput. Sci. (2024)
Keyphrases
- text data
- text classification
- language model
- pre-trained
- n-gram
- language modeling
- cross lingual
- statistical machine translation
- text mining
- text documents
- information retrieval
- text categorization
- probabilistic model
- bag of words
- translation model
- training data
- feature selection
- retrieval model
- speech recognition
- text classifiers
- machine learning
- labeled data
- query expansion
- training examples
- kNN
- document collections
- query terms
- unlabeled data
- high dimensional
- multimedia
- structured data
- k-nearest neighbor
- supervised learning
- natural language
- data sets