L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi.
Saloni MittalVidula MagdumOmkar DhekaneSharayu HiwarkhedkarRaviraj JoshiPublished in: CoRR (2024)
Keyphrases
- document classification
- short text
- short texts
- topic detection
- text classification
- short text classification
- text categorization
- text mining
- web documents
- text documents
- classification algorithm
- text data
- feature selection
- keywords
- n gram
- news articles
- data sets
- k nearest neighbor
- naive bayes
- sentiment classification
- latent topics
- document collections
- information extraction
- machine learning