Login / Signup
Potrika: Raw and Balanced Newspaper Datasets in the Bangla Language with Eight Topics and Five Attributes.
Istiak Ahmad
Fahad A. Alqurashi
Rashid Mehmood
Published in:
CoRR (2022)
Keyphrases
</>
raw data
test set
information retrieval
indian languages
benchmark datasets
programming language
natural language
topic models
language learning
keywords
attribute values
related topics
categorical attributes
data sets
text documents
high level
latent dirichlet allocation
text data
key concepts
character segmentation