Curating Stopwords in Marathi: A TF-IDF Approach for Improved Text Analysis and Information Retrieval.
Rohan ChavanGaurav PatilVishal MadleRaviraj JoshiPublished in: CoRR (2024)
Keyphrases
- tf idf
- text analysis
- text documents
- information retrieval
- stop words
- text mining
- information extraction
- term weighting
- document clustering
- vector space model
- text collections
- term frequency
- text categorization
- retrieval model
- text classification
- natural language processing
- keywords
- wordnet
- topic models
- named entities
- bag of words
- ranking algorithm
- information retrieval systems
- knowledge discovery
- question answering
- web documents
- retrieval systems
- user queries
- language model
- preprocessing step
- knn
- feature selection
- search engine
- artificial intelligence