Programming Language Identification in Stack Overflow Post Snippets with Regex Based Tf-Idf Vectorization over ANN.
Aman Swaraj, Sandeep Kumar
Published in: ENASE (2023)
Keyphrases
- tf idf
- language identification
- stack overflow
- plain text
- information retrieval
- document images
- text documents
- document clustering
- text categorization
- retrieval model
- vector space model
- ranking algorithm
- neural network
- news articles
- helping users
- search engine
- web search engines
- gaussian mixture model
- feature space
- metadata
- feature selection
- natural language