TeClass: A Human-Annotated Relevance-based Headline Classification and Generation Dataset for Telugu.
Gopichand KanumoluLokesh MadasuNirmal SurangeManish ShrivastavaPublished in: CoRR (2024)
Keyphrases
- benchmark datasets
- classification systems
- decision trees
- classification method
- training dataset
- machine learning
- uci datasets
- classification accuracy
- feature extraction
- training set
- classification process
- pattern recognition
- text classification
- decision rules
- information retrieval
- automatic classification
- pattern classification
- feature space
- support vector
- feature vectors
- support vector machine svm
- machine learning methods
- classification rules
- image classification
- human subjects
- supervised learning
- search engine
- relevance feedback
- database