TeClass: A Human-Annotated Relevance-based Headline Classification and Generation Dataset for Telugu.
Gopichand KanumoluLokesh MadasuNirmal SurangeManish ShrivastavaPublished in: LREC/COLING (2024)
Keyphrases
- benchmark datasets
- pattern recognition
- uci datasets
- classification accuracy
- classification systems
- classification algorithm
- decision rules
- decision trees
- support vector
- supervised learning
- image classification
- svm classifier
- classification method
- preprocessing
- machine learning
- machine learning algorithms
- fold cross validation
- classification scheme
- pattern classification
- manually annotated
- training samples
- test collection
- unsupervised learning
- support vector machine
- feature space
- feature selection
- search engine
- information retrieval