Efficient Classification of Long Documents Using Transformers.
Hyunji Hayley Park, Yogarshi Vyas, Kashif Shah
Published in: CoRR (2022)
Keyphrases
- document classification
- decision trees
- text documents
- pattern recognition
- support vector machine
- automatic classification
- feature extraction
- document collections
- information retrieval
- feature selection
- keywords
- pattern classification
- classification method
- classification accuracy
- machine learning
- benchmark datasets
- image classification
- text classification
- document retrieval
- pre-classified
- document categorization
- database
- support vector
- training set
- feature vectors
- classification algorithm
- class labels
- information retrieval systems
- relevant documents
- preprocessing
- classification models
- machine learning algorithms
- classification scheme
- multi-document summarization
- text categorization
- automatic categorization
- classify documents
- training samples