The Notary in the Haystack - Countering Class Imbalance in Document Processing with CNNs.
Martin LeipertGeorg VogelerMathias SeuretAndreas K. MaierVincent ChristleinPublished in: DAS (2020)
Keyphrases
- document processing
- class imbalance
- digital libraries
- class distribution
- active learning
- document images
- cost sensitive
- information retrieval
- high dimensionality
- document analysis
- concept drift
- document clustering
- minority class
- multimedia documents
- textual documents
- text processing
- multimedia
- content based retrieval
- neural network
- machine learning
- k nearest neighbor
- natural language
- database systems