Self-supervised Deep Reconstruction of Mixed Strip-shredded Text Documents.
Thiago M. PaixãoRodrigo Ferreira BerrielMaria C. S. BoeresAlessandro L. KoerichClaudine BadueAlberto Ferreira de SouzaThiago Oliveira-SantosPublished in: CoRR (2020)
Keyphrases
- feature selection
- text documents
- text categorization
- text classification
- text mining
- text data
- document classification
- news articles
- textual information
- bag of words
- document clustering
- tf idf
- keywords
- automatic text categorization
- text collections
- topic models
- wordnet
- information extraction
- text corpus
- named entities
- neural network
- pairwise
- multiscale
- knowledge base
- search engine
- information retrieval
- data mining