LIRMM@DEFT-2018 - Modèle de classification de la vectorisation des documents (LIRMM DEFT-2018 - Document Vectorization Classification Model).
Waleed Mohamed Azmy, Bilel Moulahi, Sandra Bringay, Maximilien Servajean
Published in: CORIA-TALN-RJC (DeFT) (2018)
Keyphrases
- document classification
- document collections
- text documents
- automatic categorization
- web documents
- text classification
- document clustering
- information retrieval
- document retrieval
- information retrieval systems
- classify documents
- classification algorithm
- text categorization
- relevant documents
- structured documents
- document categorization
- document analysis
- text clustering
- text classifiers
- document similarity
- document representation
- vector space model
- automatic classification
- document processing
- semi structured documents
- training documents
- digital documents
- retrieved documents
- automatic document classification
- retrieval systems
- feature selection
- electronic documents
- keywords
- multimedia documents
- term frequency
- inverted index
- document content
- automatic text classification
- document repository
- text mining
- document images
- text collections
- textual content
- document structure
- scanned documents
- document summarization
- latent topics
- document space
- document type
- wordnet
- document ranking
- textual documents
- document level