Un nouveau schéma de pondération pour la catégorisation de documents manuscrits.
Sebastián Peña SaldarriagaEmmanuel MorinChristian Viard-GaudinPublished in: TALN (Articles courts) (2009)
Keyphrases
- information retrieval
- document classification
- xml documents
- document collections
- relevant documents
- document analysis
- web documents
- text documents
- keywords
- information retrieval systems
- metadata
- document clustering
- legal documents
- query expansion
- query terms
- time stamped
- retrieved documents
- latent semantic analysis
- multi document summarization
- word spotting
- plagiarism detection
- electronic documents
- data sets
- semantic relationships
- document representation
- vector space model
- ranked list
- web data
- retrieval systems
- information extraction
- relational databases
- neural network