Fouille de collections de documents en vue d'une caractérisation thématique de connaissances textuelles.
Abdenour MokraneGérard DrayPascal PonceletPublished in: EGC (Ateliers) (2005)
Keyphrases
- document collections
- information retrieval
- data collections
- heterogeneous collections
- metadata
- text collections
- digital libraries
- information retrieval systems
- web documents
- document retrieval
- relevant documents
- xml documents
- similar documents
- data sets
- digital collections
- text retrieval
- digital documents
- document classification
- distributed information retrieval
- legal documents
- controlled vocabulary
- effective retrieval
- document archives
- automatic text classification
- digital objects
- document representation
- vector space model
- free text
- document clustering
- document analysis
- multimedia documents
- textual documents
- term weighting schemes
- text documents
- retrieval systems
- collection selection
- semantic information
- information extraction
- search engine