LinkedMDR: un modèle sémantique de représentation de corpus de documents multimédia.
Nathalie CharbelChristian SallaberrySébastien LaborieGilbert TekliRichard ChbeirPublished in: INFORSID (2017)
Keyphrases
- word frequencies
- newspaper articles
- person names
- document level
- information retrieval
- similar documents
- document collections
- text corpus
- text corpora
- text documents
- multiword
- information retrieval systems
- natural language text
- topic segmentation
- text data
- xml documents
- document retrieval
- training corpus
- sentence level
- metadata
- document corpus
- training documents
- web documents
- document classification
- text collections
- keywords
- document clustering
- document representation
- text classification
- parallel corpora
- relevant documents
- parallel corpus
- plain text
- query terms
- document space
- scientific papers
- vector space model
- user queries
- retrieval systems
- language model
- text mining
- linguistic information
- word frequency
- human judgments
- word pairs
- noun phrases
- wikipedia articles