Clustering and Retrieval of Spanish News Documents Using Self Organizing Maps.
Javier FernándezRicardo MonesIrene DíazJosé RanillaElías F. CombarroPublished in: CLEF (Working Notes) (2003)
Keyphrases
- document clustering
- information retrieval
- information retrieval systems
- document retrieval
- retrieval systems
- topic detection
- content similarity
- document indexing
- structured documents
- document collections
- clustering algorithm
- k means
- document clusters
- automatic categorization
- document analysis
- document level
- keywords
- document content
- cross media
- index terms
- clustering method
- person names
- retrieval engine
- search interface
- web documents
- news stories
- image retrieval
- multimedia documents
- document structure
- retrieval process
- retrieval strategies
- expert finding
- heterogeneous collections
- news articles
- question answering
- multimedia
- relevant documents
- text documents
- text retrieval
- news items
- monolingual retrieval
- xml documents
- related documents
- query terms
- retrieved documents
- news video
- retrieval model
- text collections
- relevance feedback
- handwritten documents
- document representation
- test collection
- query specific
- natural language processing
- language model
- query expansion
- text categorization
- term frequency
- vector space model
- distributed information retrieval
- semantic content
- online news
- cross language