Inter-document reference detection as an alternative to full text semantic analysis in document clustering.
Patrick A. De Mazière, Marc M. Van Hulle
Published in: MLSP (2013)
Keyphrases
- document clustering
- semantic analysis
- document collections
- document clusters
- document representation
- semantic information
- text documents
- tf-idf
- text mining
- natural language
- natural language processing
- information retrieval systems
- clustering method
- topic extraction
- document similarity
- clustering algorithm
- tolerance rough set
- vector space model
- automatic categorization
- similar documents
- k-means
- syntactic analysis
- digital libraries
- retrieval systems
- topic models
- text classification
- co-occurrence
- low level