NLU Methodologies for Capturing Non-redundant Information from Multi-documents - A Survey.
Michael T. MillsNikolaos G. BourbakisPublished in: ICSOFT (2) (2010)
Keyphrases
- document collections
- information retrieval
- information retrieval systems
- data mining
- text documents
- web documents
- digital documents
- document classification
- document clustering
- relevant documents
- web data
- xml documents
- artificial intelligence
- latent semantic analysis
- neural network
- free text
- textual content
- expert finding
- time stamped
- latent semantic indexing
- text collections
- text retrieval
- ranked list
- vector space
- document retrieval
- database
- vector space model
- text analysis
- semantic information
- structured documents
- multimedia documents
- document analysis
- text categorization
- natural language
- machine learning
- document structure
- electronic documents
- data sets