Exploring the feasibility and accuracy of Latent Semantic Analysis based text mining techniques to detect similarity between patent documents and scientific publications.
Tom MagermanBart Van LooyXiaoyan SongPublished in: Scientometrics (2010)
Keyphrases
- text mining
- scientific publications
- latent semantic analysis
- document clustering
- topic modeling
- patent documents
- information retrieval
- latent dirichlet allocation
- co occurrence
- information extraction
- scientific literature
- similarity measure
- knowledge discovery
- latent semantic indexing
- text documents
- topic models
- data mining
- prior art
- machine learning
- singular value decomposition
- distance measure
- natural language processing
- data analysis
- metadata
- named entities
- web documents
- tf idf
- text summarization
- text classification