Hot topic detection and technology trend tracking for patents utilizing term frequency and proportional document frequency and semantic information.
Khanh-Ly NguyenByung-Joo ShinSeong Joon YooPublished in: BigComp (2016)
Keyphrases
- semantic information
- term frequency
- document frequency
- document representation
- text categorization
- tf idf
- vector space model
- wordnet
- text documents
- retrieval model
- keywords
- bag of words
- text classification
- information gain
- feature selection
- average precision
- term weighting
- domain knowledge
- background knowledge
- semantic similarity
- low level
- high level
- information retrieval
- retrieved documents
- contextual information
- n gram
- databases
- metadata
- document clustering
- image content
- artificial intelligence
- test collection
- document collections
- query expansion
- probabilistic model
- xml documents