Semantic Based Text Classification of Patent Documents to a User-Defined Taxonomy.
Ashish SurekaPranav Prabhakar MirajkarPrasanna Nagesh TeliGirish AgarwalSumit Kumar BosePublished in: ADMA (2009)
Keyphrases
- user defined
- text classification
- patent documents
- prior art
- intellectual property
- patent information
- patent search
- feature selection
- bag of words
- n gram
- text categorization
- text mining
- data types
- machine learning
- text documents
- text data
- knn
- databases
- query language
- automatically extracted
- structured documents
- information retrieval
- data mining
- semantic analysis
- semantic features
- term frequency
- feature extraction
- data structure
- database
- data points