Non-disjoint grouping of text documents based Word Sequence Kernel.
Chiheb-Eddine Ben N'cir, Afef Zenned, Nadia Essoussi
Published in: EGC (2013)
Keyphrases
- text documents
- text corpus
- keywords
- text mining
- text classification
- information extraction
- topic models
- WordNet
- text categorization
- latent topics
- term frequency
- document classification
- document clustering
- news articles
- named entities
- text collections
- co-occurrence
- bag of words
- word sense disambiguation
- n-gram
- tf-idf
- kernel function
- support vector
- extraction patterns
- machine learning
- natural language text
- search engine
- action recognition