Document Representation Based on Maximal Frequent Sequence Sets.
Edith Hernández-ReyesJosé Francisco Martínez TrinidadJesús Ariel Carrasco-OchoaRené Arnulfo García-HernándezPublished in: CIARP (2006)
Keyphrases
- document representation
- maximal frequent
- bag of words
- vector space model
- document collections
- frequent sets
- document clustering
- web documents
- language model
- vector space
- data fusion
- text documents
- semantic information
- frequent patterns
- condensed representations
- background knowledge
- text classification
- image classification
- itemset mining
- databases
- semantic similarity
- text data
- frequently occurring
- feature vectors
- maximal frequent itemsets
- information retrieval