Mixed Graph of Terms: Beyond the Bags of Words Representation of a Text.
Massimo De SantoPaolo NapoletanoAntonio PietrosantoConsolatina LiguoriVincenzo PacielloFrancesco PolesePublished in: HICSS (2012)
Keyphrases
- text representation
- related words
- text documents
- keywords
- graph representation
- multiword
- linguistic information
- n gram
- text recognition
- index terms
- training corpus
- english words
- bag of words
- co occurrence
- chinese text
- vector space model
- text corpus
- textual features
- semantically related
- related documents
- document level
- word pairs
- plain text
- graphical representation
- lexical chains
- text collections
- text retrieval
- character n grams
- proper names
- text mining
- information retrieval
- printed text
- chinese texts
- short text
- textual descriptions
- text corpora
- noun phrases
- weighted graph
- word sense disambiguation
- visual words
- structured data
- image representation
- text classification
- feature vectors