Do Important Words in Bag-of-Words Model of Text Relatedness Help?
Aminul IslamEvangelos E. MiliosVlado KeseljPublished in: TSD (2015)
Keyphrases
- text documents
- keywords
- english words
- text corpus
- related words
- text recognition
- text clustering
- linguistic information
- text corpora
- proper nouns
- world knowledge
- text databases
- word pairs
- text mining
- short text
- chinese text
- syntactic analysis
- word co occurrence
- information retrieval
- syntactic categories
- multiword
- chinese texts
- co occurrence
- linguistic analysis
- text representation
- textual features
- noun phrases
- natural language text
- lexical features
- n gram
- lexical information
- lexical chains
- arabic text
- stop words
- semantic information
- computational linguistics
- arabic language
- semantic similarity
- word level
- text classification
- unknown words
- printed text
- word frequency
- document content
- word sense disambiguation
- link analysis
- semantic relationships
- semantically related
- word segmentation