Measuring Peculiarity of Text Using Relation between Words on the Web.
Takeru NakabayashiTakayuki YumotoManabu NiiYutaka TakahashiKazutoshi SumiyaPublished in: ICADL (2010)
Keyphrases
- textual features
- text documents
- web documents
- keywords
- text information
- website
- information retrieval and extraction
- english words
- text recognition
- web pages
- textual data
- chinese text
- related words
- short text
- web applications
- proper nouns
- printed text
- document representation
- text content
- multilingual documents
- semantic web
- text mining
- plain text
- newspaper articles
- text corpus
- dependency relations
- word pairs
- multiword
- noun phrases
- text retrieval
- syntactic categories
- arabic text
- information retrieval
- punctuation marks
- text representation
- text databases
- concept space
- user generated content
- word sense disambiguation
- n gram
- information extraction
- lexical features
- arabic language
- outlier detection
- topic models
- word segmentation
- co occurrence
- search engine