Text data compression ratio as a text attribute for a language-independent text art extraction method.
Tetsuya Suzuki, Kazuyuki Hayashi
Published in: ICDIM (2010)
Keyphrases
- text data
- language independent
- text classification
- text mining
- text documents
- text retrieval
- compression ratio
- high dimensional
- image quality
- information retrieval
- structured data
- bag of words
- data sets
- high dimensional data
- n gram
- compression algorithm
- labeled data
- document collections
- databases
- image compression
- association rules
- data analysis
- keywords
- high quality