Classification of Japanese Documents and Ranking of Representative Documents by Using the Characteristic of the Frequencies of Words.
Jun KimuraYasunari YoshitomiMasayoshi TabusePublished in: J. Robotics Netw. Artif. Life (2015)
Keyphrases
- document classification
- text documents
- pre classified
- document collections
- information retrieval
- ranked list
- keywords
- relevant documents
- textual features
- expert finding
- word spotting
- information retrieval systems
- document representation
- metadata
- document ranking
- xml documents
- web documents
- document content
- classification algorithm
- retrieved documents
- machine learning
- document retrieval
- multiword
- training documents
- automatic text classification
- expert search
- relevance ranking
- document level
- user queries
- classification accuracy
- text corpus
- latent topics
- document analysis
- text classifiers
- vector space model
- retrieval systems
- topic models
- search engine