A dockerized framework for hierarchical frequency-based document clustering on cloud computing infrastructures.
Maria Th. Kotouza, Fotis E. Psomopoulos, Pericles A. Mitkas
Published in: J. Cloud Comput. (2020)
Keyphrases
- cloud computing
- document clustering
- data management
- data center
- computing resources
- computing paradigm
- cloud platform
- map reduce
- clustering method
- document collections
- cloud computing environment
- database systems
- database
- text classification
- text mining
- digital libraries
- similarity measure
- machine learning
- computing infrastructure
- databases