Choosing a Data Storage Format in the Apache Hadoop System Based on Experimental Evaluation Using Apache Spark.
Vladimir BelovAndrey TatarintsevEvgeny NikulchevPublished in: Symmetry (2021)
Keyphrases
- data storage
- open source
- experimental evaluation
- data management
- map reduce
- open source software
- database management systems
- web server
- relational database systems
- cloud computing
- storage media
- source code
- aggregated data
- mailing lists
- data intensive
- data integrity
- metadata
- efficient implementation
- data access
- sensitive data
- distributed systems
- data warehouse
- association rules
- column oriented
- database systems