Managing Variant Calling Files the Big Data Way: Using HDFS and Apache Parquet.
Aikaterini BoufeaRichard FinkersMartijn van KaauwenMark KramerIoannis N. AthanasiadisPublished in: BDCAT (2017)
Keyphrases
- big data
- open source
- cloud computing
- data management
- data analysis
- social media
- data visualization
- big data analytics
- file management
- data intensive
- knowledge discovery
- data processing
- business intelligence
- open source software
- high volume
- massive data
- vast amounts of data
- web server
- unstructured data
- databases
- file system
- database
- open source projects
- data warehousing
- health informatics
- data science
- data mining
- text mining
- query processing
- database systems
- decision making