Investigating Random Undersampling and Feature Selection on Bioinformatics Big Data.
Tawfiq HasaninTaghi M. KhoshgoftaarJoffrey L. LeevyNaeem SeliyaPublished in: BigDataService (2019)
Keyphrases
- big data
- feature selection
- data analysis
- machine learning
- cloud computing
- knowledge discovery
- unstructured data
- data management
- data mining
- data intensive
- big data analytics
- data processing
- business intelligence
- text mining
- social media
- high volume
- data science
- class imbalance
- massive data
- model selection
- text categorization
- huge data
- text classification
- vast amounts of data
- data warehousing
- massive datasets
- query processing
- databases
- relational databases
- high dimensionality
- decision support