Parallel computation of information gain using Hadoop and MapReduce.
Eftim ZdravevskiPetre LameskiAndrea KulakovSonja FiliposkaDimitar TrajanovBoro JakimovskiPublished in: FedCSIS (2015)
Keyphrases
- parallel computation
- information gain
- map reduce
- parallel algorithm
- text categorization
- parallel processing
- decision trees
- mutual information
- feature selection
- parallel computing
- chi squared
- parallel programming
- parallel implementation
- chi square
- shared memory
- correlation coefficient
- cloud computing
- mapreduce framework
- occurrence frequency
- machine learning
- integrated circuit
- text classification
- distributed systems
- text mining
- active learning