Maximizing Information Gain in Privacy-Aware Active Learning of Email Anomalies.
Mu-Huan ChungSharon LiJaturong KongmaneeLu WangYuhong YangCalvin GiangKhilan JerathAbhay RamanDavid LieMark H. ChignellPublished in: CoRR (2024)
Keyphrases
- information gain
- privacy aware
- active learning
- personal information
- text categorization
- decision trees
- anomaly detection
- feature selection
- chi squared
- mutual information
- privacy preservation
- training set
- unlabeled data
- naive bayes
- learning algorithm
- labeled data
- machine learning
- semi supervised learning
- supervised learning
- semi supervised
- databases
- data sets
- privacy policies
- location based services
- feature space
- text mining
- feature subset
- correlation coefficient
- frequent itemsets
- machine learning algorithms
- data analysis
- relevance feedback
- role based access control
- query engine
- information retrieval