Privacy- and Utility-Preserving NLP with Anonymized Data: A case study of Pseudonymization.
Oleksandr Yermilov, Vipul Raheja, Artem Chernodub
Published in: CoRR (2023)
Keyphrases
- anonymized data
- data anonymization
- differential privacy
- information loss
- data publishing
- natural language processing
- privacy protection
- privacy guarantees
- privacy preserving
- privacy preserving data mining
- information extraction
- privacy preservation
- data mining
- private data
- natural language
- categorical data
- personal information
- association rules
- sensitive attributes
- data sharing
- data quality
- data access
- language model
- machine learning
- database