Adaptable N-gram classification model for data leakage prevention.
Sultan Alneyadi, Elankayer Sithirasenan, Vallipuram Muthukkumarasamy
Published in: ICSPCS (2013)
Keyphrases
- n gram
- data leakage prevention
- language model
- language independent
- insider threat
- text classification
- language modeling
- viterbi algorithm
- variable length
- inside outside algorithm
- language modelling
- part of speech
- word segmentation
- web documents
- personal information
- privacy violations
- databases
- intellectual property
- text categorization
- natural language processing