A digital corpus resource of authentic anonymized French text messages: 88milSMS - What about transcoding and linguistic annotation?
Rachel PanckhurstPublished in: Digit. Scholarsh. Humanit. (2017)
Keyphrases
- text messages
- hand crafted
- linguistic features
- annotated corpus
- natural language text
- linguistic information
- natural language
- linguistic patterns
- learning environment
- transform domain
- semantic annotation
- bitstream
- reference resolution
- natural language processing
- active learning
- automatic annotation
- image annotation
- resource allocation
- metadata
- manually annotated
- digital photos
- inter annotator agreement
- information loss
- video streams
- named entities
- information extraction
- privacy preservation
- multiword
- compressed video
- privacy protection
- sensitive information
- video data
- wordnet
- anonymized data