Fake Document Generation for Cyber Deception by Manipulating Text Comprehensibility.
Prakruthi KarunaHemant PurohitSushil JajodiaRajesh GanesanÖzlem UzunerPublished in: IEEE Syst. J. (2021)
Keyphrases
- text documents
- digital documents
- web documents
- document analysis
- information retrieval
- keywords
- document processing
- text generation
- document content
- text content
- text clustering
- text mining
- document images
- textual documents
- textual content
- technical papers
- multimedia documents
- latent semantic analysis
- extractive summarization
- scientific documents
- document structure
- text corpus
- retrieval engine
- printed documents
- generation process
- text collections
- scientific papers
- database
- structured documents
- information extraction
- document categorization
- keyword extraction
- semantic information
- document clustering
- document representation
- text summarization
- text classifiers
- handwritten text
- pdf files
- topic models
- page layout analysis
- text retrieval
- text representation
- electronic documents
- text data
- web pages
- document set
- document collections
- document level
- authorship attribution
- content and structure
- scanned documents
- free text
- document classification
- temporal expressions
- text categorization
- vector space model
- document corpus
- language model
- textual data
- cross references
- retrieval systems