VisualWordGrid: Information Extraction From Scanned Documents Using A Multimodal Approach.
Mohamed KerroumiOthmane SayemAymen ShabouPublished in: CoRR (2020)
Keyphrases
- scanned documents
- information extraction
- document images
- noise removal
- text mining
- optical character recognition
- natural language processing
- text detection
- information retrieval
- named entities
- machine learning
- text documents
- textual data
- anisotropic diffusion
- web documents
- noise reduction
- computer vision
- scanned images