Keyphrases
- document analysis
- rewriting systems
- recognition rate
- rewriting rules
- graph structure
- document images
- object recognition
- graph theory
- database
- graph representation
- web documents
- recognition algorithm
- recognition accuracy
- information retrieval
- information retrieval systems
- feature extraction
- structured data
- preprocessing stage
- graph databases
- structured documents
- social networks
- document collections
- document classification
- html documents
- pattern recognition
- graph mining
- regular path queries
- rewrite rules
- text lines
- retrieval systems
- document representation
- random walk
- character recognition
- document clustering
- directed graph
- document retrieval