Optimizing Nepali PDF Extraction: A Comparative Study of Parser and OCR Technologies.
Prabin Paudel, Supriya Khadka, Ranju G. C., Rahul Shah
Published in: CoRR (2024)
Keyphrases
- probability density function
- natural language
- optical character recognition
- post processing
- emerging technologies
- automatic extraction
- error correction
- text extraction
- natural language processing
- character recognition
- web intelligence
- dependency parsing
- document images
- 21st century
- semantic analysis
- data mining
- web technologies
- comparative study
- statistical methods
- preprocessing
- natural language parsing