Information Extraction from Arabic and Latin scanned invoices.
Najoua Rahal, Maroua Tounsi, Mohamed Ben Jlaiel, Adel M. Alimi
Published in: ASAR (2018)
Keyphrases
- information extraction
- optical character recognition
- document images
- language identification
- handwritten documents
- Arabic documents
- natural language processing
- text mining
- precision and recall
- scanned documents
- free text
- named entities
- web documents
- text processing
- named entity recognition
- relation extraction
- Arabic language
- semi-structured
- word forms
- machine learning
- information extraction systems
- text documents
- open domain
- natural language
- machine translation
- relational learning
- structured data
- morphological analysis
- information retrieval
- extracting meaningful
- semantic tagging
- ontology based information extraction
- natural language text
- web mining
- data extraction
- text summarization
- textual data
- character recognition
- question answering
- line extraction
- conditional random fields
- feature extraction