Keyphrases
- information extraction
- free text
- text mining
- document processing
- textual data
- text documents
- unstructured text
- text recognition
- text processing
- open domain
- information retrieval
- web documents
- information extraction systems
- natural language text
- text analysis
- printed documents
- natural language processing
- fuzzy sets
- fuzzy logic
- optical character recognition
- text extraction
- named entities
- structured data
- post processing
- character recognition
- fuzzy numbers
- named entity recognition
- text summarization
- document analysis
- precision and recall
- scanned documents
- preprocessing
- membership functions
- linguistic patterns
- ocr systems
- page layout
- machine learning
- document images
- web mining
- relation extraction
- recognition errors
- question answering
- entity extraction
- conditional random fields
- fuzzy rules
- semi structured
- text retrieval
- fuzzy clustering
- text data
- fuzzy controller
- textual information
- document clustering
- extraction rules
- semantic information
- error correction
- digital libraries
- data mining
- printed text