Effect of OCR-errors on the transformation of semi-structured text data into relational database.
Kolyo Z. OnkovPublished in: AND (2009)
Keyphrases
- semi structured
- text data
- structured data
- text mining
- relational databases
- xml databases
- unstructured data
- xml documents
- keyword search
- information extraction
- text classification
- data extraction
- semi structured data
- free text
- text documents
- unstructured text
- data sources
- databases
- data model
- structured knowledge
- metadata
- web data sources
- textual data
- document collections
- database
- data mining
- natural language processing
- high dimensional
- xml data
- web documents
- machine learning
- data sets
- named entities
- knowledge discovery
- data analysis
- information retrieval