Corpora and Data Preparation for Information Extraction.
Lynn CarlsonBoyan A. OnyshkevychMary Ellen OkurowskiPublished in: TIPSTER (1993)
Keyphrases
- data preparation
- information extraction
- natural language processing
- knowledge discovery
- machine learning
- knowledge discovery in databases
- preprocessing
- data quality
- web usage mining
- data mining
- web mining
- data analysis
- text mining
- pattern discovery
- information retrieval
- knowledge discovery and data mining
- feature selection
- structured data
- web documents
- semi structured
- instance selection
- named entities
- feature extraction
- classification accuracy
- data reduction
- wordnet
- natural language
- knowledge driven
- databases
- web pages
- data modeling
- rough sets