Mining Web Sites Using Wrapper Induction, Named Entities, and Post-processing.
Georgios Sigletos, Georgios Paliouras, Constantine D. Spyropoulos, Michael Hatzopoulos
Published in: EWMF (2003)
Keyphrases
- post-processing
- wrapper induction
- named entities
- information extraction
- text mining
- web news
- website
- named entity recognition
- semi-structured
- web mining
- preprocessing
- active learning
- natural language processing
- relation extraction
- text documents
- data mining process
- web documents
- question answering
- wrapper generation
- web information extraction
- information retrieval
- co-occurrence
- structured data
- machine learning
- data mining
- multi-view
- knowledge discovery
- sequential patterns
- human experts
- mining algorithm
- domain knowledge
- data analysis
- learning algorithm
- semi-supervised learning
- natural language
- feature extraction
- data sets