Unsupervised learning of mDTD extraction patterns for Web text mining.
Dongseok KimHanmin JungGary Geunbae LeePublished in: Inf. Process. Manag. (2003)
Keyphrases
- text mining
- unsupervised learning
- extraction patterns
- text documents
- information extraction
- web mining
- textual data
- information extraction systems
- manually annotated
- relation extraction
- web documents
- text classification
- natural language processing
- knowledge discovery
- supervised learning
- named entities
- web pages
- text corpora
- semi structured
- document classification
- topic models
- document clustering
- machine learning
- data mining
- feature selection
- heuristic rules
- information retrieval
- named entity recognition
- semi supervised
- artificial intelligence