Automatic extraction of titles from general documents using machine learning.
Yunhua Hu, Hang Li, Yunbo Cao, Dmitriy Meyerzon, Qinghua Zheng
Published in: JCDL (2005)
Keyphrases
- automatic extraction
- machine learning
- natural language text
- relation extraction
- html documents
- information retrieval
- special case
- biomedical literature
- pattern recognition
- text mining
- database
- document retrieval
- wrapper generation
- data analysis
- text documents
- document collections
- knowledge acquisition
- supervised learning
- text classification
- natural language processing
- xml documents
- machine learning methods
- vector space
- web data
- reinforcement learning
- metadata
- computer vision
- learning algorithm
- information extraction