Automatic extraction of titles from general documents using machine learning.
Yunhua Hu, Hang Li, Yunbo Cao, Li Teng, Dmitriy Meyerzon, Qinghua Zheng
Published in: Inf. Process. Manag. (2006)
Keyphrases
- automatic extraction
- machine learning
- natural language text
- html documents
- special case
- document collections
- relation extraction
- information retrieval
- pattern recognition
- web documents
- information retrieval systems
- biomedical literature
- document retrieval
- information extraction
- knowledge discovery
- machine learning algorithms
- text retrieval
- document classification
- term extraction
- electronic documents
- artificial intelligence
- feature selection
- document analysis
- metadata
- decision trees
- computer science
- machine learning methods
- xml documents
- knowledge representation
- knowledge acquisition
- database
- domain specific
- natural language processing