A hybrid approach to recognize generic sections in scholarly documents.
Shoubin LiQing WangPublished in: Int. J. Document Anal. Recognit. (2021)
Keyphrases
- document collections
- information retrieval
- digital libraries
- information retrieval systems
- document classification
- web documents
- document retrieval
- relevant documents
- legal documents
- database
- electronic documents
- document structure
- free text
- document clustering
- domain specific
- metadata
- xml documents
- high level
- text documents
- digital documents
- xml format
- latent semantic analysis
- text analysis
- keywords
- web data
- semantic information
- retrieval systems
- automatic recognition
- vector space model
- retrieved documents
- multi document summarization
- neural network