Layout-aware text extraction from full-text PDF of scientific articles.
Cartic Ramakrishnan, Abhishek Patnia, Eduard H. Hovy, Gully A. P. C. Burns
Published in: Source Code Biol. Medicine (2012)
Keyphrases
- text extraction
- scientific articles
- scientific literature
- probability density function
- topic modeling
- digital libraries
- text processing
- natural scenes
- information retrieval systems
- complex background
- document classification
- text recognition
- text segmentation
- text information
- retrieval systems
- information retrieval
- optical character recognition
- artificial intelligence
- action recognition
- prior knowledge
- natural language
- bayesian networks