Recognising document components in XML-based academic articles.
Angelo Di IorioSilvio PeroniFrancesco PoggiFabio VitaliDavid M. ShottonPublished in: ACM Symposium on Document Engineering (2013)
Keyphrases
- google scholar
- scientific documents
- citation analysis
- keywords
- pdf documents
- document classification
- text documents
- document images
- retrieval systems
- web documents
- information retrieval systems
- information retrieval
- extensible markup language
- xml data
- wikipedia articles
- document collections
- software components
- news articles
- document clustering
- vector space model
- markup language
- scientific articles
- building blocks
- activity recognition