Zombie cheminformatics: extraction and conversion of Wiswesser Line Notation (WLN) from chemical documents.
Michael BlakeySamantha KanzaJeremy G. FreyPublished in: J. Cheminformatics (2024)
Keyphrases
- document collections
- information retrieval
- web documents
- information extraction
- automatic extraction
- relevant documents
- text documents
- line extraction
- legal documents
- metadata
- xml documents
- information retrieval systems
- document classification
- document clustering
- document retrieval
- digital documents
- keywords
- database
- knowledge extraction
- semantic relationships
- document set
- time stamped
- feature selection
- retrieval systems
- ranked list
- free text
- user queries
- vector space model
- text mining
- automatically extracted
- document structure
- machine learning