Scaling Out and Evaluation of OBSecAn, an Automated Section Annotator for Semi-Structured Clinical Documents, on a Large VA Clinical Corpus.
Le-Thuy T. TranGuy DivitaAndrew ReddMarjorie E. CarterMatthew H. SamoreAdi V. GundlapalliPublished in: AMIA (2015)
Keyphrases
- semi structured
- web documents
- semi structured documents
- free text
- patient records
- structured data
- data collections
- web data
- information extraction
- data model
- semi structured data
- content and structure
- xml documents
- information integration
- data extraction
- text mining
- knowledge rich
- structured knowledge
- wrapper generation
- document collections
- information retrieval
- search interface
- information retrieval systems
- unstructured data
- expert search
- html pages
- artificial intelligence
- machine learning
- text data
- semantic information
- keywords
- metadata
- web data extraction
- database