A Rule-based Natural Language Processing System in Tagging and Categorizing Phenotype Variables in NCBI's database of Genotypes and Phenotypes (dbGaP).
Son Doan, Ko-Wei Lin, Rebecca Walker, Seena Farzaneh, Neda Alipanah, Hyeoneui Kim
Published in: AMIA (2013)
Keyphrases
- database
- natural language processing
- database systems
- databases
- gene expression
- information extraction
- metadata
- machine learning
- complex diseases
- data management
- genome wide
- question answering
- text mining
- data model
- relational databases
- data driven
- database management systems
- biologically meaningful
- computational linguistics
- caenorhabditis elegans
- single nucleotide polymorphisms
- tag recommendation
- text summarization
- part of speech
- causal models
- protein sequences
- rule base
- high throughput
- random variables
- database applications
- text categorization
- expert systems
- natural language