TBGA: a large-scale Gene-Disease Association dataset for Biomedical Relation Extraction.
Stefano MarchesinGianmaria SilvelloPublished in: BMC Bioinform. (2022)
Keyphrases
- relation extraction
- biomedical literature
- automatic extraction
- information extraction
- entity extraction
- complex diseases
- manually annotated
- biological entities
- gene sets
- named entities
- domain specific
- text mining
- semantic role labeling
- semantic features
- broad coverage
- named entity recognition
- tree kernels
- semantic relations
- dependency trees
- information retrieval
- annotated corpus
- question answering
- linkage disequilibrium
- natural language processing
- microarray
- gene expression
- structured data
- higher level
- domain knowledge
- parse tree
- gene ontology
- machine learning