BioInfer: a corpus for information extraction in the biomedical domain.
Sampo PyysaloFilip GinterJuho HeimonenJari BjörneJorma BobergJouni JärvinenTapio SalakoskiPublished in: BMC Bioinform. (2007)
Keyphrases
- information extraction
- genia corpus
- text mining
- open domain
- named entities
- named entity recognition
- information extraction systems
- specific domains
- entity extraction
- relation extraction
- domain specific
- natural language text
- precision and recall
- semi structured
- manually annotated
- natural language processing
- machine learning
- relational learning
- free text
- ontology based information extraction
- information retrieval
- question answering
- linguistic patterns
- domain ontology
- domain independent
- biomedical literature
- web documents
- text summarization
- textual data
- machine translation
- conditional random fields
- word sense disambiguation
- structured data
- biomedical images
- cross domain
- probabilistic model
- data analysis
- natural language
- web mining