Taec: a Manually annotated text dataset for trait and phenotype extraction and entity linking in wheat breeding literature.
Claire NédellecClara SauvionRobert BossyMariya BorovikovaLouise DelégerPublished in: CoRR (2024)
Keyphrases
- manually annotated
- extraction patterns
- relation extraction
- ground truth
- entity linking
- automatic extraction
- information extraction
- text mining
- unstructured text
- domain knowledge
- scientific literature
- text fragments
- biomedical literature
- named entities
- knowledge base
- semantic relations
- topic modeling
- text documents
- question answering
- knowledge discovery
- visual information
- data mining
- semantic search
- wordnet
- feature set
- natural language
- keywords
- information retrieval