A Natural Language Processing Pipeline to Extract Phenotypic Data from Formal Taxonomic Descriptions with a Focus on Flagellate Plants.
Lorena EndaraJ. Gordon BurleighLaurel CooperPankaj JaiswalMarie-Angélique LaporteHong CuiPublished in: ICBO (2018)
Keyphrases
- data sets
- synthetic data
- processing pipeline
- database
- data analysis
- data collection
- data structure
- image data
- statistical analysis
- natural language
- association rules
- data sources
- databases
- high quality
- data quality
- raw data
- data distribution
- spatial data
- high level
- high dimensional data
- labeled data
- data processing
- data mining techniques
- knowledge discovery
- end users
- prior knowledge