A Scalable Dataset Indexing Infrastructure for the bioCADDIE Data Discovery System.
Jeffrey S. GretheIbrahim Burak ÖzyurtHua XuXiaoling ChenRuiling LiuErgin SoysalAnupama E. GururajHyeon-Eui KimTrevor CohenTodd R. JohnsonMandana SalimiSaeid PournejatiMin JiangClaudiu FarcasAlejandra N. González-BeltránPhilippe Rocca-SerraMuhamamd F. AmithCui TaoIan ForeRonald MargolisGeorge AlterSusanna-Assunta SansoneLucila Ohno-MachadoPublished in: AMIA (2016)
Keyphrases
- data collection
- database
- data sets
- data processing
- original data
- training data
- missing data
- data points
- data sources
- data analysis
- knowledge discovery
- data mining techniques
- high quality
- test data
- statistical analysis
- data distribution
- sensor data
- attribute values
- network infrastructure
- synthetic data
- multi dimensional
- small number
- data streams
- decision trees
- information retrieval
- neural network