Uncovering hidden duplicated content in public transcriptomics data.
Marta Rosikiewicz, Aurélie Comte, Anne Niknejad, Marc Robinson-Rechavi, Frederic B. Bastian
Published in: Database J. Biol. Databases Curation (2013)
Keyphrases
- data sets
- synthetic data
- data collection
- data analysis
- complex data
- multimedia
- data records
- database
- data repositories
- data mining applications
- sensor data
- statistical analysis
- knowledge discovery
- databases
- high quality
- image data
- neural network
- raw data
- multimedia data
- data distribution
- small number
- data points
- training data
- xml documents
- data acquisition
- data sources
- statistical methods
- data processing
- data objects
- noisy data
- database systems
- information systems