Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data.
Neo Christopher ChungBlazej MiasojedowMichal StartekAnna GambinPublished in: BMC Bioinform. (2019)
Keyphrases
- data sets
- statistical methods
- database
- data analysis
- data mining methods
- high dimensional data
- significant improvement
- data mining techniques
- data collection
- test data
- spatial data
- statistical significance
- statistical tests
- data quality
- noisy data
- image data
- data structure
- user defined
- missing values
- network structure
- raw data
- knowledge discovery
- statistical analysis
- spectral clustering
- preprocessing
- training data
- similarity metric