A Most Resource-Consuming Disease Estimation Method from Electronic Claim Data Based on Labeled LDA.
Yasutaka HatakeyamaTakahiro OgawaHironori IkedaMiki HaseyamaPublished in: IEICE Trans. Inf. Syst. (2016)
Keyphrases
- synthetic data
- noisy data
- input data
- data sets
- missing data
- statistical methods
- preprocessing
- prior knowledge
- data sources
- data analysis
- data structure
- training data
- probability distribution
- knowledge discovery
- training samples
- missing values
- test data
- support vector machine
- density estimation
- estimation algorithm
- clustering method
- support vector machine svm
- active learning
- training set
- similarity measure
- feature extraction