Machine-learning classifiers for logographic name matching in public health applications: approaches for incorporating phonetic, visual, and keystroke similarity in large-scale probabilistic record linkage.
Philip A. CollenderZhiyue Tom HuCharles LiQu ChengXintong LiYue YouSong LiangChanghong YangJustin V. RemaisPublished in: CoRR (2020)
Keyphrases
- public health
- record linkage
- machine learning
- machine learning algorithms
- machine learning approaches
- machine learning methods
- approximate matching
- entity resolution
- similarity measure
- duplicate detection
- privacy preserving
- data sets
- data mining
- data cleaning
- policy makers
- decision trees
- artificial intelligence
- data integration
- expert systems