Login / Signup
Distributed Record Linkage in Healthcare Data with Apache Spark.
Mohammad Heydari
Reza Sarshar
Mohammad Ali Soltanshahi
Published in:
CoRR (2024)
Keyphrases
</>
record linkage
raw data
data sets
data quality
data cleaning
database
distributed data
information technology
end users
data processing
data sources
original data
data analysis
open source
collected data
source code
data integration
missing values
xml documents
information loss
information retrieval