Login / Signup

On tuning parameters guiding similarity computations in a data deduplication pipeline for customers records: Experience from a R&D project.

Witold AndrzejewskiBartosz BebelPawel BoinskiRobert Wrembel
Published in: Inf. Syst. (2024)
Keyphrases
  • data sets
  • training data
  • record linkage
  • probability distribution
  • neural network
  • genetic algorithm
  • similarity measure
  • artificial neural networks
  • control system
  • np hard
  • data points
  • labeled data
  • regression model