The WDC Training Dataset and Gold Standard for Large-Scale Product Matching.
Anna Primpeli, Ralph Peeters, Christian Bizer
Published in: WWW (Companion Volume) (2019)
Keyphrases
- gold standard
- training dataset
- ground truth
- semi-automatic
- training data
- data samples
- training set
- manual segmentation
- training samples
- pattern matching
- life cycle
- matching algorithm
- image matching
- mechanical turk
- feature points
- class labels
- support vectors
- matching process
- medical images
- decision trees
- learning algorithm
- imbalanced datasets
- data sets