-
Notifications
You must be signed in to change notification settings - Fork 80
Open
Description
I've used cvsdedupe to try and match up a list of ~77,000 unmapped entries to a master list of ~141,000 known things. It worked and has given a list of ~30,000 matches.
I've since done a bunch of manual work to not only check the ML mapping from csvdedupe, but also from some other sources, so I now have a definitive list of matches that I'd like to feed back to cvsdedupe before rerunning it to try and refine and improve my results. I can't figure out how to do that.
The format of the training.json
file seems pretty straight-forward, but I can't tell what marks something as a positive or negative match ... or even if that is what that file is about. Can anyone help me?
Metadata
Metadata
Assignees
Labels
No labels