I have been trying to explore the Reprocess and Rematch options in MDM.
From this post I understand that Reprocess is used to reprocess the records. For example, lets say, after an initial load where the email values were not being modified in contact_clean.comp, we are now to determine which emails have the domain company-name.com and are now being updated to have domain companyname.com . So, I need to run the reprocess job.
Rematch is to run the match step on top of the reprocess since the email values have been modified. What I do not understand is do I run a rematch job only if the email value participates in determining the master record or do I need to run it whenever the reprocess jobs is run irrespective of the changes that took place due to a reprocess?
There is another question I have. When we run a load job, the change detection is done right. Does this change detection run on all the records or just the delta records. Post the change detection, matching and merging is run only on the records that changed or all the records?
I am looking for an answer because I scheduled a workflow to load records and export them, assuming that each time any new records are added or old records get modified, the master records will be updated(when there is a change in one of the related instance records) due to change detection. I did not change any code.
But a new record got added and matched to an already existing master record. However, the match_rule that was mentioned in the master record did not justify the matching. I ran a reprocess and rematch job and the new record became a different master record.
I am looking for an explanation on this and also a suggestion on if I need to run the rematch job everytime I am loading records and need to export them for downstream sources?
Usage of Rematch and Reprocess jobs
Reply
Login to the Ataccama Community
No account yet? Create an account
For Ataccama Customers and Partners
or
Enter your E-mail address. We'll send you an e-mail with instructions to reset your password.