The matching component is slow at the extended unifications step for a incremental load. When a record like āWALGREEN%ā is loaded and there are 20K records with similar name on the instance, the records are all considered through the process of matching. Hence the process is slow when processing the increment and even if the incremental load has 11 records to process 30K records have the transaction Id and omni_modified date updated with the work order related on the instance table. When I check history, nothing changes on these records except the omni_modified_date and transaction_id to that of the work order. Tweaking the union keys to create combination of name and key factors or adding separate groups for the high count records is not helping. Any thoughts or suggestions?
Solved
DQC match - slow incremental load
Best answer by AKislyakov
Hi,
Usually, the solution is indeed to introduce additional components to union keys to minimize number of records fetched from the repository on expansion phase.
- Can you share execution logs, preferably with Increased unification step logging level? (To increase unification step logging level set:
unify.verbose=4
parameter.) - Also, you can check cardinality of resulting keys, by querying <REPO>_keys table, to ensure that introduction of new key factors reduced the cardinality.
- Finally, what version are you using? and how many operations are there in the Extended Unification step?āā
Reply
Login to the Ataccama Community
No account yet? Create an account
For Ataccama Customers and Partners
or
Enter your E-mail address. We'll send you an e-mail with instructions to reset your password.