I am working on a requirement where I have to check that any values which were present in previous profiling/monitoring project run inside column_A of table_1 should not be absent in current/latest run of column_A table_1 if those are present I have to highlight those, requesting community to help me with this.
Hi
Thank you for your question.
As Ataccama does not store the actual data permanently, the solution will have to be configures outside of Ataccama, as well.
- But first, please confirm my understanding - if the value inside column_A of table_1 of the last run is again present inside column_A of table_1 of the next run, should you flag it? Or is it the other way around, if it is NOT present again, you should flag it?
- Will you need to compare only the 2 closest runs or will you need the whole story of runs?
Looking forward to your answers!
Kind regards,
Ekaterina
Thanks for the reply, here are answers to your questions:
1. It is the other way around, if any data from previous run is not present in current run, then it needs to be flagged in entity containing previous data.
2. Yes, i need to compare only the 2 closest runs.
Hi
One of the possibilities could look like:
- You could create a lookup from column_A of table_1 after the DQ evaluation is run on top of it. Depending on how you plan to run Monitoring Projects, you can rebuild lookups manually or schedule it.
- Create a rule that will compare the data in the next run against value in the lookup.
Please note that this is only one of the possible options, which one you choose heavenly depends on the whole solution design and the motivation behind this requirement.
Please let me know if you have any further questions.
Kind regards,
Ekaterina
Does this make sense?
Kind regards,
Ekaterina
Reply
Login to the Ataccama Community
No account yet? Create an account
For Ataccama Customers and Partners
Enter your E-mail address. We'll send you an e-mail with instructions to reset your password.