Question

How to use a subset of records when executing and processing DQ results instead of the whole set of records?

  • April 7, 2025
  • 1 reply
  • 21 views

We want to evaluate the Data Quality score of a rule after applying a filter condition, so that the DQ score is calculated on the subset of records the rule applies to rather than on the whole set.

For example, we want to execute the DQ checks only on the records that satisfy a filter condition such as Product_Type = 'DS'.
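To illustrate the difference we are after, here is a minimal sketch in plain Python. It is not Ataccama functionality: the sample records, the `Amount` column, and the "must be populated" rule are all hypothetical, chosen only to show how the score changes when it is computed on the filtered subset.

```python
# Hypothetical sample records; column names and values are illustrative only.
records = [
    {"Product_Type": "DS", "Amount": 100},
    {"Product_Type": "DS", "Amount": None},
    {"Product_Type": "OTHER", "Amount": None},
    {"Product_Type": "OTHER", "Amount": None},
]


def rule_passes(record):
    """Hypothetical DQ rule: Amount must be populated."""
    return record["Amount"] is not None


def dq_score(rows):
    """Percentage of rows passing the rule, or None if there are no rows."""
    if not rows:
        return None
    passed = sum(1 for row in rows if rule_passes(row))
    return 100.0 * passed / len(rows)


# Score over the whole set: records the rule does not apply to still count.
score_all = dq_score(records)

# Score over only the applicable subset (Product_Type = 'DS').
subset = [r for r in records if r["Product_Type"] == "DS"]
score_subset = dq_score(subset)

print(f"whole set: {score_all:.1f}%  subset: {score_subset:.1f}%")
```

In this made-up data the rule passes for 1 of 4 records overall but for 1 of 2 records in the 'DS' subset, so the two scores differ; the second number is the one we want reported for the rule.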

So far we have used a SQL Catalog item to filter the records. Is there any other way to achieve this?

With the SQL Catalog option, we have to create a separate SQL Catalog item for every rule that needs its own filter condition. For example, 2 rules need 2 SQL Catalog items, and 10 such rules would need 10 SQL Catalog items.

Therefore, we would like to know whether there is a workaround or an alternative approach for this situation.

Please suggest.

ivan.kozlov
Ataccamer

Hello @Apurva Kapoor,

I believe the Data Slicing feature might be helpful in this case. It should allow you to run the DQ evaluation in MPs on a subset of records instead of the whole dataset.
Here’s a link to the documentation:
https://docs.ataccama.com/one/latest/catalog-items/create-data-slice.html

This functionality should be available starting from version 15.3.0.

I hope this helps,
Ivan

