Did you know our DQ Analyser for Windows (32 or 64bit) can help you to gain valuable insights into your data, and the application is free to use.
Visit https://www.ataccama.com/download/dq-analyzer
Some of the features include;
- Duplicate count
- Pattern & mask analysis
- Business rule analysis
- Numeric statistics
- Ability to drill down into the data on a per record basis
- Foreign key analysis
- Dependency analysis
- Primary key analysis
Concerned about duplicate values in your datasets?
Frequency Analysis can tell us that we have a value ‘SYSTEM MIGRATION’ in the customer name column 4551 times. It is clear that these customer names were not transferred into the current system correct - supposedly during previous migration activity.
One of my personal favorites for the identification of duplicate records is Group Frequency analysis.
Here we can see the same value of 4551 as a group size, with a group count of 1. We can interpret the group analysis as there is 1 value that appears 4551 times.
Or using another example, there are 73 values which appear 3 times. (total of 219 records).
We can also see that there are 130464 values which appear only 1 time.