Skip to main content
Question

Duplicate table Detection

  • June 25, 2025
  • 1 reply
  • 20 views

HI 

I need to find duplicate tables by comparing the row to row from two or more than two tables. i have 300 tables i need to find duplicate tables. Please help to find this out

Did this topic help you find an answer to your question?

1 reply

Lisa Kovalskaia
Ataccamer
Forum|alt.badge.img+3

Hi ​@pickle21 I think the easiest way to do a high level check is to run a reconciliation project on your data: https://docs.ataccama.com/one/16.0.0/data-quality/data-reconciliation.html My understanding is that you have some candidates and you need to verify whether they really are duplicate tables (while you don’t need to validate every single duplicate value or row). A reconciliation project should give you enough information to confirm if tables are duplicates, or sufficiently similar to be considered duplicates.

It may or may not be the best approach depending on how your data is stored and structured. What is(are) the datasource(s)? What are the data formats? Will Ataccama be able to identify pairs of tables based on their names? Are attribute names the same in both tables in a given pair, or it’s necessary to actually compare the data points while disregarding column names? Please let me know if you think reconciliation projects will do or you’re looking for something else.


Reply


Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

 
Cookie settings