Skip to main content
Solved

Ataccama Naming Best Practice

  • June 26, 2024
  • 3 replies
  • 205 views

Hi everyone,Ā 

Just wondering if Ataccama provides any best practice guides on naming conventions when working with the platform. This includes best practice for rule names, error names, sql catalogue item names etc.Ā 

Ā 

Kind Regards,Ā 

Ā 

Vishy

Best answer by Samuel Muvdi

Hi!

When it comes to ONE desktop, we have a pretty standard best practice list for column prefixes:

src_Ā  : The source value of that field, it does not get altered

pur_ : This is a column that had transformations done to it

std_ : This is a column that has been standardized, usually with a dictionary or standardization transformation

cio_ : The best possible value that can be achieved after processing is done, usually the std_

mat_ : This is used for matching purposes only, for MDM for example

dic_ : A value that was returned or verified by using a dictionary

Ā 

When it comes to scoring, we want to score between 0 (no changes or transformations done) to 10,000,000 (null value)Ā for naming these scoring reasons, we can put things like ā€œID_IS_NULLā€ and set the score to 10,000,000 , or we can do ā€œADDRESS_INCORRECTā€ and set the score to 1,000,000.Ā 

Usually when scoring, we will set a null value to be the score of 10,000,000 , while ones that have smaller rules broken to be under 10,000,000

Ā 

Some more tips:

  • Try to be as descriptive as possible, but keep the names within a reasonable length
  • ā€œcustomers_NAā€ vs ā€œcustomers_north_americaā€ the customers_NA does the job and is more compact
  • ā€œFIRST_NAME_IS_NULLā€ vs ā€œFN_NULLā€ we can use the first letter of words to get the same point accross and keep it more compact
  • There are times though where we can also just shorten the word a bit like ā€œFST_NM_NULLā€ it's nice and compact but does the job! it helps describe better what it is while also not letting it get too long

Ā 

Hope this helps! -

Samuel :)Ā 

Ā 

3 replies

Forum|alt.badge.img

Hi @vdatlaĀ ,

Here are some best practices while creating any rulesĀ  The rule must have a descriptive name. Since this is a collaborative platform, we want others to
understand what the rule does and reuse it, instead of re-creating the same rule from scratch.
The rule name should have the format <attribute> is/has/contains/etc <missing, invalid,
duplicate, between 0 and 10,000 etc>. Examples for rule names include:
ā— Account ID is all zeros
ā— Attribute (string) is duplicate
ā— Attribute (string) is missing
ā— Attribute (date) is missing
ā— Email address is test
ā— Email address format is invalid
ā— ID contains a lowercase character
ā— Market code is invalid
ā— Account Balance is between 0 and 10,000

Ā 

Ā 

Hope this helps !

Regards,

Srija Piratla


Forum|alt.badge.img+2
  • Ataccamer
  • Answer
  • July 1, 2024

Hi!

When it comes to ONE desktop, we have a pretty standard best practice list for column prefixes:

src_Ā  : The source value of that field, it does not get altered

pur_ : This is a column that had transformations done to it

std_ : This is a column that has been standardized, usually with a dictionary or standardization transformation

cio_ : The best possible value that can be achieved after processing is done, usually the std_

mat_ : This is used for matching purposes only, for MDM for example

dic_ : A value that was returned or verified by using a dictionary

Ā 

When it comes to scoring, we want to score between 0 (no changes or transformations done) to 10,000,000 (null value)Ā for naming these scoring reasons, we can put things like ā€œID_IS_NULLā€ and set the score to 10,000,000 , or we can do ā€œADDRESS_INCORRECTā€ and set the score to 1,000,000.Ā 

Usually when scoring, we will set a null value to be the score of 10,000,000 , while ones that have smaller rules broken to be under 10,000,000

Ā 

Some more tips:

  • Try to be as descriptive as possible, but keep the names within a reasonable length
  • ā€œcustomers_NAā€ vs ā€œcustomers_north_americaā€ the customers_NA does the job and is more compact
  • ā€œFIRST_NAME_IS_NULLā€ vs ā€œFN_NULLā€ we can use the first letter of words to get the same point accross and keep it more compact
  • There are times though where we can also just shorten the word a bit like ā€œFST_NM_NULLā€ it's nice and compact but does the job! it helps describe better what it is while also not letting it get too long

Ā 

Hope this helps! -

Samuel :)Ā 

Ā 


Forum|alt.badge.img+1

Hi @vdatla, welcome to the community and thank you for posting! I’m closing this thread for now, if you have any follow-up questions please do share them in the comments below or create a new post šŸ™‹ā€ā™€ļø