Hi everyone,
Just wondering if Ataccama provides any best practice guides on naming conventions when working with the platform. This includes best practice for rule names, error names, sql catalogue item names etc.
Kind Regards,
Vishy
Hi everyone,
Just wondering if Ataccama provides any best practice guides on naming conventions when working with the platform. This includes best practice for rule names, error names, sql catalogue item names etc.
Kind Regards,
Vishy
Hi
Here are some best practices while creating any rules The rule must have a descriptive name. Since this is a collaborative platform, we want others to
understand what the rule does and reuse it, instead of re-creating the same rule from scratch.
The rule name should have the format <attribute> is/has/contains/etc <missing, invalid,
duplicate, between 0 and 10,000 etc>. Examples for rule names include:
● Account ID is all zeros
● Attribute (string) is duplicate
● Attribute (string) is missing
● Attribute (date) is missing
● Email address is test
● Email address format is invalid
● ID contains a lowercase character
● Market code is invalid
● Account Balance is between 0 and 10,000
Hope this helps !
Regards,
Srija Piratla
Hi!
When it comes to ONE desktop, we have a pretty standard best practice list for column prefixes:
src_ : The source value of that field, it does not get altered
pur_ : This is a column that had transformations done to it
std_ : This is a column that has been standardized, usually with a dictionary or standardization transformation
cio_ : The best possible value that can be achieved after processing is done, usually the std_
mat_ : This is used for matching purposes only, for MDM for example
dic_ : A value that was returned or verified by using a dictionary
When it comes to scoring, we want to score between 0 (no changes or transformations done) to 10,000,000 (null value) for naming these scoring reasons, we can put things like “ID_IS_NULL” and set the score to 10,000,000 , or we can do “ADDRESS_INCORRECT” and set the score to 1,000,000.
Usually when scoring, we will set a null value to be the score of 10,000,000 , while ones that have smaller rules broken to be under 10,000,000
Some more tips:
Hope this helps! -
Samuel :)
Hi
No account yet? Create an account
Enter your E-mail address. We'll send you an e-mail with instructions to reset your password.