Hi everyone,
Just wondering if Ataccama provides any best practice guides on naming conventions when working with the platform. This includes best practice for rule names, error names, sql catalogue item names etc.
Kind Regards,
Vishy
Hi everyone,
Just wondering if Ataccama provides any best practice guides on naming conventions when working with the platform. This includes best practice for rule names, error names, sql catalogue item names etc.
Kind Regards,
Vishy
Best answer by Samuel Muvdi
Hi!
When it comes to ONE desktop, we have a pretty standard best practice list for column prefixes:
src_ : The source value of that field, it does not get altered
pur_ : This is a column that had transformations done to it
std_ : This is a column that has been standardized, usually with a dictionary or standardization transformation
cio_ : The best possible value that can be achieved after processing is done, usually the std_
mat_ : This is used for matching purposes only, for MDM for example
dic_ : A value that was returned or verified by using a dictionary
When it comes to scoring, we want to score between 0 (no changes or transformations done) to 10,000,000 (null value) for naming these scoring reasons, we can put things like “ID_IS_NULL” and set the score to 10,000,000 , or we can do “ADDRESS_INCORRECT” and set the score to 1,000,000.
Usually when scoring, we will set a null value to be the score of 10,000,000 , while ones that have smaller rules broken to be under 10,000,000
Some more tips:
Hope this helps! -
Samuel :)
No account yet? Create an account
Enter your E-mail address. We'll send you an e-mail with instructions to reset your password.