I am profiling tables and am looking at the profiles. I noticed that for date and datetime the profile does not show the MIN or MAX values. That is something I had expected though. Working on the data quality of this data, I really would like to know the MIN and MAX values for these attributes. Is there a way to change the configuration and include these profile statistics for these datatypes?
kind regards, Jur Dördregter
Page 1 / 1
Hi @jdordregter !
Thank you for your question!
As you are correctly pointing out, numerical statistics are calculated only for numeric attributes. However, you can have a look at the Quantiles - They display the values that occur at designated intervals in the ordered data set, with the first value on the list shown at 0% and the last value at 100%. If your date type attributes have standardized format, like YYYY-MM-DD, it will work for you, as well.
Hope this will help you!
Kind regards, Ekaterina
Hi @ekaterina.ponomareva,
I tried your solution, however it does not show the quantiles in my profiling, is this due to our version or the source used?
kind regards, Jur Dördregter
Hi @jdordregter ,
I can see that your data source is Hadoop, so it is indeed because of that.
Kind regards, Ekaterina
Hi @jdordregter I’m closing this thread for now. If you have any follow up questions please feel free to share the in the comments or create a new post ♀️
@ekaterina.ponomareva Good for you to know: in the backend this information is stored. Should not be rocket science to show this information also in web application.
Hi @jdordregter, ONE Desktop and Ataccama ONE does have some differences and thank you for taking the time to share your feedback. I’ve passed this on to our team.