Solved

Data Quality Monitoring Project Result Download

  • March 12, 2024
  • 5 replies
  • 302 views

  • Universe Traveller
  • 2 replies

I have a data quality monitoring project to which I’ve deployed a post-processing component. Results are written to a CSV file, which I presume is stored within Ataccama’s object storage. When I navigate to the data quality monitoring project and select the Export tab, I see the view in the screenshot below, which appears to include a file path along with a download button.

I’m trying to determine how to automate the download of the CSV file shown below. Is there a way to determine the fully qualified path to the file and download it from the Ataccama server?

 

Best answer by anna.spakova

5 replies

anna.spakova
  • Ataccamer
  • 211 replies
  • Answer
  • March 12, 2024

Hello @jbrant, the files are stored in MinIO, which is a mandatory component installed with the platform. Usually, the UI is accessible at https://minio-console.<rest of the url to one>. You should have the link and credentials from the platform onboarding. If not, please contact your Ataccama contact person or our support (in the case of PaaS).

You can then automate the download using, for example, the orchestration server. You would define the connection to MinIO as a server connection and use that in a plan to read the data from the files.

You can define the connection to MinIO in ONE Desktop and then use it with resource://minio/export/<rest of the path to the data> as the file path in the Text File Reader step. The bucket in your case is the shared one.
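
For readers who would rather pull the file from a script outside the platform, here is a minimal Python sketch using the MinIO Python SDK (the `minio` package). This is not the plan-based approach described above but an alternative, and it assumes the MinIO S3 API endpoint is reachable from where the script runs. The endpoint, credentials, bucket name, and object key are placeholders (assumptions), not values from this thread.

```python
from pathlib import Path


def make_client(endpoint: str, access_key: str, secret_key: str):
    """Create a MinIO client. Requires `pip install minio`."""
    # Imported lazily so the helper below can be used and tested without the SDK.
    from minio import Minio

    return Minio(endpoint, access_key=access_key, secret_key=secret_key, secure=True)


def download_export(client, bucket: str, object_key: str, dest_dir: str) -> Path:
    """Download one object into dest_dir and return the local file path."""
    target = Path(dest_dir) / Path(object_key).name
    target.parent.mkdir(parents=True, exist_ok=True)
    # fget_object(bucket_name, object_name, file_path) writes the object to disk.
    client.fget_object(bucket, object_key, str(target))
    return target


# Hypothetical usage; every value here is a placeholder (assumption):
# client = make_client("<minio api host>", "<access key>", "<secret key>")
# download_export(client, "<bucket>", "<object key of the CSV>", "./downloads")
```

The download is read-only, so it does not move or delete anything in the bucket.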

Let me know if this helps or if you have further questions.

Kind regards,

Anna


may_kwok
  • Star Blazer L3
  • 93 replies
  • March 12, 2024

@jbrant, if your organisation has an S3 bucket, ADLS Gen2 storage, or Google Cloud Storage, I’d recommend trying to save the file directly to those locations. This way you have your own dedicated space, and you won’t need to store things alongside critical application components in MinIO.

I personally prefer to touch MinIO as little as possible. If you accidentally move or delete a component on MinIO, it could have very bad consequences.


  • Universe Traveller
  • 13 replies
  • November 3, 2025

I am currently trying to retrieve files from the MinIO storage. In ONE Desktop I have a workflow which gets a file from MinIO using the above method, and it works as expected. I've exported the runtimeConfig from ONE Desktop and put it on the orchestration server. When I trigger the workflow on the server, the workflow can't find the file and gives the following error:
 

As I mentioned, in ONE Desktop it works as expected with the exact same configuration of server and workflow.

@anna.spakova You mentioned it can be used for automation on the orchestration server, so I assumed this would work.

Are there any other settings I have to take into account? Or perhaps any changes to the runtime config to make it usable on the runtime server?


may_kwok
  • Star Blazer L3
  • 93 replies
  • November 5, 2025

I can think of 2 things to check:

  1. Is the MinIO you have set up in ONE Desktop the same environment as the one set up for the runtime server?
  2. Is the file still there?

I’m inclined to believe that the MinIO connection on the runtime server is a valid connection, otherwise it would say connection not found. So it’s connected to some MinIO.

Also, when you tested it to “retrieve” the file, was it a MOVE or a COPY? If you ran the workflow in ONE Desktop and it MOVEd the file, the file won’t be there anymore when you test it on the server.

You should be able to verify the file is there from ONE Desktop (connect to MinIO in ONE Desktop and browse, or connect to MinIO in a web browser), then trigger the workflow on the runtime server.
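
If you want to script that "is the file still there" check instead of browsing, a minimal Python sketch could look like the one below. It assumes `client` is a `minio.Minio` instance from the MinIO Python SDK that points at the same MinIO as the runtime server; the bucket name and object key are placeholders you would fill in yourself.

```python
def object_exists(client, bucket: str, object_key: str) -> bool:
    """Return True if an object with exactly this key exists in the bucket."""
    # list_objects yields every entry whose name starts with the prefix,
    # so compare names exactly to avoid matching e.g. "results.csv.bak".
    for obj in client.list_objects(bucket, prefix=object_key, recursive=True):
        if obj.object_name == object_key:
            return True
    return False


# Hypothetical usage; both values are placeholders (assumptions):
# object_exists(client, "<bucket>", "<object key of the CSV>")
```

Running this once before and once after the ONE Desktop run would also show whether that run moved the file.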


  • Universe Traveller
  • 13 replies
  • November 6, 2025

EDIT: There was a typo in the input for the variable, and I just kept copy-pasting the same input. Everything works as expected.

 

Hi May,

Thanks for your response.

  1. MinIO is indeed set to the same environment. I double-checked everything just to be sure, and the orchestration server is in the same environment as the MinIO server. I also checked the port connections from the orchestration server to the MinIO server, which are all good.
  2. The file is still there; I'm just using copy.

While typing my response I seem to have found the issue. Leaving my answers in here though.

It seems that variables in the workflow are not supported (?)

The workflow is set up to iterate over several files on a daily basis, so I have a variable for the date and a variable for the filename. This makes the path defined in the workflow as follows:

resource://MINIO/export/${today}/${filename}.csv

When the workflow fails, it does however show the correct path in the error message, which would imply that the variables work as intended, so I still don't quite understand why it fails.
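
Since the cause turned out to be a typo in the variable input (see the EDIT at the top of this post), a small pre-check on the values before they go into the workflow can catch that kind of mistake. Below is a minimal Python sketch that builds the same path pattern and rejects inputs with stray whitespace or a doubled extension. The function name and the date format are assumptions; the thread does not say how ${today} is formatted.

```python
from datetime import date


def build_resource_path(connection: str, day: date, filename: str,
                        date_format: str = "%Y-%m-%d") -> str:
    """Build resource://<connection>/export/<date>/<filename>.csv with basic checks."""
    for name, value in (("connection", connection), ("filename", filename)):
        # repr() in the message makes invisible characters such as trailing spaces visible.
        if not value or value != value.strip():
            raise ValueError(f"{name} is empty or has surrounding whitespace: {value!r}")
    if filename.lower().endswith(".csv"):
        # The pattern appends .csv itself, so the input must not include it.
        raise ValueError(f"filename should not include the extension: {filename!r}")
    return f"resource://{connection}/export/{day.strftime(date_format)}/{filename}.csv"


# Hypothetical usage with made-up values:
# build_resource_path("MINIO", date(2025, 11, 6), "results")
```

Printing the values with repr() is the useful part here: a path can look correct in an error message and still contain a trailing space or similar character that is hard to spot.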