Skip to main content
Solved

Parquet File output


Forum|alt.badge.img

Hi,

 

I need to extract metadata using One Desktop and store in Parquet file format in ADLS. Do we have any recommendations on how this can be achieved in One Desktop?

Best answer by ivan.kozlov

Hi ​@sumisha , unfortunately the error doesn't look familiar and the writer step configuration looks correct. I’d suggest you to create a support request with all the details and our team will investigate.

As for the ADLS2 connection, you can find a connection example below:
    <config class="com.ataccama.dqc.azure.config.AzureGen2Contributor">
        <azureGen2Connections>
            <azureGen2Connection clientId="$azure_client_id" authenticateUser="false" clientKey="$encrypted_client_secret" containerName="$container_name" name="$connection_name" storageAccount="$storage_account" authTokenEndpoint="$auth_endpoint_link"/>
         </azureGen2Connections>
    </config>

$azure_client_id, $encrypted_client_secret, $container_name, $connection_name, #storage_account and $auth_endpoint_link will have to be replaced with values relevant for your target ADLS2 container.

$connection_name is what you will use in your file path using resource://$connection_name/… notation.

View original
Did this topic help you find an answer to your question?

ivan.kozlov
Ataccamer
Forum|alt.badge.img+3

Hi Sumisha,

I’m aware of some issues that can appear when Parquet Writer step is writing directly to ADLS container however it would help if you could clarify which error messages you see and which version of product you’re using. Depending on specific problem you have there might be a fix available.

In the past we had to resolve similar issue and the working approach was to write file to local directory and then using workflow with Operate On File to copy or move the file from local directory to ADLS. Workflow step configuration might look like this. You’ll need to have ADLS storage added to your runtime config so you can then reference it using β€œresource://$resourcename” notation like on example below.
 

I hope this helps.
Ivan


Forum|alt.badge.img
  • Data Pioneer
  • June 4, 2025

Thanks Ivan, that was really helpful. To start with, I am trying to extract metadata from Catalog and save in Parquet format as a plan’s output. But it gives me error: Internal error occurred during run of the plan: Illegal char <:> at index 4: file:/c:…..

Would you know if this is not right way to save result in parquet format in local folder?


ivan.kozlov
Ataccamer
Forum|alt.badge.img+3

Could you please post the screenshot of your Parquet Writer step?
Based on the error message it looks like you might have some error in the output file path.

Here’s an example of Parquet Writer configuration with pathvar
 

And here’s another with relative filepath:
 

 


Forum|alt.badge.img
  • Data Pioneer
  • June 4, 2025

This is my output step


Forum|alt.badge.img
  • Data Pioneer
  • June 5, 2025

Also, can you share what details are required to get ADLS storage added to runtime config? 


ivan.kozlov
Ataccamer
Forum|alt.badge.img+3

Hi ​@sumisha , unfortunately the error doesn't look familiar and the writer step configuration looks correct. I’d suggest you to create a support request with all the details and our team will investigate.

As for the ADLS2 connection, you can find a connection example below:
    <config class="com.ataccama.dqc.azure.config.AzureGen2Contributor">
        <azureGen2Connections>
            <azureGen2Connection clientId="$azure_client_id" authenticateUser="false" clientKey="$encrypted_client_secret" containerName="$container_name" name="$connection_name" storageAccount="$storage_account" authTokenEndpoint="$auth_endpoint_link"/>
         </azureGen2Connections>
    </config>

$azure_client_id, $encrypted_client_secret, $container_name, $connection_name, #storage_account and $auth_endpoint_link will have to be replaced with values relevant for your target ADLS2 container.

$connection_name is what you will use in your file path using resource://$connection_name/… notation.


Forum|alt.badge.img
  • Data Pioneer
  • June 6, 2025

Thanks Ivan. Appreciate your support here.


Forum|alt.badge.img
  • Data Pioneer
  • June 17, 2025

Hello, I am trying to make connectivity to ADLS in Azure subscription from One Desktop. What would be the Authentication token endpoin?

 


ivan.kozlov
Ataccamer
Forum|alt.badge.img+3

​@sumisha the auth endpoint usually looks like β€œhttps://login.microsoft.com/$tenantId/oauth2/token”.
Where $tenantId is a unique id/identified of your organization in Azure


Reply


Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

 
Cookie settings