CData Python Connector for HPCC

Build 21.0.8011

From Petl

The provider can be used to create ETL applications and pipelines for CSV data in Python using Petl.

Install Required Modules

Install the Petl modules using the pip utility.

pip install petl


Import the modules, including the CData Python Connector for HPCC. You can then use the provider's connect function to create a connection using a valid HPCC connection string. A SQLAlchemy engine may also be used instead of a direct connection.

import petl as etl
import cdata.hpcc as mod
cnxn = mod.connect("URL=;User=test;Password=xA123456;Version=1;Cluster=hthor;")

Extract, Transform, and Load the HPCC Data

Create a SQL query string and store the query results in a DataFrame.

sql = "SELECT	Id, Name FROM hpcc::test::person "
table1 = etl.fromdb(cnxn,sql)

Loading Data

With the query results stored in a DataFrame, you can load your data into any supported Petl destination. The following example loads the data into a CSV file.


Copyright (c) 2021 CData Software, Inc. - All rights reserved.
Build 21.0.8011