CData Python Connector for HPCC

Build 21.0.7930

From Pandas

When combined with the connector, Pandas can be used to generate data frames which contains your HPCC data. Once created, a data frame can be passed to various other python packages.

Connecting

Pandas will need to be imported before it can be used. Pandas will also rely on a SQLAlchemy engine when executing queries, as below:

import pandas as pd
from sqlalchemy import create_engine
engine = create_engine("hpcc:///?URL=http://127.0.0.1:8510;User=test;Password=xA123456;Version=1;Cluster=hthor;")

Querying Data

SELECT queries are provided in a call to the "read_sql()" method in pandas, alongside a relevant connection object. Pandas will execute the query on that connection, and return the results in the form of a data frame, which are used for a variety of purposes.

df = pd.read_sql("""
	SELECT
	   Id,
	   Name,
     $exNumericCol;
	FROM hpcc::test::person;""", engine)
print(df)

Copyright (c) 2021 CData Software, Inc. - All rights reserved.
Build 21.0.7930