SSIS Components for HDFS

Build 20.0.7587

CData SSIS Components for HDFS

Overview

The CData SSIS Components for HDFS enable you to connect SQL Server with HDFS data through SSIS workflows. The components wrap the complexity of accessing HDFS data in standard SSIS data flow components.

The components abstract the underlying data source into tables, views, and stored procedures that can be used to retrieve data. You can then connect and synchronize HDFS tables with SQL Server tables.

The components hide the complexity of accessing data and provide additional security features, smart caching, batching, socket management, and more.

Key Features

  • Collaborative query processing.
  • Access HDFS data in real time.
  • Integrate HDFS data without the need for custom development.

Getting Started

Getting Started covers Establishing a Connection with the Connection Manager and selecting rows Using the Source Component. See the HDFS integration guides for information on connecting from other applications.

SQL Compliance

See SQL Compliance for a syntax reference and code examples outlining the supported SQL.

Data Model

See Data Model to find more information on how the component models the HDFS APIs as tables, views, and stored procedures.

Collaborative Query Processing

The component enhances the data source's capabilities with additional client side processing, when needed, to enable analytic summaries of data such as SUM, AVG, MAX, MIN, and so on.

See SupportEnhancedSQL, in the Connection section, for more information.

Connection Properties

The Connection properties describe the various options that can be used to establish a connection.

Copyright (c) 2020 CData Software, Inc. - All rights reserved.
Build 20.0.7587