SSIS Components for HDFS

Build 21.0.7930

CData SSIS Components for HDFS

Overview

The CData SSIS Components for HDFS enable you to connect SQL Server with HDFS data through SSIS workflows. The components wrap the complexity of accessing HDFS data in standard SSIS data flow components.

The components abstract the underlying data source into tables, views, and stored procedures that can be used to retrieve data. You can then connect and synchronize HDFS tables with SQL Server tables.

They also provide additional security features, smart caching, batching, socket management, and more.

Key Features

  • Collaborative query processing.
  • Access HDFS data in real time.
  • Integrate HDFS data without the need for custom development.

Getting Started

Getting Started covers establishing a connection with the Connection Manager and selecting rows using the Source Component. See the HDFS integration guides for information on connecting from other applications.

Advanced Features

Advanced Features details additional features supported by the component, such as defining user-defined views, SSL configuration, remoting, firewall/proxy settings, and advanced logging.

SQL Compliance

See SQL Compliance for a syntax reference and code examples outlining the supported SQL.

Data Model

See Data Model for more information on how the component models the HDFS APIs as tables, views, and stored procedures.

Collaborative Query Processing

The component enhances the data source's capabilities with additional client-side processing, when needed, to enable analytic summaries of data such as SUM, AVG, MAX, and MIN.

See SupportEnhancedSQL, in the Connection section, for more information.
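
To illustrate the general idea of client-side processing, the following is a minimal Python sketch, not the component's actual implementation. It assumes the data source can only return raw rows, so the aggregates are computed locally after the rows are fetched. The row data, the column names (FileName, Size), and the function aggregate_client_side are hypothetical and chosen only for illustration.

```python
# Hypothetical rows standing in for what a source without
# aggregate support might return (names are assumptions).
rows = [
    {"FileName": "a.csv", "Size": 120},
    {"FileName": "b.csv", "Size": 300},
    {"FileName": "c.csv", "Size": 180},
]


def aggregate_client_side(rows, column):
    """Compute SUM, AVG, MAX, and MIN over one column of already-fetched rows."""
    # Skip NULL-like values, as SQL aggregates conventionally do.
    values = [row[column] for row in rows if row[column] is not None]
    if not values:
        return {"SUM": None, "AVG": None, "MAX": None, "MIN": None}
    return {
        "SUM": sum(values),
        "AVG": sum(values) / len(values),
        "MAX": max(values),
        "MIN": min(values),
    }


summary = aggregate_client_side(rows, "Size")
print(summary)
```

In this sketch the source does the retrieval and the client does the arithmetic. That is the division of labor the paragraph above describes, applied only when the source cannot evaluate the aggregate itself.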

Connection Properties

The Connection properties describe the various options that can be used to establish a connection.

Copyright (c) 2021 CData Software, Inc. - All rights reserved.