CData SSIS Components for Apache Hive
The CData SSIS Components for Apache Hive enable you to connect SQL Server with Apache Hive data through SSIS workflows. The components wrap the complexity of accessing Apache Hive data in standard SSIS data flow components.
The components abstract the underlying data source into tables, views, and stored procedures that can be used to retrieve and update data. You can then connect and synchronize Apache Hive tables with SQL Server tables.
The components hide the complexity of accessing data and provide additional security features, smart caching, batching, socket management, and more.
- Create, read, update, and delete (CRUD) support.
- Collaborative query processing.
- Access Apache Hive data in real time.
- Integrate Apache Hive data without the need for custom development.
Getting Started covers Establishing a Connection with the Connection Manager and selecting rows Using the Source Component, and making changes Using the Destination Component. See the Apache Hive integration guides for information on connecting from other applications.
Advanced Features details additional features supported by the component, such as defining user defined views, ssl configuration, remoting, firewall/proxy settings, and advanced logging.
See SQL Compliance for a syntax reference and code examples outlining the supported SQL.
See Data Model to find more information on how the component models the Apache Hive APIs as tables, views, and stored procedures.
Collaborative Query Processing
The component enhances the data source's capabilities with additional client side processing, when needed, to enable analytic summaries of data such as SUM, AVG, MAX, MIN, and so on.
See SupportEnhancedSQL, in the Connection section, for more information.
The Connection properties describe the various options that can be used to establish a connection.