CData Excel Add-In for HDFS
The CData Excel Add-In for HDFS provides the easiest way to connect to HDFS data from Excel. From the CData ribbon, you can select Get Data and choose From HDFS to import tables and columns into the spreadsheet. The spreadsheet will then be linked with the remote data.
- Select tables and columns using the CData ribbon.
- Schedule automatic data refresh in the data selection dialog.
See Establishing a Connection to connect your workbook to HDFS. To easily share the workbook, store the connection in the workbook. Refresh the spreadsheet automatically or from the ribbon.
Using the Excel Add-In
See Using the Excel Add-In to use the CData ribbon to execute operations supported by the add-in:
- Querying Data: You can use the Data Selection wizard (click Edit Query from the ribbon) to select tables, columns, and filters, building a query to HDFS. You can also write your own SQL to query HDFS just as you would query a traditional database; by Writing Parameterized Queries, you can create dynamic spreadsheets that take cell values as the query inputs. Click Refresh Worksheet in the ribbon to pull the latest changes.
See SQL Compliance for a syntax reference and code examples outlining the supported SQL.
See Data Model to find more information on how the add-in models the HDFS APIs as tables, views, and stored procedures, as in a relational database.
Collaborative Query Processing
The add-in enhances the data source's capabilities with additional client side processing, when needed, to enable analytic summaries of data such as SUM, AVG, MAX, MIN, and so on.
See SupportEnhancedSQL, in the Connection section, for more information.
The Connection properties describe the various options that can be used to establish a connection.