CData ODBC Driver for Spark SQL
The CData ODBC Driver for Spark SQL enables real-time access to Spark SQL data, directly from any applications that support ODBC connectivity, the most widely supported interface for connecting applications with data.
The driver wraps the complexity of accessing Spark SQL data in a standard ODBC driver compliant with ODBC 3.8. Applications can then access Spark SQL as a traditional database. The driver hides the complexity of accessing data and provides additional powerful security features, smart caching, batching, socket management, and more.
- Bidirectional access.
- Collaborative query processing.
- Access, analyze, and report on data with your SQL-based tool of choice.
- Supports 32-bit and 64-bit applications.
- Supports the Unicode ODBC APIs and ODBC 3.8.
- Drivers for Windows, Linux, and macOS/OS X.
See Getting Started to create an ODBC data source name (DSN) and connect to Spark SQL from your platform.
Advanced Features details additional features supported by the driver, such as defining user defined views, ssl configuration, remoting, caching, firewall/proxy settings, and advanced logging.
Using the ODBC Driver
See Using from Tools for information on using the ODBC driver in some of the popular applications and programming languages that connect via ODBC. See Using ODBC to connect to Spark SQL from the following programming languages and for information on the supported interfaces:
See the Spark SQL integration guides for information on connecting from other applications.
You can use SQL Linked Server to connect to the CData SQL Gateway. SQL Server treats the CData SQL Gateway as a linked SQL Server instance, so you can write full SQL queries to query your Spark SQL data without losing performance as would happen connecting to an ODBC data source directly. The connection can be made locally or to the Gateway located on a separate machine.
See Create a Linked Server for more information.
See SQL Gateway to configure MySQL or SQL Server entry points for Spark SQL on your server, using the included CData SQL Gateway application. The SQL Gateway listens for incoming MySQL and SQL Server connections and brokers the SQL request to the ODBC data source. This enables any MySQL or SQL Server client to connect, from PHP applications, SQL Server linked servers, and so on.
See SQL Compliance for a syntax reference and code examples outlining the supported SQL.
See Data Model for the available database objects. This section also provides more detailed information on querying specific Spark SQL entities.
Collaborative Query Processing
The driver enhances the data source's capabilities with additional client side processing, when needed, to enable analytic summaries of data such as SUM, AVG, MAX, MIN, and so on.
See SupportEnhancedSQL, in the Connection section, for more information.
The Connection properties describe the various options that can be used to establish a connection.