Establishing a Connection
Creating a JDBC Data Source
You can create a JDBC data source to connect from your Java application. Creating a JDBC data source based on the CData JDBC Driver for HDFS consists of three basic steps:
- Add the driver JAR file to the classpath. The JAR file is located in the lib subfolder of the installation directory. Note that the .lic file must be located in the same folder as the JAR file.
- Provide the driver class. For example:
cdata.jdbc.hdfs.HDFSDriver
- Provide the JDBC URL. For example:
jdbc:hdfs:Host=sandbox-hdp.hortonworks.com;Port=50070;Path=/user/root;
or
jdbc:cdata:hdfs:Host=sandbox-hdp.hortonworks.com;Port=50070;Path=/user/root;
Use the second format when another driver in your application uses the same URL format, so that the CData driver is the one selected. The URL must start with either "jdbc:hdfs:" or "jdbc:cdata:hdfs:" and can include any of the connection properties as name-value pairs separated by semicolons.
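The three steps above can be sketched in Java as follows. This is a minimal illustration, not the driver's official sample: the class name HdfsConnectionSketch and the helpers hasValidPrefix and open are hypothetical names introduced here, and the host, port, and path are the example values from this page. It assumes the driver JAR and its .lic file are already on the classpath.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.SQLException;

class HdfsConnectionSketch {
    // Driver class name as documented above.
    static final String DRIVER_CLASS = "cdata.jdbc.hdfs.HDFSDriver";

    // Hypothetical helper: checks that the URL uses one of the two accepted prefixes.
    static boolean hasValidPrefix(String url) {
        return url.startsWith("jdbc:hdfs:") || url.startsWith("jdbc:cdata:hdfs:");
    }

    // Hypothetical helper: loads the driver class and opens a connection.
    // Throws ClassNotFoundException if the driver JAR is not on the classpath.
    static Connection open(String url) throws SQLException, ClassNotFoundException {
        if (!hasValidPrefix(url)) {
            throw new IllegalArgumentException("Unexpected URL prefix: " + url);
        }
        Class.forName(DRIVER_CLASS);
        return DriverManager.getConnection(url);
    }

    public static void main(String[] args) throws Exception {
        // Example URL from this page; replace with your own host, port, and path.
        String url = "jdbc:hdfs:Host=sandbox-hdp.hortonworks.com;Port=50070;Path=/user/root;";
        try (Connection conn = open(url)) {
            System.out.println("Connection open: " + !conn.isClosed());
        }
    }
}
```

The try-with-resources block closes the connection automatically. To switch to the alternative prefix, only the URL string changes.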
Connecting to HDFS
To connect, set the following connection properties:
- Host: Set this value to the host of your HDFS installation.
- Port: Set this value to the port of your HDFS installation. Default port: 50070
- UseSSL: (Optional) Set this value to 'True' to negotiate TLS/SSL connections to the HDFS server. Default: 'False'.
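As an illustration of how these properties combine into a JDBC URL, the sketch below assembles Host, Port, and UseSSL into semicolon-separated name-value pairs. The class HdfsUrlBuilder and its buildUrl method are hypothetical helpers written for this example and are not part of the driver. The sketch assumes the values contain no semicolons or other characters that would need escaping.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class HdfsUrlBuilder {
    // Hypothetical helper: joins connection properties into a JDBC URL.
    static String buildUrl(String host, int port, boolean useSsl) {
        // LinkedHashMap keeps the properties in insertion order.
        Map<String, String> props = new LinkedHashMap<>();
        props.put("Host", host);
        props.put("Port", Integer.toString(port));
        props.put("UseSSL", useSsl ? "True" : "False");

        StringBuilder url = new StringBuilder("jdbc:hdfs:");
        for (Map.Entry<String, String> entry : props.entrySet()) {
            // Each property is written as Name=Value followed by a semicolon.
            url.append(entry.getKey()).append('=').append(entry.getValue()).append(';');
        }
        return url.toString();
    }

    public static void main(String[] args) {
        // Uses the example host and the default port from this page.
        System.out.println(buildUrl("sandbox-hdp.hortonworks.com", 50070, false));
        // prints jdbc:hdfs:Host=sandbox-hdp.hortonworks.com;Port=50070;UseSSL=False;
    }
}
```

Because Port and UseSSL have defaults, a hand-written URL can leave them out. The helper writes them explicitly only to show the format.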
Authenticating to HDFS
Two authentication methods are available for connecting to the HDFS data source: Anonymous Authentication and Negotiate (Kerberos) Authentication.
Anonymous Authentication
In some situations, you can connect to HDFS without setting any authentication connection properties. To do so, set the AuthScheme property to None (the default).
Kerberos
When authentication credentials are required, you can use Kerberos. See Using Kerberos for details on how to authenticate with Kerberos.