Establishing a Connection
Creating a JDBC Data Source
You can create a JDBC data source to connect from your Java application. Creating a JDBC data source based on the CData JDBC Driver for HDFS consists of three basic steps:
- Add the driver JAR file to the classpath. The JAR file is located in the lib subfolder of the installation directory. Note that the .lic file must be located in the same folder as the JAR file.
- Provide the driver class. For example:
- Provide the JDBC URL. For example:
jdbc:hdfs:Host=sandbox-hdp.hortonworks.com;Port=50070;Path=/user/root;
or
jdbc:cdata:hdfs:Host=sandbox-hdp.hortonworks.com;Port=50070;Path=/user/root;
Use the second format when another driver in your application uses the same URL format; the "jdbc:cdata:hdfs:" prefix ensures that the CData driver is selected. The URL must start with either "jdbc:hdfs:" or "jdbc:cdata:hdfs:" and can include any of the connection properties as name-value pairs separated by semicolons.
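The URL format described above can be sketched in Java as follows. This is an illustrative helper only: the class name HdfsJdbcUrl and the methods buildUrl and hasValidPrefix are hypothetical and are not part of the driver. The sketch joins name-value pairs with semicolons after one of the two accepted prefixes.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper for illustration; not part of the CData driver.
public class HdfsJdbcUrl {

    // Appends each connection property as "Name=Value;" after the given prefix.
    static String buildUrl(String prefix, Map<String, String> props) {
        StringBuilder url = new StringBuilder(prefix);
        for (Map.Entry<String, String> e : props.entrySet()) {
            url.append(e.getKey()).append('=').append(e.getValue()).append(';');
        }
        return url.toString();
    }

    // Checks that the URL starts with one of the two prefixes named in the text.
    static boolean hasValidPrefix(String url) {
        return url.startsWith("jdbc:hdfs:") || url.startsWith("jdbc:cdata:hdfs:");
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps the properties in insertion order.
        Map<String, String> props = new LinkedHashMap<>();
        props.put("Host", "sandbox-hdp.hortonworks.com");
        props.put("Port", "50070");
        props.put("Path", "/user/root");

        System.out.println(buildUrl("jdbc:hdfs:", props));
        System.out.println(buildUrl("jdbc:cdata:hdfs:", props));
    }
}
```

Building the URL from a map keeps each property in one place, which may be convenient when the same settings are reused with either prefix.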
Connecting to HDFS
To connect, set the following connection properties:
- Host: Set this value to the host of your HDFS installation.
- Port: Set this value to the port of your HDFS installation. Default port: 50070
Authenticating to HDFS
Two authentication methods are available for connecting to the HDFS data source: Anonymous Authentication and Negotiate (Kerberos) Authentication.
In some situations, you can connect to HDFS without setting any authentication connection properties. To do so, set AuthScheme to None (the default).
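A minimal sketch of an unauthenticated connection attempt is shown below. It assumes that the driver JAR is on the classpath and that the driver registers itself with java.sql.DriverManager; the class name HdfsConnect and the method url are hypothetical names used only for illustration. Host, Port, and AuthScheme are passed as name-value pairs in the URL.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.SQLException;

// Hypothetical example class; not part of the CData driver.
public class HdfsConnect {

    // Default port stated in the connection property list.
    static final int DEFAULT_PORT = 50070;

    // Builds a URL with Host, Port, and an explicit AuthScheme=None.
    static String url(String host, int port) {
        return "jdbc:hdfs:Host=" + host + ";Port=" + port + ";AuthScheme=None;";
    }

    public static void main(String[] args) {
        String url = url("sandbox-hdp.hortonworks.com", DEFAULT_PORT);

        // try-with-resources closes the connection automatically.
        try (Connection conn = DriverManager.getConnection(url)) {
            System.out.println("Connected: " + !conn.isClosed());
        } catch (SQLException e) {
            // Reached if no suitable driver is registered or the host is unreachable.
            System.err.println("Could not connect: " + e.getMessage());
        }
    }
}
```

Because None is the default, the AuthScheme pair could be omitted; it is written out here only to make the setting visible.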
Authenticate using Kerberos
When authentication credentials are required, you can authenticate with Kerberos. See Using Kerberos for details.