CData ADO.NET Provider for HDFS
The CData ADO.NET Provider for HDFS offers the most natural way to access HDFS data from .NET applications. The provider wraps the complexity of accessing HDFS data in an easy-to-integrate, fully managed ADO.NET Data Provider. .NET applications can then access HDFS as a traditional database through the provider.
The provider hides the complexity of accessing data and provides additional powerful security features, smart caching, batching, socket management, and more.
- Seamlessly integrate HDFS data into .NET Framework, .NET Core, Entity Framework, and Entity Framework Core applications.
- Access HDFS data in real time.
- Work with existing Microsoft tools such as SSAS, SSRS, and Visual Studio.
Advanced Features details additional features supported by the provider, such as user-defined views, SSL configuration, remoting, caching, firewall/proxy settings, and advanced logging.
The provider has the same ADO.NET architecture as the native .NET data providers for SQL Server and OLE DB. You write code with familiar classes such as HDFSConnection, HDFSCommand, HDFSDataAdapter, HDFSDataReader, HDFSDataSource, and HDFSParameter. See Using ADO.NET for guides relating to these and other ADO.NET features such as batch processing, connection pooling, and calling stored procedures.
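As a minimal sketch of what this looks like in practice, the following opens a connection and reads rows using the standard ADO.NET pattern. Several details are assumptions for illustration and are not confirmed by this page: the System.Data.CData.HDFS namespace, the Host, Port, and Path connection properties and their values, and the Files table name. Check Using ADO.NET and the Connection properties for the actual names.

```csharp
using System;
using System.Data.Common;
using System.Data.CData.HDFS; // assumed namespace for the provider classes

class AdoNetExample
{
    static void Main()
    {
        // Hypothetical connection string: property names and values are placeholders.
        string connectionString = "Host=localhost;Port=50070;Path=/user/root;";

        using (var connection = new HDFSConnection(connectionString))
        {
            connection.Open();

            // CreateCommand and ExecuteReader come from the ADO.NET base classes,
            // so this part looks the same as with any other data provider.
            using (DbCommand command = connection.CreateCommand())
            {
                // "Files" is an assumed table name; see Data Model for the real objects.
                command.CommandText = "SELECT * FROM Files";

                using (DbDataReader reader = command.ExecuteReader())
                {
                    while (reader.Read())
                    {
                        // Print the first column of each row.
                        Console.WriteLine(reader[0]);
                    }
                }
            }
        }
    }
}
```

The using blocks dispose the reader, command, and connection in reverse order, which is the usual way to release provider resources deterministically.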
You can leverage Entity Framework to quickly and easily model database resources using .NET objects. In Using Entity Framework, you can find instructions related to EF6 setup and installation, as well as the creation of both model-first and code-first data models.
Entity Framework Core
Entity Framework Core provides a streamlined, cross-platform solution for modelling database resources as .NET objects. Using Entity Framework Core covers how to surface HDFS data using EF Core console and ASP.NET applications. Additionally, you can find instructions for automatically building data models from data source metadata using reverse engineering (scaffolding).
You can use the provider to enable real-time connectivity to HDFS within your SSRS reports. Using SSRS details provider deployment, the creation of shared and embedded data sources and datasets, and the publishing of SSRS reports.
The provider supports the creation of strongly typed DbProviderFactory and DbConnection objects in order to facilitate connecting to HDFS with generic code. Using DbProviderFactory describes how to connect and how to create, configure, and execute DbCommand objects.
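The generic pattern can be sketched with only the System.Data.Common types, so the calling code never names a provider-specific class. The invariant name "System.Data.CData.HDFS", the connection string, and the Files table are assumptions made for illustration; the sketch also assumes the provider factory is already registered (on .NET Core this typically requires an explicit DbProviderFactories.RegisterFactory call).

```csharp
using System;
using System.Data.Common;

class FactoryExample
{
    static void Main()
    {
        // Assumed invariant name; confirm it in Using DbProviderFactory.
        DbProviderFactory factory = DbProviderFactories.GetFactory("System.Data.CData.HDFS");

        using (DbConnection connection = factory.CreateConnection())
        {
            // Hypothetical connection string: property names and values are placeholders.
            connection.ConnectionString = "Host=localhost;Port=50070;Path=/user/root;";
            connection.Open();

            using (DbCommand command = connection.CreateCommand())
            {
                // "Files" is an assumed table name used only for this example.
                command.CommandText = "SELECT * FROM Files";

                using (DbDataReader reader = command.ExecuteReader())
                {
                    while (reader.Read())
                    {
                        Console.WriteLine(reader[0]);
                    }
                }
            }
        }
    }
}
```

Because only the invariant name and connection string identify the data source, they can be moved into configuration and the same code reused against other providers.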
See Schema Discovery to use standard ADO.NET schema collections to discover schema information and other metadata. Query the System Tables to access additional metadata, such as data source capabilities.
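For illustration, schema collections are read through the standard DbConnection.GetSchema method. The sketch below prints every column of the "Tables" collection rather than assuming particular column names. The System.Data.CData.HDFS namespace and the connection string are assumptions, and the exact collections the provider exposes are described in Schema Discovery.

```csharp
using System;
using System.Data;
using System.Data.CData.HDFS; // assumed namespace for the provider classes

class SchemaExample
{
    static void Main()
    {
        // Hypothetical connection string: property names and values are placeholders.
        using (var connection = new HDFSConnection("Host=localhost;Port=50070;Path=/user/root;"))
        {
            connection.Open();

            // "Tables" is a common ADO.NET schema collection name.
            DataTable tables = connection.GetSchema("Tables");

            foreach (DataRow row in tables.Rows)
            {
                foreach (DataColumn column in tables.Columns)
                {
                    // Print each metadata field as name=value.
                    Console.Write($"{column.ColumnName}={row[column]}; ");
                }
                Console.WriteLine();
            }
        }
    }
}
```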
See SQL Compliance for a syntax reference and code examples outlining the supported SQL.
See Data Model for information on the available database objects and how to use SQL to work with HDFS APIs. This section provides more detailed information on querying specific HDFS entities.
Collaborative Query Processing
When needed, the provider supplements the data source's capabilities with client-side processing to enable analytic summaries of data, such as SUM, AVG, MAX, and MIN.
See SupportEnhancedSQL, in the Connection section, for more information.
Connection String Options
The Connection properties describe the various options that can be used to establish a connection.