CData ADO.NET Provider for Parquet
The CData ADO.NET Provider for Parquet offers the most natural way to access Parquet data from .NET applications. The provider wraps the complexity of accessing Parquet data in an easy-to-integrate, fully managed ADO.NET Data Provider. .NET applications can then access Parquet as a traditional database through the provider.
The provider hides the complexity of accessing data and provides additional powerful security features, smart caching, batching, socket management, and more.
- Seamlessly integrate Parquet data into .NET Framework, .NET Core, Entity Framework, and Entity Framework Core applications.
- Real-time access to Parquet.
- Compatible with existing Microsoft tools such as SSAS, SSRS, and Visual Studio.
The provider has the same ADO.NET architecture as the native .NET data providers for SQL Server and OLEDB. Code with familiar classes such as ParquetConnection, ParquetCommand, ParquetDataAdapter, ParquetDataReader, ParquetDataSource, ParquetParameter, and so on. See Using ADO.NET for guides relating to these and other ADO.NET features such as batch processing, connection pooling, and calling stored procedures.
You can leverage Entity Framework to quickly and easily model database resources using .NET objects. In Using Entity Framework, you can find instructions related to EF6 setup and installation, as well as the creation of both model-first and code-first data models.
Entity Framework Core
Entity Framework Core provides a streamlined, cross-platform solution for modelling database resources as .NET objects. Using Entity Framework Core covers how to surface Parquet data using EF Core console and ASP.NET applications. Additionally, you can find instructions for automatically building data models from data source metadata using reverse engineering (scaffolding).
You can use the provider to enable real-time connectivity to Parquet within your SSRS reports. Using SSRS details provider deploy, the creation of shared and embedded data sources and datasets, and the publishing of SSRS reports.
The provider supports the creation of strongly typed DbProviderFactory and DBConnection objects in order to facilitate connecting to Parquet with generic code. Using DbProviderFactory describes how to get connected as well as create, configure, and execute DbCommands.
See Schema Discovery to use standard ADO.NET schema collections to discover schema information and other metadata. Query the System Tables to access additional metadata, such as data source capabilities.
See SQL Compliance for a syntax reference and code examples outlining the supported SQL.
See Caching Data to configure replication and caching for a range of scenarios common to remote data access. Configurations include:
- Autocache: Automatically cache data to a lightweight database. Save data for later offline use or enable fast reporting from the cache.
- Replication: Copy data to local and cloud data stores such as Oracle, SQL Server, Google Cloud SQL, and so on. The replication commands allow for intelligent incremental updates to cached data.
- No caching: Work with remote data only. No local cache file is created.
See Data Model for information on the available database objects and how to use SQL to work with Parquet APIs. This section provides more detailed information on querying specific Parquet entities.
Collaborative Query Processing
The provider enhances the data source's capabilities with additional client side processing, when needed, to enable analytic summaries of data such as SUM, AVG, MAX, MIN, and so on.
See SupportEnhancedSQL, in the Connection section, for more information.
Connection String Options
The Connection properties describe the various options that can be used to establish a connection.