Cmdlets for Parquet

Build 24.0.9060

DataModel Parameter (Connect-Parquet Cmdlet)

Specifies the data model to use when parsing Parquet documents and generating the database metadata.

Syntax

Connect-Parquet -DataModel string

Possible Values

None, Document, FlattenedDocuments, Relational

Data Type

cstr

Default Value

"Document"

Remarks

The cmdlet splits documents into rows based on the objects nested in arrays. Select a DataModel configuration to configure how the cmdlet models nested object arrays into tables.

Selecting a Data Modeling Strategy

The following DataModel configurations are available.

  • Document

    Returns a single table representing a row for each top-level object. In this data model, any nested object arrays will not be flattened and will be returned as aggregates.

  • FlattenedDocuments

    Returns a single table representing a SQL CROSS JOIN of the available documents in the file.

  • Relational

    Returns multiple tables, one for each nested object array. In this data model, any nested documents (object arrays) will be returned as relational tables that contain a primary key and a foreign key that links to the parent table.

See Also

Copyright (c) 2024 CData Software, Inc. - All rights reserved.
Build 24.0.9060