DataModel Parameter (Connect-Parquet Cmdlet)
Specifies the data model to use when parsing Parquet documents and generating the database metadata.
Syntax
Connect-Parquet -DataModel string
Possible Values
None, Document, FlattenedDocuments, RelationalData Type
cstr
Default Value
"Document"
Remarks
The cmdlet splits documents into rows based on the objects nested in arrays. Select a DataModel configuration to configure how the cmdlet models nested object arrays into tables.
Selecting a Data Modeling Strategy
The following DataModel configurations are available.
-
Document
Returns a single table representing a row for each top-level object. In this data model, any nested object arrays will not be flattened and will be returned as aggregates.
-
FlattenedDocuments
Returns a single table representing a SQL CROSS JOIN of the available documents in the file.
-
Relational
Returns multiple tables, one for each nested object array. In this data model, any nested documents (object arrays) will be returned as relational tables that contain a primary key and a foreign key that links to the parent table.
See Also
- FlattenArrays and FlattenObjects: Customize the columns that will be identified for each of these data models.
- Parsing Hierarchical Data: Compare the schemas resulting from different DataModel settings, with example queries.