ADO.NET Provider for Amazon Redshift

Build 24.0.9060

Batch Processing

The CData ADO.NET Provider for Amazon Redshift enables you to take advantage of the bulk load support in Amazon Redshift through the RedshiftDataAdapter class. You can use the Batch API to execute related SQL data manipulation statements together as a batch.

Using the ADO.NET Batch API

Performing a batch update consists of the following basic steps:

  1. Define custom parameterized SQL statements in RedshiftCommand objects.
  2. Set the UpdatedRowSource property of the RedshiftCommand object to "UpdateRowSource.None".
  3. Assign the RedshiftCommand objects to the RedshiftDataAdapter.
  4. Add the parameters to the command.
  5. Call the RedshiftDataAdapter's Update method. Pass in a DataSet or DataTable containing your changes.

Controlling Batch Size

Depending on factors such as the size of the request, your network resources, and the performance of the server, you may gain performance by executing several smaller batch requests. You can control the size of each batch by setting the RedshiftDataAdapter's UpdateBatchSize property to a positive integer.
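
To make the effect of UpdateBatchSize concrete, the following sketch counts how many batch requests a given number of pending rows would produce. It is plain arithmetic and assumes the adapter sends consecutive groups of at most UpdateBatchSize rows. BatchMath and CountBatches are hypothetical helper names and are not part of the provider.

```csharp
using System;

public static class BatchMath
{
    // Hypothetical helper: number of batch requests needed for pendingRows rows
    // when each request carries at most batchSize rows (batchSize must be positive).
    public static int CountBatches(int pendingRows, int batchSize)
    {
        if (pendingRows < 0) throw new ArgumentOutOfRangeException(nameof(pendingRows));
        if (batchSize <= 0) throw new ArgumentOutOfRangeException(nameof(batchSize));
        return (pendingRows + batchSize - 1) / batchSize; // ceiling division
    }
}

public static class Program
{
    public static void Main()
    {
        // 10 pending rows with UpdateBatchSize = 4 -> requests of 4, 4, and 2 rows.
        Console.WriteLine(BatchMath.CountBatches(10, 4)); // prints 3
    }
}
```

A smaller batch size means more round trips but smaller requests, so the best value depends on your data and network and is worth measuring.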

Bulk INSERT

The following code prepares a single batch that inserts records in bulk. The example executes a batch INSERT of new DataRows, which have the "Added" state.

C#

RedshiftDataAdapter adapter = new RedshiftDataAdapter();

using (RedshiftConnection conn = new RedshiftConnection("User=admin;Password=admin;Database=dev;Server=examplecluster.my.us-west-2.redshift.amazonaws.com;Port=5439;")) {
  conn.Open();
  adapter.InsertCommand = conn.CreateCommand();
  adapter.InsertCommand.CommandText = "INSERT INTO \"sales_db\".\"public\".Orders (ShipCity) VALUES (@ShipCity)";
  adapter.InsertCommand.UpdatedRowSource = UpdateRowSource.None;
  adapter.InsertCommand.Parameters.Add("@ShipCity", "ShipCity");

  DataTable batchDataTable = new DataTable();
  batchDataTable.Columns.Add("ShipCity", typeof(string));
  batchDataTable.Rows.Add("Raleigh");
  batchDataTable.Rows.Add("New York");
  adapter.UpdateBatchSize = 2;
  adapter.Update(batchDataTable);
}

VB.NET

 
Dim adapter As New RedshiftDataAdapter()

Using conn As New RedshiftConnection("User=admin;Password=admin;Database=dev;Server=examplecluster.my.us-west-2.redshift.amazonaws.com;Port=5439;")
  conn.Open()
  adapter.InsertCommand = conn.CreateCommand()
  adapter.InsertCommand.CommandText = "INSERT INTO ""sales_db"".""public"".Orders (ShipCity) VALUES (@ShipCity)"
  adapter.InsertCommand.UpdatedRowSource = UpdateRowSource.None
  adapter.InsertCommand.Parameters.Add("@ShipCity", "ShipCity")

  Dim batchDataTable As New DataTable()
  batchDataTable.Columns.Add("ShipCity", GetType(String))
  batchDataTable.Rows.Add("New York")
  batchDataTable.Rows.Add("Raleigh")
  adapter.UpdateBatchSize = 2
  adapter.Update(batchDataTable)
End Using

Bulk Update

A batch update additionally requires the primary key of each row to update. The following example executes a batch for all DataRow records with a "Modified" state:

C#

RedshiftDataAdapter adapter = new RedshiftDataAdapter();

using (RedshiftConnection conn = new RedshiftConnection("User=admin;Password=admin;Database=dev;Server=examplecluster.my.us-west-2.redshift.amazonaws.com;Port=5439;")) { 
  conn.Open();
  adapter.UpdateCommand = conn.CreateCommand();
  adapter.UpdateCommand.CommandText = "UPDATE \"sales_db\".\"public\".Orders SET ShipCity=@ShipCity WHERE Id=@Id";
  adapter.UpdateCommand.Parameters.Add("@ShipCity", "ShipCity");
  adapter.UpdateCommand.Parameters.Add("@Id", "Id");
  adapter.UpdateCommand.UpdatedRowSource = UpdateRowSource.None; 
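  DataTable batchDataTable = new DataTable();
  batchDataTable.Columns.Add("Id", typeof(int)); // assumes an integer Id primary key
  batchDataTable.Columns.Add("ShipCity", typeof(string));
  // Placeholder rows standing in for existing records; use your own primary key values.
  batchDataTable.Rows.Add(1, "Raleigh");
  batchDataTable.Rows.Add(2, "New York");
  batchDataTable.AcceptChanges();
  // Editing a value moves the row to the "Modified" state.
  batchDataTable.Rows[0]["ShipCity"] = "Durham";
  batchDataTable.Rows[1]["ShipCity"] = "Boston";
  adapter.UpdateBatchSize = 2;
  adapter.Update(batchDataTable);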
}

VB.NET

 
Dim adapter As New RedshiftDataAdapter()

Using conn As New RedshiftConnection("User=admin;Password=admin;Database=dev;Server=examplecluster.my.us-west-2.redshift.amazonaws.com;Port=5439;")
  conn.Open()
  adapter.UpdateCommand = conn.CreateCommand()
  adapter.UpdateCommand.CommandText = "UPDATE ""sales_db"".""public"".Orders SET ShipCity=@ShipCity WHERE Id=@Id"
  adapter.UpdateCommand.Parameters.Add("@ShipCity", "ShipCity")
  adapter.UpdateCommand.Parameters.Add("@Id", "Id")
  adapter.UpdateCommand.UpdatedRowSource = UpdateRowSource.None
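  Dim batchDataTable As New DataTable()
  batchDataTable.Columns.Add("Id", GetType(Integer)) ' assumes an integer Id primary key
  batchDataTable.Columns.Add("ShipCity", GetType(String))
  ' Placeholder rows standing in for existing records; use your own primary key values.
  batchDataTable.Rows.Add(1, "New York")
  batchDataTable.Rows.Add(2, "Raleigh")
  batchDataTable.AcceptChanges()
  ' Editing a value moves the row to the "Modified" state.
  batchDataTable.Rows(0)("ShipCity") = "Boston"
  batchDataTable.Rows(1)("ShipCity") = "Durham"
  adapter.UpdateBatchSize = 2
  adapter.Update(batchDataTable)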
End Using

Bulk Delete

The following code prepares a single batch that deletes records in bulk; the primary key of each row is required. The example executes a batch for all DataRow records with a "Deleted" state:

C#

RedshiftDataAdapter adapter = new RedshiftDataAdapter();

using (RedshiftConnection conn = new RedshiftConnection("User=admin;Password=admin;Database=dev;Server=examplecluster.my.us-west-2.redshift.amazonaws.com;Port=5439;")) {
  conn.Open();
  adapter.DeleteCommand = conn.CreateCommand();
  adapter.DeleteCommand.CommandText = "DELETE FROM \"sales_db\".\"public\".Orders WHERE Id=@Id";
  adapter.DeleteCommand.Parameters.Add("@Id", "Id");
  adapter.DeleteCommand.UpdatedRowSource = UpdateRowSource.None; 
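  DataTable batchDataTable = new DataTable();
  batchDataTable.Columns.Add("Id", typeof(int)); // assumes an integer Id primary key
  // Placeholder rows standing in for existing records; use your own primary key values.
  batchDataTable.Rows.Add(1);
  batchDataTable.Rows.Add(2);
  batchDataTable.AcceptChanges();
  // Delete() moves each row to the "Deleted" state.
  batchDataTable.Rows[0].Delete();
  batchDataTable.Rows[1].Delete();
  adapter.UpdateBatchSize = 2;
  adapter.Update(batchDataTable);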
}

VB.NET

Dim adapter As New RedshiftDataAdapter()

Using conn As New RedshiftConnection("User=admin;Password=admin;Database=dev;Server=examplecluster.my.us-west-2.redshift.amazonaws.com;Port=5439;")
  conn.Open()
  adapter.DeleteCommand = conn.CreateCommand()
  adapter.DeleteCommand.CommandText = "DELETE FROM ""sales_db"".""public"".Orders WHERE Id=@Id"
  adapter.DeleteCommand.Parameters.Add("@Id", "Id")
  adapter.DeleteCommand.UpdatedRowSource = UpdateRowSource.None 
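  Dim batchDataTable As New DataTable()
  batchDataTable.Columns.Add("Id", GetType(Integer)) ' assumes an integer Id primary key
  ' Placeholder rows standing in for existing records; use your own primary key values.
  batchDataTable.Rows.Add(1)
  batchDataTable.Rows.Add(2)
  batchDataTable.AcceptChanges()
  ' Delete() moves each row to the "Deleted" state.
  batchDataTable.Rows(0).Delete()
  batchDataTable.Rows(1).Delete()
  adapter.UpdateBatchSize = 2
  adapter.Update(batchDataTable)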
End Using
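
Row States

The examples in this section select rows by their DataRow.RowState value: "Added" rows go to the InsertCommand, "Modified" rows to the UpdateCommand, and "Deleted" rows to the DeleteCommand. The sketch below uses only System.Data, with no Redshift connection, to show how a row reaches each state. The RowStateDemo class and the sample values are illustrative assumptions.

```csharp
using System;
using System.Data;

public static class RowStateDemo
{
    // Builds a two-row table and returns the RowState observed after each kind of change.
    public static DataRowState[] Run()
    {
        DataTable table = new DataTable();
        table.Columns.Add("Id", typeof(int));
        table.Columns.Add("ShipCity", typeof(string));

        table.Rows.Add(1, "Raleigh");
        table.Rows.Add(2, "New York");
        DataRowState added = table.Rows[0].RowState;      // Added

        table.AcceptChanges();                            // commits the rows locally
        DataRowState unchanged = table.Rows[0].RowState;  // Unchanged

        table.Rows[0]["ShipCity"] = "Durham";
        DataRowState modified = table.Rows[0].RowState;   // Modified

        table.Rows[1].Delete();
        DataRowState deleted = table.Rows[1].RowState;    // Deleted

        return new[] { added, unchanged, modified, deleted };
    }

    public static void Main()
    {
        Console.WriteLine(string.Join(", ", Run()));
        // prints "Added, Unchanged, Modified, Deleted"
    }
}
```

Rows in the "Unchanged" state are skipped by the adapter's Update method, which is why a table loaded from existing data must be edited or have rows deleted before a batch update or delete has anything to send.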

Copyright (c) 2024 CData Software, Inc. - All rights reserved.