SFTP Connector

Version 23.4.8839


SFTP Connector


The SFTP Connector transfers files to and from a remote SFTP server.

Overview

Each SFTP Connector connects uploads to a specified folder and downloads from a set of folders on the remote SFTP server.

Files that reach the SFTP Connector in the CData Arc flow are uploaded to the upload target folder, and files found in the download target folder(s) are downloaded and entered into the Arc flow. The connector supports upload and download filters to only send and receive specified file names and/or file extensions.

Connector Configuration

This section contains all of the configurable connector properties.

Settings Tab

Host Configuration

Settings related to establishing the connection to the remote SFTP host.

  • Connector Id The static, unique identifier for the connector.
  • Connector Type Displays the connector name and a description of what it does.
  • Connector Description An optional field to provide a free-form description of the connector and its role in the flow.
  • Remote Host The hostname or IP address of the SFTP server.
  • Remote Port The port on which to connect to the SFTP server.

Client Authentication

Settings related to authenticating to the SFTP server.

  • Authentication Mode The type of authentication to use with the SFTP server.
  • User The username for logging in to the SFTP server.
  • Password The password for logging in to the SFTP server.
  • Client Certificate The certificate with a private key to use during public key authentication. Only available when Authentication Mode is set to Public Key or Multi-Factor Authentication. Optionally, you can use the Create Certificate button to create a new certificate.
  • Certificate Password The password required to access the Client Certificate.

Server Authentication

Setting related to verifying the SFTP server’s identity.

  • Server Thumbprint The thumbprint of the certificate that identifies the SFTP server. You can also set this to a certificate file containing the server’s public key. When you test the connection, the connector prompts you to accept/trust the thumbprint presented by the server.

Upload

Setting related to uploading files to the SFTP server.

  • Remote Path Files processed by the connector are uploaded to this remote folder.

Download

Settings related to downloading from the SFTP server.

  • Remote Path The remote folder from which files are downloaded. Use a comma-delimited list to specify multiple paths.
  • File Mask A glob pattern that determines which files in the Remote Path should be downloaded (for example, *.txt).
  • Delete files (after received) Check this to delete successfully downloaded files from the SFTP server.

Caching

Settings related to caching and comparing files between multiple downloads.

  • File Size Comparison Check this to keep a record of downloaded file names and sizes. Previously downloaded files are skipped unless the file size is different than the last download.
  • Timestamp Comparison Check this to keep a record of downloaded file names and last-modified timestamps. Previously downloaded files are skipped unless the timestamp is different than the last download.

Note: When you enable caching, the file names are case-insensitive. For example, the connector cannot distinguish between TEST.TXT and test.txt.

Automation Tab

Automation Settings

Settings related to the automatic processing of files by the connector.

  • Upload Whether files arriving at the connector should automatically be uploaded.
  • Retry Interval The number of minutes before a failed upload is retried.
  • Max Attempts The maximum number of times the connector processes the input file. Success is measured based on a successful server acknowledgement. If you set this to 0, the connect retries the file indefinitely.
  • Download Whether the connector should automatically poll the remote download path(s) for files to download.
  • Download Interval The interval between automatic download attempts.
  • Minutes Past the Hour The minutes offset for an hourly schedule. Only applicable when the interval setting above is set to Hourly. For example, if this value is set to 5, the automation service downloads at 1:05, 2:05, 3:05, etc.
  • Time The time of day that the attempt should occur. Only applicable when the interval setting above is set to Daily, Weekly, or Monthly.
  • Day The day on which the attempt should occur. Only applicable when the interval setting above is set to Weekly or Monthly.
  • Minutes The number of minutes to wait before attempting the download. Only applicable when the interval setting above is set to Minute.
  • Cron Expression A five-position string representing a cron expression that determines when the attempt should occur. Only applicable when the interval setting above is set to Advanced.

Performance

Settings related to the allocation of resources to the connector.

  • Max Workers The maximum number of worker threads consumed from the threadpool to process files on this connector. If set, this overrides the default setting on the Settings > Automation page.
  • Max Files The maximum number of files sent by each thread assigned to the connector. If set, this overrides the default setting on the Settings > Automation page.

Alerts Tab

Settings related to configuring alerts and Service Level Agreements (SLAs).

Connector Email Settings

Before you can execute SLAs, you need to set up email alerts for notifications. Clicking Configure Alerts opens a new browser window to the Settings page where you can set up system-wide alerts. See Alerts for more information.

Service Level Agreement (SLA) Settings

SLAs enable you to configure the volume you expect connectors in your flow to send or receive, and to set the time frame in which you expect that volume to be met. CData Arc sends emails to warn the user when an SLA is not met, and marks the SLA as At Risk, which means that if the SLA is not met soon, it will be marked as Violated. This gives the user an opportunity to step in and determine the reasons the SLA is not being met, and to take appropriate actions. If the SLA is still not met at the end of the at-risk time period, the SLA is marked as violated, and the user is notified again.

To define an SLA, click Add Expected Volume Criteria.

  • If your connector has separate send and receive actions, use the radio buttons to specify which direction the SLA pertains to.
  • Set Expect at least to the minimum number of transactions (the volume) you expect to be processed, then use the Every fields to specify the time frame.
  • By default, the SLA is in effect every day. To change that, uncheck Everyday then check the boxes for the days of the week you want.
  • Use And set status to ‘At Risk’ to indicate when the SLA should be marked as at risk.
  • By default, notifications are not sent until an SLA is in violation. To change that, check Send an ‘At Risk’ notification.

The following example shows an SLA configured for a connector that expects to receive 1000 files every day Monday-Friday. An at-risk notification is sent 1 hour before the end of the time period if the 1000 files have not been received.

Advanced Tab

Server Commands

These options allow you to execute custom commands on the remote server at various times during connector execution.

  • After Connect Execute this command directly after connecting.
  • After Get Execute this command after downloading each file.
  • After Put Execute this command after uploading each file.
  • Before Get Execute this command before downloading each file.
  • Before Put Execute this command before uploading each file.

You can use the following commands:

  • onerror: Pass the fail argument to instruct the connector to treat errors that come from the custom scripts as errors for the overall transaction. Wthout fail the connector logs any exceptions and continues processing.
  • cd: Change directory.
  • mkdir: Create directory.
  • rn or move: Rename or move file.
  • del or rm: Delete or remove file.
  • rmdir: Remove directory.

CData Arc supports the following macros with these commands. They all use the following syntax: %Macro%. Some macros, such as %Date% and %SourceFilename%, do not require an argument, but others do. The macros that take an argument use the following syntax: %Macro:argument%.

Macro Description
MsgHeader:headername Evaluates to the header value of the message (for example: %MsgHeader:headername%).
SourceFilename Evaluates to the filename (extension included) of the file currently being processed.
SourceFilenameNoExt Evaluates to the filename (without the extension) of the file currently being processed.
Date Evaluates to the date the file was last modified in a yyyy-MM-dd format (for example, 2024-02-13).
DateTime Evaluates to the datetime the file was last modified in a yyyy-MM-dd’T’HH:mm:ss format (for example, 2024-02-13T08:14:22).
Time Evaluates to the time the file was last modified in a HH:mm:ss format (for example, 08:14:22).
DateTime.tz Evaluates to the datetime the file was last modified in a yyyy-MM-dd’T’HH:mm:ss zzz format (for example, 2024-02-13T08:14:22 PST).
Time.tz Evaluates to the time the file was last modified in a HH:mm:ss zzz format (for example, 08:14:22 PST).
DateFormat:format Evaluates to the current datetime of the system in the specified format (format). See Sample Date Formats for the available datetime formats.

Example: After downloading, you want to move each file from the download folder to a History subfolder. Use the After Get field to supply the move command with the %SourceFilename% macro.

Proxy Settings

These are a collection of settings that identify and authenticate to the proxy through which the SFTP connection should be routed. By default, this section uses the global settings on the Settings Page. Clear the checkbox to supply settings specific to your SFTP connector.

  • Proxy Type The protocol used by a proxy-based firewall.
  • Proxy Host The name or IP address of a proxy-based firewall.
  • Proxy Port The TCP port for a proxy-based firewall.
  • Proxy User The user name to use to authenticate with a proxy-based firewall.
  • Proxy Password A password used to authenticate to a proxy-based firewall.
  • Authentication Scheme Leave the default None or choose from one of the following authentication schemes: Basic, Digest, Proprietary, or NTLM.

Advanced Settings

Settings not included in the previous categories.

  • Connection Pooling Check this to have the connector utilize a pool to access and store connections. This is useful in scenarios where the server places restrictions on the frequency of logon attempts.
  • Download Excluded Extensions A comma-delimited list of file extensions that should not be downloaded by the connector.
  • Download Excluded Prefixes A comma-delimited list of prefixes that instructs the connector not to download filenames that begin with a specified prefix.
  • Max Download Files The maximum number of files to be downloaded from the SFTP server in one polling interval. Set to -1 to allow all files to be downloaded.
  • Minimum File Size The minimum file size to process. If a file being uploaded is smaller than the specified size, it is finalized as Skipped. For files being downloaded, the connector adds a transaction log entry indicating that the file was skipped because of its size.
  • Monitor File Growth Check this to have the application store the file sizes of the remote files between polling intervals, and only download files that do not change in size. This prevents the connector from downloading files that are still in the process of being written on the server.
  • Processing Delay The amount of time (in seconds) by which the processing of files placed in the Input folder is delayed. This is a legacy setting. Best practice is to use a File connector to manage local file systems instead of this setting.
  • Recurse Check this to download files in subfolders of the target remote path. The subdirectories are preserved for the received files.
  • Resume Downloads If enabled, when an error occurs during a file download, the connector stores the portion of the file that was downloaded and attempts to download the remainder of the file on the next download attempt.
  • Resume Uploads If enabled, when an error occurs during a file upload, the connector notes the number of bytes that were transferred and attempts to upload the remainder of the file on the next upload attempt.
  • SSH Encryption Algorithms A comma-delimited list of algorithms to use during SSH encryption. By default, all algorithms are enabled. Arc supports the following algorithms:

    aes256-ctr, aes256-cbc, aes192-ctr, aes192-cbc, aes128-ctr, aes128-cbc, 3des-ctr, 3des-cbc, cast128-cbc, blowfish-cbc, arcfour, arcfour128, arcfour256, aes128-gcm@openssh.com, aes256-gcm@openssh.com, chacha20-poly1305@openssh.com
  • Local File Scheme A scheme for assigning filenames to messages that are output by the connector. You can use macros in your filenames dynamically to include information such as identifiers and timestamps. For more information, see Macros.
  • Receive Filter A glob pattern filter to determine which files should be downloaded from the remote storage (for example, *.txt). You can use negative patterns to indicate files that should not be downloaded (for example, -*.tmp). Use this setting when you need multiple File Mask patterns. Multiple patterns can be separated by commas, with later filters taking priority except when an exact match is found.
  • Temp Download Extension A temporary file extension to append to file downloads while the transfer is in progress.
  • Temp Upload Extension A temporary file extension to append to file uploads while the transfer is in progress.
  • Temp Upload Prefix A temporary file prefix to prepend to file uploads while the transfer is in progress.
  • Temp Upload Path A temporary path on the remote SFTP server where files are uploaded. The file is moved into the permanent upload path after the transfer is complete.
  • Timeout The duration the connector waits for a connection response before throwing a timeout error.
  • Wait for Channel Close Check this to have the conector wait for the server to close the SSH channel before disconnecting. If timeouts occur during the logoff sequence, you should disable this setting.

Message

  • Save to Sent Folder Check this to copy files processed by the connector to the Sent folder for the connector.
  • Sent Folder Scheme Instructs the connector to group messages in the Sent folder according to the selected interval. For example, the Weekly option instructs the connector to create a new subfolder each week and store all messages for the week in that folder. The blank setting tells the connector to save all messages directly in the Sent folder. For connectors that process many messages, using subfolders helps keep messsages organized and improves performance.

Logging

  • Log Level The verbosity of logs generated by the connector. When you request support, set this to Debug.
  • Log Subfolder Scheme Instructs the connector to group files in the Logs folder according to the selected interval. For example, the Weekly option instructs the connector to create a new subfolder each week and store all logs for the week in that folder. The blank setting tells the connector to save all logs directly in the Logs folder. For connectors that process many transactions, using subfolders helps keep logs organized and improves performance.
  • Log Messages Check this to have the log entry for a processed file include a copy of the file itself. If you disable this, you might not be able to download a copy of the file from the Input or Output tabs.

Miscellaneous

Miscellaneous settings are for specific use cases.

  • Other Settings Enables you to configure hidden connector settings in a semicolon-separated list (for example, setting1=value1;setting2=value2). Normal connector use cases and functionality should not require the use of these settings.

Establishing a Connection

The following settings are required to connect to a remote SFTP server:

  • Remote Host
  • Remote Port
  • Authentication Mode
  • User
  • Password or Client Certificate/Certificate Password
  • Server Thumbprint

When testing the connection to the server, the connector prompts you to accept/trust the thumbprint of the server’s public key.

Uploading Files

The SFTP Connector uploads files from the Input Folder. Files are automatically uploaded to the SFTP server if Upload Automation is enabled in the connector.

Remote Path should be set to the folder on the remote SFTP server where files should be uploaded. The application changes directories to the specified folder after logging in to the server.

If any files could not be uploaded, the connector attempts to send them again after Retry Interval minutes have elapsed. This process continues until the Max Attempts setting is reached, after which the connector raises an error and the file is longer sent automatically.

Downloading Files

Downloaded files are displayed in the Output tab of the connector, or passed along to the next connector in the flow. Files are automatically downloaded from the SFTP server if Download Automation is enabled in the connector, and can be manually downloaded by clicking the Receive button on the Output tab.

Remote Path should be set to the folder on the remote SFTP server from which files should be downloaded. The application changes directories to the specified folder after logging in to the server. The File Mask property filters files so that only filenames matching the glob pattern are downloaded from the server.

You can use the Delete files (after download) setting to delete the files on the remote server after successfully downloading them. On the Output tab, click the Receive button to test the settings and retrieve the matching files from the server.

Subfolder Headers

If Recurse is checked on the Advanced tab, when receiving a file from the Remote Path in the SFTP connector:

  • The message for the file that is received contains a Subfolder header in its metadata.
  • This header contains the subfolder, relative to the Remote Path in the SFTP connector, that the file was received from.
  • This subfolder header is supported by other connectors that support a subfolder header in Send operations.

When sending files, if a Subfolder header is present on the message that is sent to the Upload Path in the SFTP connector:

  • The file is placed in the subfolder, relative to the Upload Path in the SFTP connector, that is specified in the subfolder header.
  • The subfolder is created if permissions are available.

Macros

Using macros in file naming strategies can enhance organizational efficiency and contextual understanding of data. By incorporating macros into filenames, you can dynamically include relevant information such as identifiers, timestamps, and header information, providing valuable context to each file. This helps ensure that filenames reflect details important to your organization.

CData Arc supports these macros, which all use the following syntax: %Macro%.

Macro Description
ConnectorID Evaluates to the ConnectorID of the connector.
Ext Evaluates to the file extension of the file currently being processed by the connector.
Filename Evaluates to the filename (extension included) of the file currently being processed by the connector.
FilenameNoExt Evaluates to the filename (without the extension) of the file currently being processed by the connector.
MessageId Evaluates to the MessageId of the message being output by the connector.
RegexFilename:pattern Applies a RegEx pattern to the filename of the file currently being processed by the connector.
Header:headername Evaluates to the value of a targeted header (headername) on the current message being processed by the connector.
LongDate Evaluates to the current datetime of the system in long-handed format (for example, Wednesday, January 24, 2024).
ShortDate Evaluates to the current datetime of the system in a yyyy-MM-dd format (for example, 2024-01-24).
DateFormat:format Evaluates to the current datetime of the system in the specified format (format). See Sample Date Formats for the available datetime formats
Vault:vaultitem Evaluates to the value of the specified vault item.

Examples

Some macros, such as %Ext% and %ShortDate%, do not require an argument, but others do. All macros that take an argument use the following syntax: %Macro:argument%

Here are some examples of the macros that take an argument:

  • %Header:headername%: Where headername is the name of a header on a message.
  • %Header:mycustomheader% resolves to the value of the mycustomheader header set on the input message.
  • %Header:ponum% resolves to the value of the ponum header set on the input message.
  • %RegexFilename:pattern%: Where pattern is a regex pattern. For example, %RegexFilename:^([\w][A-Za-z]+)% matches and resolves to the first word in the filename and is case insensitive (test_file.xml resolves to test).
  • %Vault:vaultitem%: Where vaultitem is the name of an item in the vault. For example, %Vault:companyname% resolves to the value of the companyname item stored in the vault.
  • %DateFormat:format%: Where format is an accepted date format (see Sample Date Formats for details). For example, %DateFormat:yyyy-MM-dd-HH-mm-ss-fff% resolves to the date and timestamp on the file.

You can also create more sophisticated macros, as shown in the following examples:

  • Combining multiple macros in one filename: %DateFormat:yyyy-MM-dd-HH-mm-ss-fff%%EXT%
  • Including text outside of the macro: MyFile_%DateFormat:yyyy-MM-dd-HH-mm-ss-fff%
  • Including text within the macro: %DateFormat:'DateProcessed-'yyyy-MM-dd_'TimeProcessed-'HH-mm-ss%