Amazon S3
Amazon S3 or Amazon Simple Storage Service is a service offered by Amazon Web
Services that provides object storage through a web service interface. Amazon
S3 uses the same scalable storage infrastructure that Amazon.com uses to run
its global e-commerce network.
More Info on AWS
Authentication Connection
Authentication connectors are used to authenticate repository/output connections that require authentication fields such as access tokens or refresh tokens. Click here for more information on setting up authentication connections.
- Authentication Configuration
- Proxy Information
Authentication Configuration Fields
- Name: Unique name for this auth connector.
- Client ID: The AWS Access Key ID used to connect. For more information about AWS Access Keys, please visit this link.
- Client Secret: The Secret Access Key associated with the Access Key above.
- S3 Region: The AWS Region where your instance is located. You can find it in the AWS console. Defaults to us-east-1.
- End Point: If using Amazon Glacier, set your instance's URL here. When set, it overrides the region.
- Connection Timeout: Set the connection timeout. Higher values may be needed when moving large files.
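For illustration, here is a hypothetical authentication configuration using the example credentials published in the AWS documentation (replace every value with your own):
Name: s3-production-auth
Client ID: AKIAIOSFODNN7EXAMPLE
Client Secret: wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY
S3 Region: us-west-2
End Point: (leave blank unless overriding the region)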
Installed AWS Credentials
If you leave the Client ID and Client Secret empty, Simflofy will attempt to authenticate with your installed AWS credentials.
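For reference, installed credentials typically live in ~/.aws/credentials in the following format (the keys shown are AWS documentation examples). Whether other sources in the AWS SDK default credential chain, such as environment variables or instance roles, are also consulted is an assumption about the implementation:
[default]
aws_access_key_id = AKIAIOSFODNN7EXAMPLE
aws_secret_access_key = wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY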
Proxy Information Fields
This tab is optional and only needed if you are connecting through a proxy. All fields may be left blank if no proxy is used.
- Proxy User: The proxy user to use.
- Proxy Password: The password for the proxy user.
- Proxy Protocol: The HTTP(S) protocol to use to connect to the proxy.
- Full Proxy Url: The proxy host.
- Proxy Port: The port to connect to on the proxy.
- Proxy Domain: The domain for the proxy.
- Proxy Workstation: The workstation to use.
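For illustration, a hypothetical proxy configuration routing traffic through a corporate proxy (all values are placeholders):
Proxy Protocol: https
Full Proxy Url: proxy.example.com
Proxy Port: 8080
Proxy User: svc-simflofy
Proxy Domain: CORP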
Discovery Connector
note
There is no Discovery Instance Connector available for Amazon S3.
Integration Connection
Most Integration Connections can act in both repository (read) and output (write) modes. If a connection does not support a mode, it will not appear as an option for that mode when creating or editing a job. The Amazon S3 connection can be used as both a repository and an output connection, as reflected in the job configuration tabs below. Click here for more information on setting up an integration connection.
Integration Connection Fields
- Description: A description for this connection.
- Authentication Connection: Your Amazon S3 authentication connection.
Job Configuration
- Folders (Repo)
- Basic Configuration (Output)
- Advanced Configuration (Output)
Specification Tab: S3 Folders (Repo)
- List of S3 Keys: A comma-delimited list of S3 keys (folders) to crawl.
- Bucket Name: The bucket where the keys are located.
- Retrieve File Tags: File tags will be added as metadata with the prefix "tag."
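For illustration, a hypothetical folder specification that crawls two prefixes in a single bucket (values are placeholders):
Bucket Name: example-archive-bucket
List of S3 Keys: invoices/2021/,reports/quarterly/
With Retrieve File Tags checked, an object tag such as Department=HR would arrive as a metadata field named tag.Department.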
Specification Tab: S3 Basic Configuration (Output)
tip
There are no actual folders in S3. All files in S3 have a "key", which includes their entire path. The folder path and bucket properties simply prepend these values to each file's key (see the worked example after the field list below).
- Output Folder Path: Output folder key. Will be prepended to all document parent paths to make keys.
- Bucket Name: The bucket name that will be prepended to all keys.
- Includes Unmapped Properties: Will apply all metadata on the document without mapping.
- Use GZip: Sets whether gzip decompression should be used when receiving HTTP responses.
- Do not generate XML when Outputting to S3: Like the BFS Connector, the S3 Connector outputs metadata as separate files in the form of [filename].metadata.properties.xml. Check this box if you want it to output only the content files.
- Use Transfer Manager: If migrating larger files, the S3 APIs offer a Transfer Manager to ensure more stable uploads.
- Stage Binary to Filesystem: To avoid issues with disconnects from the source, this will temporarily store file content in the Tomcat temp folder before uploading it.
- Date/DateTime Format: How to format the mapped fields of this type before upload.
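To illustrate how keys are built, here is a sketch with hypothetical values (the exact handling of leading and trailing slashes may vary):
Bucket Name: example-output-bucket
Output Folder Path: migrated/2023
Document parent path and name: /HR/policies/handbook.pdf
Resulting object key: migrated/2023/HR/policies/handbook.pdf
Stored at: s3://example-output-bucket/migrated/2023/HR/policies/handbook.pdf
Unless Do not generate XML when Outputting to S3 is checked, a sibling metadata object named handbook.pdf.metadata.properties.xml is written alongside the content object.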
important
If migrating large files to S3, it is recommended that you check both Use Transfer Manager AND Stage Binary to Filesystem. If you use the Transfer Manager without staging the file, all file uploads will be single-threaded by the Transfer Manager.
Specification Tab: S3 Advanced Configuration (Output)
- Max Connections: The maximum number of connections the client can open. Adjusting this can affect performance.
- Multi-value Separator: Some documents have fields that contain multiple values. S3 does not support multi-valued metadata, so this separator is used to join the values into a single string before upload (see the example after this list).
- Encrypt Object Server Side: Will encrypt uploaded files using AES-256 encryption.
- Disable Chunked Encoding: Will remove the transfer-encoding:chunked header from all requests.
- Set Path Style Access: Refer to Amazon's page for more information on this option.
- Object Metadata Fields: A comma-delimited list of fields to add to the S3 object as user metadata.
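For example (hypothetical field and values), with the Multi-value Separator set to a pipe character:
Source field: keywords = [finance, 2021, audited]
Multi-value Separator: |
Uploaded as: keywords = finance|2021|audited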
Content Service Connection
This section covers the S3-specific configuration of the Content Service Connector. For a description of how to set up a content service connector generically, see Content Service Connectors.
Configuration Fields
S3 Document Ids
S3 file ids always take the form of /bucket/(key).
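For example, an object with the key archive/testdoc.txt in the bucket test-bucket has the document ID:
/test-bucket/archive/testdoc.txt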
- Bucket Name: The target bucket for creating a file.
- Output Folder Path: The key of the folder to target when creating a file.
- ACL Name: Canned ACL to add to all new content uploaded via this connection.
- Content Disposition: Default disposition of any content added via this connection. Will be added to the object's metadata.
Supported Methods
- Create File - Will take a full /bucket/key as the folderId parameter in place of the bucket and folder configuration
- Delete Object by ID
- Get File Content
- Get Object Properties
- Update File
- Update Properties
- List Folder Items (3.1.1+)
- Get ACLs
- Set ACLs - Special (see below)
- Delete ACL
S3 Access Control
See this page for information on grantees and permissions.
ACL Examples (3.1.1+)
- Get Permissions
- Set Permissions
- Delete Permissions
GET /api/repo/s3/acls?id=/test-bucket/archive/testdoc.txt
{
"success": true,
"results": [
"7cfbdbb50b0682227896f2b416777d4d74906ded4df472db3ace75768962c134:(adminuser):FULL_CONTROL"
]
}
The value before the first colon is the canonical ID of the user. It can be used to remove or update permissions for that user.
Groups use a URL instead of a canonical ID, such as http://acs.amazonaws.com/groups/global/AllUsers. They will appear as:
<url>:(Group):<Permission>
POST /api/repo/s3/acls?id=/test-bucket/archive/testdoc.txt
To add a user to a document, you can use their canonical ID or email address.
The request requires a JSON body in the following format:
{"7cfb11150b0682227896f2b416777d4d74906ded4df472db3ace75769062c134":"READ"}
or
{"testuser@gmail.com":"READ"}
which will result in
{
"success": true,
"results": [
"7cfbdbb50b0682227896f2b416777d4d74906ded4df472db3ace75768962c134:(adminuser):FULL_CONTROL",
"7cfb11150b0682227896f2b416777d4d74906ded4df472db3ace75769062c134:(testuser):READ"
]
}
To add a group, you'll need the group's URI, such as:
{"http://acs.amazonaws.com/groups/s3/LogDelivery":"WRITE"}
resulting in
{
"success": true,
"results": [
"7cfbdbb50b0682227896f2b416777d4d74906ded4df472db3ace75768962c134:(adminuser):FULL_CONTROL",
"http://acs.amazonaws.com/groups/s3/LogDelivery:(Group):WRITE"
]
}
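For reference, here is the same Set Permissions call as a curl sketch. The host is a placeholder and any required authentication headers are omitted, since they depend on your Simflofy deployment:
curl -X POST "https://<simflofy-host>/api/repo/s3/acls?id=/test-bucket/archive/testdoc.txt" -H "Content-Type: application/json" -d '{"testuser@gmail.com":"READ"}'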
DELETE /api/repo/s3/acls?id=/test-bucket/archive/testdoc.txt&aclId=7cfb11150b0682227896f2b416777d4d74906ded4df472db3ace75769062c134
The aclId parameter can be either the canonical ID of a user or the URL of a group.
The response will simply be the aclId, but a follow-up GET call will produce:
{
"success": true,
"results": [
"7cfbdbb50b0682227896f2b416777d4d74906ded4df472db3ace75768962c134:(adminuser):FULL_CONTROL"
]
}
Need help integrating S3? We can help.