Introduction

This document provides an overview and the functional specifications of the Amazon S3 Adapter.

Amazon S3 is a public cloud object-storage service. In Amazon S3, data is stored as objects, which are the fundamental storage unit. An object consists of data and the metadata that describes it. Objects are organized into buckets: just as folders contain files, buckets contain objects. A single object can be up to 5 terabytes in size. S3 buckets are located in the geographic regions served by Amazon Web Services (AWS).

The Amazon S3 adapter fetches objects from S3 buckets and sends data to S3 buckets to be stored as objects. Before it can access an S3 bucket, the adapter must be authenticated with an AWS access key and secret key that carry the required permissions on that bucket. AWS identifies every geographic region by a region code (for example, us-east-1). When specifying a region in the adapter properties, enter the code exactly as AWS defines it.
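
The connection requirements above (an access key, a secret key, and an exactly formatted region code) can be illustrated with a minimal Python sketch. The class name S3AdapterSettings, its field names, and the region pattern are assumptions made for illustration and are not part of the adapter. The pattern is only a heuristic for the general shape of AWS region codes, not an official AWS rule. make_client assumes the boto3 package is installed.

```python
import re
from dataclasses import dataclass

# Heuristic shape of an AWS region code such as "us-east-1" or "ap-southeast-2".
# This pattern is an illustrative assumption, not an official AWS definition.
_REGION_PATTERN = re.compile(r"[a-z]{2}(-[a-z]+){1,2}-\d+")


@dataclass(frozen=True)
class S3AdapterSettings:
    """Hypothetical container for the adapter's connection properties."""
    access_key: str
    secret_key: str
    region: str
    bucket: str

    def __post_init__(self):
        if not self.access_key or not self.secret_key:
            raise ValueError("both an access key and a secret key are required")
        if not _REGION_PATTERN.fullmatch(self.region):
            raise ValueError(f"region {self.region!r} is not in region-code format")


def make_client(settings):
    """Build a boto3 S3 client from the settings (requires the boto3 package)."""
    import boto3  # imported lazily so the validation above works without boto3
    return boto3.client(
        "s3",
        aws_access_key_id=settings.access_key,
        aws_secret_access_key=settings.secret_key,
        region_name=settings.region,
    )
```

With this sketch, S3AdapterSettings("my-key", "my-secret", "us-east-1", "my-bucket") is accepted, while a display name such as "US East (N. Virginia)" or a variant such as "us_east_1" raises ValueError before any request is sent.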

The Amazon S3 adapter does not support the predefined-schema-import option. An AWS Identity and Access Management (IAM) user can be created in AWS to access the S3 services. An IAM user represents a person or application that interacts with AWS, and it is not the same as the AWS account root user. For more information, see https://docs.aws.amazon.com/IAM/latest/UserGuide/id_users.html.

Amazon S3 adapter as a source

  • Reads an Object
  • Reads and deletes an Object
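
The two source operations can be sketched in Python as follows. This is a minimal illustration, not the adapter's implementation. The function name read_object and the delete_after_read flag are hypothetical, and client is assumed to behave like a boto3 S3 client (get_object and delete_object).

```python
def read_object(client, bucket, key, delete_after_read=False):
    """Return the object's bytes and optionally delete the object afterwards.

    `client` is assumed to behave like a boto3 S3 client.
    """
    response = client.get_object(Bucket=bucket, Key=key)
    data = response["Body"].read()
    if delete_after_read:
        # Delete only after the body has been read in full, so a failed
        # read never removes the object from the bucket.
        client.delete_object(Bucket=bucket, Key=key)
    return data
```

Calling read_object(client, "my-bucket", "reports/a.csv") corresponds to a plain read, and passing delete_after_read=True corresponds to read and delete. The bucket and key names here are placeholders.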

Amazon S3 adapter as a target

  • Creates an Object
  • Overwrites an Object
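
The two target operations can be sketched in Python as follows. In S3, writing to a key that does not yet exist creates an object, and writing to an existing key replaces it. The function names object_exists and write_object and the allow_overwrite flag are hypothetical, and client is assumed to behave like a boto3 S3 client (list_objects_v2 and put_object). The existence check is a simple illustration and is not atomic: another writer could create the key between the check and the write.

```python
def object_exists(client, bucket, key):
    """Return True if `key` is present in `bucket` (exact match on the key)."""
    # For general purpose buckets, keys are listed in lexicographic order,
    # so an exact match, if present, is the first key with this prefix.
    response = client.list_objects_v2(Bucket=bucket, Prefix=key, MaxKeys=1)
    return any(item["Key"] == key for item in response.get("Contents", []))


def write_object(client, bucket, key, data, allow_overwrite=True):
    """Store `data` under `key` and report whether it was created or overwritten."""
    existed = object_exists(client, bucket, key)
    if existed and not allow_overwrite:
        raise FileExistsError(f"{bucket}/{key} already exists")
    # put_object creates the object, or replaces it if the key already exists.
    client.put_object(Bucket=bucket, Key=key, Body=data)
    return "overwritten" if existed else "created"
```

Returning "created" or "overwritten" is a design choice of this sketch, made so that a caller can log which of the two target operations took place.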