-
Create an S3 account. You can submit a ticket for CTek in Jira to request an account. Please reach out if you would like an example of a ticket.
-
Install required software:
a. AWS CLI: instructions to install are here. We won’t actually be using this, but it might become useful down the road if you want to interface with AWS through your command line.
b. Chocolatey (only needed if you are on Windows): I followed these instructions for non-admin install because I don’t have a pr account and they worked for me, but feel free to install however you like!
c. saml2aws: you can install this using Chocolatey if you are on Windows. Installation instructions can be found in the ‘Install’ section of the README. For Linux HPCs, change the ~/.local/bin/saml2aws to /home/<username>/ and run all commands using /home/<username>/saml2aws <command>.
d. boto3: Python package that can be installed using conda or pip.
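Once everything is installed, a quick check like the one below can confirm that the tools are visible from Python. This is only a sketch: the function names and the `extra_dirs` argument are made up for illustration, and it assumes the saml2aws binary is either on your PATH or in a directory you pass in (for example your home directory on a Linux HPC).

```python
import importlib.util
import shutil
from pathlib import Path


def find_saml2aws(extra_dirs=()):
    """Return the path to the saml2aws executable, or None if it is not found.

    Looks on PATH first, then in each directory in extra_dirs (hypothetical
    argument, e.g. a home directory where the binary was placed by hand).
    """
    found = shutil.which("saml2aws")
    if found:
        return found
    for directory in extra_dirs:
        candidate = shutil.which("saml2aws", path=str(directory))
        if candidate:
            return candidate
    return None


def check_setup(extra_dirs=(Path.home(),)):
    """Report where saml2aws lives and whether boto3 can be imported."""
    return {
        "saml2aws": find_saml2aws(extra_dirs),
        "boto3": importlib.util.find_spec("boto3") is not None,
    }


if __name__ == "__main__":
    for name, status in check_setup().items():
        print(f"{name}: {status if status else 'NOT FOUND'}")
```

If saml2aws shows up as NOT FOUND on an HPC, passing the directory that holds the binary in `extra_dirs` is one way to confirm it is where you expect.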
-
Create an S3 bucket. You can create a bucket using the AWS CLI, or follow these instructions to use the web GUI:
a. Log in to the AWS web GUI here. You will want to select the option to log in to the dev environment with a developer role.
b. Find the S3 Storage Service in the list of AWS services and click on it.
c. Click the orange Create bucket button.
d. Fill in the required information and create the bucket. Enable versioning if you would like to track versions of your data files. You must have a tag added with the key set to wma:project_id and the value set to a billable project in this list.
e. After you have created your bucket, you can add any sub-directories that you want to use with the web GUI as well. Note: You have read/write access to all of the buckets in S3 by default, so be careful not to tamper with other users' buckets.
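If you would rather script the bucket creation than click through the web GUI, the same steps could look roughly like this with boto3. Treat it as a sketch under assumptions: the helper names are invented here, the bucket name, project ID, region, and profile name are all placeholders you would supply, and it needs the credentials from the next step to be in place first. It also applies the tag right after creation, which may or may not satisfy your account's tagging policy.

```python
def create_tagged_bucket(s3, bucket, project_id, region, versioning=False):
    """Create a bucket, attach the wma:project_id tag, optionally enable versioning.

    `s3` is a boto3 S3 client (or any object exposing the same three methods).
    """
    kwargs = {"Bucket": bucket}
    # us-east-1 is the S3 default region and must not be sent as a LocationConstraint.
    if region != "us-east-1":
        kwargs["CreateBucketConfiguration"] = {"LocationConstraint": region}
    s3.create_bucket(**kwargs)

    # The tag key comes from step d above; the value must be a billable project.
    s3.put_bucket_tagging(
        Bucket=bucket,
        Tagging={"TagSet": [{"Key": "wma:project_id", "Value": project_id}]},
    )

    if versioning:
        s3.put_bucket_versioning(
            Bucket=bucket,
            VersioningConfiguration={"Status": "Enabled"},
        )


def make_s3_client(profile_name, region):
    """Build an S3 client from a named profile in your AWS credentials file."""
    import boto3  # imported lazily so the helper above has no hard dependency

    session = boto3.Session(profile_name=profile_name)
    return session.client("s3", region_name=region)
```

A call might then look like `create_tagged_bucket(make_s3_client("saml", "us-west-2"), "my-example-bucket", "my-project-id", "us-west-2", versioning=True)`, where every argument is a stand-in for your own values.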
-
Set up your S3 credentials on your local computer for programmatic access.
a. Make sure you are connected through VPN.
b. Open your command line, and configure your saml2aws by typing saml2aws configure. This will prompt you with a series of questions, and the responses will be stored in a file called ~\.saml2aws. Here is a screenshot of my configuration:
c. Create a new temporary access token by typing saml2aws login. Choose the option to work in the dev environment with the developer access. The attached notebook will execute the shell command to generate credentials automatically using saml2aws login. This will store a temporary set of credentials in your ~\.aws\credentials file. Note that you will have to regenerate this token every hour by doing a saml2aws login, unless you change the aws_session_duration in your ~\.saml2aws file.
-
You are ready to do the tutorial!
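Before starting, it may be worth confirming that saml2aws login actually wrote credentials to disk. The sketch below reads the credentials file with Python's standard configparser and checks that a profile contains the three keys a temporary session needs. The profile name "saml" is an assumption (it is the usual saml2aws default, but yours may differ depending on how you answered saml2aws configure), and the function name is made up for illustration. It only checks that the keys are present, not that the token is still valid.

```python
import configparser
from pathlib import Path

# Keys a temporary AWS session needs in the credentials file.
REQUIRED_KEYS = ("aws_access_key_id", "aws_secret_access_key", "aws_session_token")


def profile_has_credentials(credentials_path, profile="saml"):
    """Return True if `profile` in the credentials file has all session keys.

    `profile` defaults to "saml", which is assumed here rather than taken
    from this guide; pass your own profile name if it differs.
    """
    # Interpolation is disabled so a '%' in a value cannot raise an error.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read(credentials_path)  # a missing file is silently ignored
    if not parser.has_section(profile):
        return False
    return all(
        parser.get(profile, key, fallback="").strip() for key in REQUIRED_KEYS
    )


if __name__ == "__main__":
    path = Path.home() / ".aws" / "credentials"
    if profile_has_credentials(path):
        print(f"Found session credentials in {path}")
    else:
        print(f"No complete credentials in {path}; try running saml2aws login")
```

If the check fails right after a successful login, the most likely cause is a different profile name, which you can pass as the second argument.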