1 Introduction

sevenbridges is an R/Bioconductor package that provides an interface for Seven Bridges public API. The supported platforms includes the Seven Bridges Platform, Cancer Genomics Cloud (CGC), and Cavatica.

Learn more from our documentation on the Seven Bridges Platform and the Cancer Genomics Cloud (CGC).

1.1 R Client for Seven Bridges API

The sevenbridges package only supports v2+ versions of the API, since versions prior to V2 are not compatible with the Common Workflow Language (CWL). This package provides a simple interface for accessing and trying out various methods.

There are two ways of constructing API calls. For instance, you can use low-level API calls which use arguments like path, query, and body. These are documented in the API reference libraries for the Seven Bridges Platform and the CGC. An example of a low-level request to “list all projects” is shown below. In this request, you can also pass query and body as a list.

library("sevenbridges")
a <- Auth(token = "your_token", platform = "aws-us")
a$api(path = "projects", method = "GET")

(Advanced user option) The second way of constructing an API request is to directly use the httr package to make your API calls, as shown below.

a$project()

1.2 API General Information

Before we start, keep in mind the following:

offset and limit

Every API call accepts two arguments named offset and limit.

Offset defines where the retrieved items started.
Limit defines the number of items you want to get.

By default, offset is set to 0 and limit is set to 100. As such, your API request returns the first 100 items when you list items or search for items by name. To search and list all items, use complete = TRUE in your API request.

Search by ID

When searching by ID, your request will return your exact resource as it is unique. As such, you do not have to set offset and limit manually. It is a good practice to find your resources by their ID and pass this ID as an input to your task. You can find a resource’s ID in the final part of the URL on the visual interface or via the API requests to list resources or get a resource’s details.

Search by name

Search by name returns all partial matches unless you specify exact = TRUE.

1.3 Installation

The sevenbridges package is available on both the release and devel branch from Bioconductor.

To install it from the release branch, use:

install.packages("BiocManager")
BiocManager::install("sevenbridges")

To install it from the devel branch, use:

install.packages("BiocManager")
BiocManager::install("sevenbridges", version = "devel")

Since we are constantly improving our API and client libraries, please also visit our GitHub repository for the most recent news and the latest version of the package.

If you do not have devtools

This installation requires that you have the devtools package. If you do not have this package, you can install it from CRAN.

install.packages("devtools")

You may get an error for missing system dependencies such as curl and openssl. For example, in Ubuntu, you probably need to do the following first to install devtools and to build vignettes since you need pandoc.

apt-get update
apt-get install libcurl4-gnutls-dev libssl-dev pandoc pandoc-citeproc

If devtools is already installed

Install the latest version for sevenbridges from GitHub with the following:

install.packages("BiocManager")
install.packages("readr")

devtools::install_github(
  "sbg/sevenbridges-r",
  repos = BiocManager::repositories(),
  build_vignettes = TRUE, dependencies = TRUE
)

If you have trouble with pandoc and do not want to install it, set build_vignettes = FALSE to avoid the vignettes build.

2 Quickstart

For more details about how to use the API client in R, please consult the Seven Bridges API Reference section below for a complete guide.

2.1 Create `Auth` Object

Before you can access your account via the API, you have to provide your credentials. You can obtain your credentials in the form of an “authentication token” from the Developer Tab under Account Settings on the visual interface. Once you’ve obtained this, create an Auth object, so it remembers your authentication token and the path for the API. All subsequent requests will draw upon these two pieces of information.

Let’s load the package first:

library("sevenbridges")

You have three different ways to provide your token. Choose from one method below:

Direct authentication. This explicitly and temporarily sets up your token and platform type (or alternatively, API base URL) in the function call arguments to Auth().
Authentication via system environment variables. This will read the credential information from two system environment variables: SB_API_ENDPOINT and SB_AUTH_TOKEN.
Authentication via the user configuration file. This file, by default $HOME/.sevenbridges/credentials, provides an organized way to collect and manage all your API authentication information for Seven Bridges platforms.

Method 1: Direct authentication

This is the most common method to construct the Auth object. For example:

(a <- Auth(platform = "cgc", token = "your_token"))

Using platform: cgc
== Auth ==
url : https://cgc-api.sbgenomics.com/v2/
token : <your_token>

Method 2: Environment variables

To set the two environment variables in your system, you could use the function sbg_set_env(). For example:

sbg_set_env("https://cgc-api.sbgenomics.com/v2", "your_token")

Note that this change might be just temporary, please feel free to use the standard method to set persistent environment variables according to your operating system.

Create an Auth object:

a <- Auth(from = "env")

Method 3: User configuration file

Assume we have already created the configuration file named credentials under the directory $HOME/.sevenbridges/:

[aws-us-rfranklin]
api_endpoint = https://api.sbgenomics.com/v2
auth_token = token_for_this_user

# This is a comment:
# another user on the same platform
[aws-us-rosalind-franklin]
api_endpoint = https://api.sbgenomics.com/v2
auth_token = token_for_this_user

[default]
api_endpoint = https://cgc-api.sbgenomics.com/v2
auth_token = token_for_this_user

[gcp]
api_endpoint = https://gcp-api.sbgenomics.com/v2
auth_token = token_for_this_user

To load the user profile aws-us-rfranklin from this configuration file, simply use:

a <- Auth(from = "file", profile_name = "aws-us-rfranklin")

If profile_name is not specified, we will try to load the profile named [default]:

a <- Auth(from = "file")

Note: API paths (base URLs) differ for each Seven Bridges environment. Be sure to provide the correct path for the environment you are using. API paths for some of the environments are:

Platform Name	API Base URL	Short Name
Seven Bridges Platform (US)	`https://api.sbgenomics.com/v2`	`"aws-us"`
Seven Bridges Platform (EU)	`https://eu-api.sbgenomics.com/v2`	`"aws-eu"`
Seven Bridges Platform (China)	`https://api.sevenbridges.cn/v2`	`"ali-cn"`
Cancer Genomics Cloud (CGC)	`https://cgc-api.sbgenomics.com/v2`	`"cgc"`
Cavatica	`https://cavatica-api.sbgenomics.com/v2`	`"cavatica"`
BioData Catalyst Powered by Seven Bridges	`https://api.sb.biodatacatalyst.nhlbi.nih.gov/v2`	`"f4c"`

Please refer to the API reference section for more usage and technical details about the three authentication methods.

Complete Guide for Seven Bridges API R Client

2024-10-29

1 Introduction

1.1 R Client for Seven Bridges API

1.2 API General Information

1.3 Installation

2 Quickstart

2.1 Create Auth Object

2.2 Get User Information

2.3 Rate Limit

2.4 Show Billing Information

2.5 Create Project

2.6 Get Details about Existing Project

2.7 Copy Public Apps into Your Project

2.8 Import CWL App and Run a Task

2.9 Execute a New Task

2.9.1 Find your app inputs

2.9.2 Get your input files ready

2.9.3 Create a new draft task

2.9.4 Draft a batch task

2.10 Run a Task

2.11 Run tasks using spot instances

2.12 Execution hints per task run

2.13 Task Monitoring

3 Seven Bridges API Reference

3.1 Authentication

3.1.1 Direct authentication

3.1.2 Authentication via system environment variables

3.1.3 Authentication via user configuration file

3.2 List All API Calls

3.3 Offset, Limit, Search, and Advance Access Features

3.3.1 offset and limit

3.3.2 Search by ID

3.3.3 Search by name

3.3.4 Experiment with Advance Access features

3.4 Query Parameter 'fields'

3.5 Rate Limits

3.6 Users

3.7 Billing Group and Invoices

3.7.1 For billing

3.7.2 For invoices

3.8 Project

3.8.1 List all projects

3.8.2 Partial match project name

3.8.3 Filter by project creation date, modification date, and creator

3.8.4 Create a new project

3.8.5 Create a new project with TCGA controlled data on CGC

3.8.6 Delete a project

3.8.7 Update/edit a project

3.8.8 Project member

3.8.8.1 List members

3.8.8.2 Add a member

3.8.8.3 Update a member

3.8.8.4 Delete a member

3.8.9 List all files

3.9 Files, Metadata, and Tags

3.9.1 List all files

3.9.2 Search and filter file(s)

3.9.2.1 Rule of thumb

3.9.2.2 Search by name and id

3.9.2.3 Search by metadata

3.9.2.4 Search by tags

3.9.2.5 Search by original task id

3.9.3 Copy a file or group of files

3.9.4 Delete file(s)

3.9.5 Download files

3.9.6 Upload files via API

3.9.6.1 Upload single file

3.9.6.2 Upload a folder

3.9.6.3 Upload a list of files

3.9.6.4 Upload files via a defined manifest file

3.9.7 Upload files via command line uploader

3.9.8 Update a file

3.9.9 Metadata operations

3.9.10 Tag file(s)

3.10 Folders

3.10.1 Get project root folder

3.10.2 Create a folder

3.10.3 Copy files between folders

3.10.4 Move files between folders

2.1 Create `Auth` Object

3.3.1 `offset` and `limit`

3.4 Query Parameter `'fields'`