Chapter 2 Analysing TARA Oceans carbon export data using Tercen

This document describes a walkthrough example of data analysis and visualisation using Tercen. We will analyse data from the TARA Oceans project, coming from this publication: Guidi et al. (2016):

Guidi, L., Chaffron, S., Bittner, L., Eveillard, D., Larhlimi, A., Roux, S., … & Coelho, L. P. (2016). Plankton networks driving carbon export in the oligotrophic ocean. Nature, 532(7600), 465-470.

Our goal is to look at how eukaryotic plankton lineages are associated to environmental parameters such as carbon export. To do so, we will:

  • create a new project in Tercen

  • upload the dataset containing the correlation of different lineages to environmental parameters

  • create a heatmap representing this correlation

Creating a Tercen project

First, we will create a Tercen project for our analyses.

  • Start Tercen and log in

  • Click on New project

  • Assign the team of your choice to the project

  • Give a name and description to this project, then click on OK

  • Your project is now created !

Loading data into Tercen

We will use data from Guidi et al. (2016), more specifically the Supplementary Table 5.

1. Download the data onto your computer

You can find the data here: https://github.com/tercen/tara_guidi2016_data. In this tutorial, we will just use the SI5_eukaryotic_lineages.tsv file.

2. Go back to Tercen and load the data

Now that we have the data on our computer, we can import it into Tercen.

  • Go to the newly created project

  • Click on New data set

  • Upload the SI5_eukaryotic_lineages.tsv file (1)

  • Give a name to the data set (2)

  • Click on Next (3)

  • Check that the data set has been loaded properly and click on Next
  • Give a name to the values (here, measurement) and variables (here, environmental_variable) (1)

  • Check all the variables (2)

  • Uncheck the ones that are not part of the environmental parameters of interest (here, Latitude and Longitude) (3) and click on OK

  • The data set is now uploaded to Tercen!

Creating a workflow

Now that we have created our dataset, we will create a workflow to analyse and visualise it.

  • Click on New workflow

  • Give a name to your workflow

  • Your workflow appears now in your project. Click on it

  • Your are now on the workflow environment that is empty. We will now add the data set we uploaded before. Right click anywhere in the blank area and click on Add.

  • Select Table and click on OK.
  • Select the data set and click on OK.
  • Now the data set is ready to be used !

Making a heatmap

To analyse our data we need to:

1. Add a data step to the workflow

  • Right click on the data set and select Add
  • Select Data step

2. Make the heatmap projection

  • Double click on the newly created data step
  • Drag and drop elements to make the heatmap projection as in the following video:
  • We now have our first heatmap, our data projection should look like this:

3. Improve the heatmap by ordering rows and columns and adding dendrograms

This heatmap looks nice, but it would be better if we could order rows and columns and add dendrograms! That way we could see which species and environmental parameters cluster together.

Tercen includes a heatmap operator (called Shiny Heatmap) to make such a representation.

In short, operators take as input Tercen projections and return computations and views. In our case, we call it a Shiny operator as it is based on the R/Shiny framework designed to produce interactive visualisations such as this heatmap.

  • In the sidebar, click on the plus sign to add an operator (1)

  • Pick the shiny_heatmap_operator. It appears now the data step environment (2)

  • Then click on Operator view tab on the left to see the result (3)



Et voilà! The operator view appeared, as well as our heatmap with a dendrogram:



Making a world map

1. Add the data and a data step to the workflow

  • Right click on the blank area and select Add

  • Add the Environmental_data data table to the workflow

  • Right click on the data set and select Add

  • Select Data step

2. Make the map projection

  • Double click on the newly created data step

  • Drag and drop elements to make the worldmap projection: . We now have our first heatmap, our data projection should look like this:



3. Make a better looking worldmap

We can already see the data we want to see, however it would be much better with a base map. Tercen includes a worldmap operator (called Worldmap Shiny) to make such a representation.

In short, operators take as input Tercen projections and return computations and views. In our case, we call it a Shiny operator as it is based on the R/Shiny framework designed to produce interactive visualisations such as this worldmap.

  • In the sidebar, click on the plus sign to add an operator

  • Pick the Worldmap operator. It appears now the data step environment

  • Then click on Operator view tab on the left to see the result: