Containers and Workflow Pipelines for reproducible and automated data analysis
26 October 2020
This is a postponed course. Participants registered in the previous edition will have priority over new registrations.
This two-day course is organized jointly by the Bioinformatics core facility of the Centre for Genomic Regulation (Barcelona, Spain) and ELIXIR Belgium.
The first day is dedicated to Linux containers (Docker & Singularity), which are great tools for code portability and analysis reproducibility. You will learn how to build a container from scratch, share it with others, and re-use and modify existing containers.
On the second day, you will learn how to use Nextflow to build scalable and reproducible bioinformatics pipelines and run them on a personal computer, a cluster or the cloud.
- Understand what Docker and Singularity containers are and how they differ
- Write a Docker recipe, build a Docker image and run containers from it
- Push Docker images to and pull them from Docker Hub
- Dockerfiles and layers; Docker caching
- Working with volumes
- Pull a Docker image as a Singularity image
- Understand Nextflow's basic concepts: processes, channels, ...
- Write and run a Nextflow pipeline (using Singularity containers)
Bioinformaticians with little or no knowledge of containers or workflow pipelines.
You are familiar with doing bioinformatics on the command line.
Toni Hermoso Pulido
Toni Hermoso Pulido is an expert in bioinformatics software, databases and web services. He holds a degree in Biochemistry and a PhD in Biotechnology from the Autonomous University of Barcelona, Spain.
Luca Cozzuto is an expert in high-throughput sequencing data analysis and bioinformatics pipeline development. He holds a SciD in Pharmaceutical Biotechnologies and a PhD in Molecular Medicine from the University of Naples Federico II, Italy.
Sarah Bonnin is an expert in high-throughput sequencing data analysis and bioinformatics training. She obtained a BSc in Biology at the University Claude Bernard Lyon 1, France and an MSc in Bioinformatics at the University Denis Diderot Paris 7, France.
Introduction to containers
- History of containers, what are containers and why should we use them?
- Containers vs. virtual machines
- Differences between Singularity and Docker: why and when to use one or the other. Pros and cons.
- Singularity recipes
- Building a basic Singularity image
- Pull an image from Docker Hub and run it with Singularity
- Volumes in Singularity
- Use a Singularity image interactively
- Run a simple Nextflow pipeline and gain a thorough understanding of its config and pipeline files
- Modify a pipeline and rerun processes
- Theoretical introduction to processes, channels and operators: the basics of Nextflow
- Write and run a simple Nextflow pipeline (e.g. print text, process a simple calculation)
- Including Singularity containers in Nextflow pipelines