Reproducible data science with Snakemake
11 January 2021
The Snakemake workflow management system is a tool to create reproducible and scalable data analyses. Workflows are described via a human readable, Python based language. They can be seamlessly scaled to server, cluster, grid and cloud environments, without the need to modify the workflow definition. Finally, Snakemake workflows can entail a description of required software, which will be automatically deployed to any execution environment.
With over 200k downloads on Bioconda and on average >5 new citations per week, Snakemake is a widely used and accepted standard for reproducible data science that has powered numerous high impact publications.
- Basic experience in Python programming
- A laptop with Linux, MacOS, or Windows with WSL