Table of Contents

USE6 Workflow Management Systems

Workflow Management Systems are basic tools for reproducible data analysis, especially if many data analyis steps are involved.

Workflow Management Systems cover all data analysis from A to Z, e.g. data preprocessing, quality filtering, analysis and statistical evaluation of processed data. This may include, starting jobs, staging data to compute nodes, running the computations, deleting temporary data, generating publication ready reports, archiving and cleaning work environments.

There are some Workflow Management Systems, such as Nextflow or Snakemake, which are designed to automate this process on HPC Clusters.

Learning objectives

Subskills