Week 6: Pipelines
1 Links
Lecture pages
- Tuesday — Runner scripts and pipelines
- Tuesday/Thursday — Running an nf-core pipeline
Exercises & assignments
- Exercises for this week
- Your final project proposal is due on Monday, April 8th
- I have added pointers for your final project oral presentations, which will take place during class in week 7.
2 Content overview
In this final week with new content, you will learn about informal and formal pipelines for bioinformatics analysis, and you will run a best-practice pipeline that the nf-core initiative wrote with the Nextflow workflow management system.
Some of the things you will learn this week
- More about the purpose and mechanics of “runner scripts”.
- What challenges you can run into when creating analysis pipelines with regular Bash scripts, including runner scripts.
- What “workflow management systems” like Snakemake and Nextflow are, and what advantages formal pipelines written with them offer.
- What the nf-core initiative is, and how you can run one of their pipelines at OSC.
Required readings
- Perkel 2019, Nature “Toolbox” feature: Workflow systems turn raw data into scientific knowledge
- Di Tommaso et al. 2017, Nature Biotechnology: Nextflow enables reproducible computational workflows
- Ewels et al. 2020, Nature Biotechnology: The nf-core framework for community-curated bioinformatics pipelines