|
1 |
| -# GMS-artic (ncov2019-artic-nf) |
| 1 | + |
| 2 | + |
2 | 3 |
|
3 | 4 | A nextflow pipeline with a GMS touch for running the ARTIC network's fieldbioinformatics tools (https://github.com/artic-network/fieldbioinformatics).
|
4 | 5 |
|
| 6 | +### Table of contents - |
| 7 | +- [Version updates](#Version-updates) |
| 8 | +- [Pipeline Diagram](#Pipeline-Diagram) |
| 9 | +- [Requirements](#Requirements) |
| 10 | +- [Quick start guide](#Quick-Start-Guide) |
| 11 | + - [parameters setup](#Parameters-setup) |
| 12 | + - [Test Data](#Test-Data) |
| 13 | + - [Run on local server](#-Run-on-local-server) |
| 14 | + - [Requirements](#Requirements) |
| 15 | + - [Illumina pipeline](#Illumina-pipeline) |
| 16 | + - [Nanopore nanopolish pipeline](#Nanopore-nanopolish-pipeline) |
| 17 | + - [Nanopore medaka pipeline](#Nanopore-medaka-pipeline) |
| 18 | + - [How to run in NGP server](#How-to-run-in-NGP-server) |
| 19 | + - [Datafile structure](#Datafile-structure) |
| 20 | + - [Pipeline run command](#Manual-running-of-analysis-pipeline) |
| 21 | + - [Illumina pipeline](#Run-Illumina-pipeline) |
| 22 | + - [Nanopore pipeline](#Run-Nanopore-Pipeline) |
| 23 | +- [Useful information](#Useful-information) |
5 | 24 | ------------
|
6 |
| -#### Major changes |
| 25 | +# Version updates |
| 26 | +## v2.0.0 |
| 27 | +### Major updates |
| 28 | +- Docker container separated for Pangolin typing |
| 29 | + - Illumina container: [gms-artic-illumina](https://hub.docker.com/repository/docker/genomicmedicinesweden/gms-artic-illumina) |
| 30 | + - Nanopore container: [gms-artic-nanopore](https://hub.docker.com/repository/docker/genomicmedicinesweden/gms-artic-nanopore) |
| 31 | + - Pangolin container: [gms-artic-pangolin](https://hub.docker.com/repository/docker/genomicmedicinesweden/gms-artic-pangolin) |
| 32 | + - pycoQC container : [pycoqc](https://hub.docker.com/repository/docker/jd21/pycoqc) |
| 33 | +- Added separate package version files for each workflow |
| 34 | + - versions: for Illumina and Nanopore |
| 35 | + - pangoversion: for pangolin typing |
| 36 | +- Illumina analysis additional features |
| 37 | + - flagstat |
| 38 | + - depth |
| 39 | + - VEP annotation |
| 40 | +- Illumina results works for sc2reporter visualization |
| 41 | +- Nanopore analysis additional features (artic & medaka) |
| 42 | + - [fastqc](https://github.com/s-andrews/FastQC) |
| 43 | + - [multiqc](https://multiqc.info) |
| 44 | + - [pycoQC](https://github.com/a-slide/pycoQC) *(only for artic)* |
| 45 | + |
| 46 | +## v1.8.0 |
| 47 | +### Minor updates |
| 48 | + |
| 49 | +- Pangolin v4 support |
| 50 | +- Updated Picard arguments |
| 51 | +- FastQC commands can be added from config |
| 52 | +- Added version of pangolin to build_dockerfile |
| 53 | + |
| 54 | +### Bug fixes |
| 55 | +- Fixed build_dockerfile |
| 56 | +- Fixed R issue |
| 57 | +- Fixed mamba issue |
7 | 58 |
|
| 59 | +### Major changes |
8 | 60 | * The illumina and nanopore tracks automatically run pangolin and nextclade.
|
9 | 61 | * Generates report for base changes.
|
10 | 62 |
|
11 |
| -###### 1. gms-artic in ngp-gms |
| 63 | +# Pipeline Diagram |
| 64 | + |
| 65 | + |
| 66 | +Find DAG and other figures [here](workflow-image/) |
| 67 | + |
| 68 | +# Requirements |
| 69 | +- Nextflow version >=20.10, <22.0 (tested OK on NextFlow version 20.10.0, version 21.10.6) |
| 70 | +- Singularity version 3.7.1 (tested OK) |
| 71 | +- Conda version >= 4.13.0 (tested OK) |
| 72 | + |
| 73 | +# Quick Start Guide |
| 74 | +## Test Data |
| 75 | +To test the pipeline, an [example dataset](./.github/data) for both Illumina and Nanopore (nanopolish, medaka) datafiles (from ConnerLab) provided. |
| 76 | + |
| 77 | +# parameters setup |
| 78 | +## primer scheme |
| 79 | +##### --scheme: To use the primer list, add --scheme to the CLI, eg., use 'nCoV-2019/V3' for artic primers or 'midnight-primer/V1' |
12 | 80 |
|
13 |
| -*for nanopore analysis (default is "midnight")* |
14 |
| -``` |
15 |
| - sample_name |
16 |
| - |___ fast5_pass/ |
17 |
| - |___ fastq_pass/ |
18 |
| - |___ sequencing_summary.txt |
19 |
| -``` |
20 |
| -*for illumina analysis* |
21 | 81 | ```
|
22 |
| - sample_name |
23 |
| - |___ fastq/ |
| 82 | +--scheme nCoV-2019/V3/ |
| 83 | +--scheme midnight-primers/V1/ |
| 84 | +--scheme eden-primers/V1/ |
24 | 85 | ```
|
25 |
| -#### Manual running of analysis pipeline |
26 |
| -###### 2. Run Illumina pipeline |
| 86 | +**To run the artic pipeline, please change the [nanopore.config](https://github.com/JD2112/gms-artic/blob/master/conf/nanopore.config) 'min_length' (default = 400) and 'max_length' (default = 700)** |
| 87 | + |
| 88 | +**For more parameters setup, please see the [ConnerLab documentation](ConnerLab-README.md)** |
| 89 | + |
| 90 | +## Run on local server |
| 91 | +### Requirements |
| 92 | +1. Containers: [Singularity](https://singularity-tutorial.github.io/01-installation/), [Docker](https://docs.docker.com/engine/install/) |
| 93 | +2. [Nextflow>=20](https://www.nextflow.io/docs/latest/getstarted.html) |
| 94 | + |
| 95 | +### Illumina pipeline |
27 | 96 | ```
|
28 |
| -$ nextflow run main.nf -profile singularity,sge \ |
| 97 | +nextflow run main.nf -profile singularity \ |
29 | 98 | --illumina --prefix "test_illumina" \
|
30 | 99 | --directory .github/data/fastqs/ \
|
31 | 100 | --outdir illumina_test
|
32 | 101 | ```
|
33 |
| - |
34 |
| -###### 3. Run Nanopore Pipeline |
35 |
| -###### **Deafult is "midnight" protocol** |
| 102 | +### Nanopore nanopolish pipeline |
36 | 103 | ```
|
37 |
| -$ nextflow run main.nf -profile singularity \ |
38 |
| - --nanopolish --prefix "midnight" \ |
39 |
| - --basecalled_fastq /home/test/fastq_pass/ \ |
40 |
| - --fast5_pass /home/test/fast5_pass/ \ |
41 |
| - --sequencing_summary /home/test/sequencing_summary_FAP82331_657703c9.txt \ |
42 |
| - --scheme-directory midnight-primer/V1/ \ |
43 |
| - --outdir /home/test/midnight_test -with-report midnight |
| 104 | +nextflow run main.nf -profile singularity \ |
| 105 | + --nanopolish --prefix "test_nanopore_nanopolish" \ |
| 106 | + --basecalled_fastq .github/data/nanopore/20200311_1427_X1_FAK72834_a3787181/fastq_pass/ \ |
| 107 | + --fast5_pass .github/data/nanopore/20200311_1427_X1_FAK72834_a3787181/fast5_pass/ \ |
| 108 | + --sequencing_summary .github/data/nanopore/20200311_1427_X1_FAK72834_a3787181/sequencing_summary_FAK72834_298b7829.txt \ |
| 109 | + --outdir nanopore_nanopolish |
44 | 110 | ```
|
45 |
| - |
46 |
| -###### --scheme: To use the primer list, add --scheme to the CLI, eg., use 'nCoV-2019/V3' for artic primers or 'midnight-primer/V1' |
47 |
| - |
| 111 | +#### Nanopore medaka pipeline |
48 | 112 | ```
|
49 |
| ---scheme nCoV-2019/V3/ |
50 |
| ---scheme midnight-primers/V1/ |
51 |
| ---scheme eden-primers/V1/ |
52 |
| -
|
| 113 | +nextflow run main.nf -profile singularity \ |
| 114 | + --medaka --prefix "test_nanopore_medaka" \ |
| 115 | + --basecalled_fastq .github/data/nanopore/20200311_1427_X1_FAK72834_a3787181/fastq_pass/ \ |
| 116 | + --outdir nanopore_medaka |
53 | 117 | ```
|
54 |
| -###### **To run the artic pipeline, please change the [nanopore.config](https://github.com/JD2112/gms-artic/blob/master/conf/nanopore.config) 'min_length' (default = 400) and 'max_length' (default = 700)** |
55 | 118 |
|
| 119 | +## Run on NGP server |
| 120 | +### Datafile structure |
| 121 | +1. *for Nanopore analysis (default is "midnight")* |
| 122 | +``` |
| 123 | + sample_name |
| 124 | + |___ fast5_pass/ |
| 125 | + |___ fastq_pass/ |
| 126 | + |___ sequencing_summary.txt |
56 | 127 | ```
|
57 |
| -$ nextflow run main.nf -profile singularity,sge \ |
| 128 | +#### Run Nanopolish pipeline |
| 129 | +``` |
| 130 | +nextflow run main.nf -profile singularity,sge \ |
58 | 131 | --nanopolish --prefix "test_nanopore" \
|
59 | 132 | --basecalled_fastq .github/data/nanopore/20200311_1427_X1_FAK72834_a3787181/fastq_pass/ \
|
60 | 133 | --fast5_pass .github/data/nanopore/20200311_1427_X1_FAK72834_a3787181/fast5_pass/ \
|
61 | 134 | --sequencing_summary .github/data/nanopore/20200311_1427_X1_FAK72834_a3787181/sequencing_summary_FAK72834_298b7829.txt \
|
62 |
| - --outdir nanopore_test |
| 135 | + --outdir nanopore_test |
| 136 | +``` |
| 137 | + |
| 138 | +#### Run medaka pipeline |
63 | 139 | ```
|
64 |
| -#### To update your container image to the latest version from [dockerhub](https://hub.docker.com/orgs/genomicmedicinesweden/repositories), please delete your local image first before running the analysis pipeline. |
| 140 | +nextflow run main.nf -profile singularity,sge \ |
| 141 | + --medaka --prefix "test_nanopore_medaka" \ |
| 142 | + --basecalled_fastq .github/data/nanopore/20200311_1427_X1_FAK72834_a3787181/fastq_pass/ \ |
| 143 | + --outdir nanopore_medaka |
| 144 | +``` |
| 145 | +2. *for Illumina analysis* |
| 146 | +``` |
| 147 | + sample_name |
| 148 | + |___ fastq/ |
| 149 | +``` |
| 150 | +#### Run Illumina pipeline |
| 151 | +``` |
| 152 | +nextflow run main.nf -profile singularity,sge \ |
| 153 | + --illumina --prefix "test_illumina" \ |
| 154 | + --directory .github/data/fastqs/ \ |
| 155 | + --outdir illumina_test |
| 156 | +``` |
| 157 | + |
| 158 | + |
| 159 | +# Useful information |
| 160 | +1.To update your container image to the latest version from [dockerhub](https://hub.docker.com/orgs/genomicmedicinesweden/repositories), please delete your local image first before running the analysis pipeline. |
0 commit comments