annotate 0.5.0/readme/nowayout.md @ 0:3c767f9cfd88 draft default tip

planemo upload
author galaxytrakr
date Fri, 29 May 2026 13:37:56 +0000
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
1 <p align="center">
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
2 <img src="../assets/nowayout-icon.png" width="20%" height="20%" />
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
3 </p>
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
4
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
5 ---
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
6
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
7 `nowayout` is a **super-fast** automated software pipeline for taxonomic classification of Eukaryotic mitochondrial reads. It uses a custom database to first identify mitochondrial reads and performs read classification on those identified reads.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
8
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
9 ---
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
10
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
11 <!-- TOC -->
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
12
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
13 - [Minimum Requirements](#minimum-requirements)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
14 - [HFP GalaxyTrakr](#hfp-galaxytrakr)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
15 - [Usage and Examples](#usage-and-examples)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
16 - [Databases](#databases)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
17 - [Input](#input)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
18 - [Output](#output)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
19 - [Preset filters](#preset-filters)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
20 - [Computational resources](#computational-resources)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
21 - [Runtime profiles](#runtime-profiles)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
22 - [your_institution.config](#your_institutionconfig)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
23 - [Test Run](#test-run)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
24 - [nowayout CLI Help](#nowayout-cli-help)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
25
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
26 <!-- /TOC -->
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
27
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
28 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
29 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
30
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
31 ## Minimum Requirements
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
32
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
33 1. [Nextflow version 25.04.6](https://github.com/nextflow-io/nextflow/releases/download/v25.04.6/nextflow).
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
34 - Make the `nextflow` binary executable (`chmod 755 nextflow`) and also make sure that it is made available in your `$PATH`.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
35 - If your existing `JAVA` install does not support the newest **Nextflow** version, you can try **Amazon**'s `JAVA` (OpenJDK): [Corretto](https://docs.aws.amazon.com/corretto/latest/corretto-21-ug/downloads-list.html).
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
36 2. Either of `micromamba` (version `1.5.9`) or `docker` or `singularity` installed and made available in your `$PATH`.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
37 - Running the workflow via `micromamba` software provisioning is **preferred** as it does not require any `sudo` or `admin` privileges or any other configurations with respect to the various container providers.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
38 - To install `micromamba` for your system type, please follow these [installation steps](https://mamba.readthedocs.io/en/latest/installation/micromamba-installation.html#linux-and-macos) and make sure that the `micromamba` binary is made available in your `$PATH`.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
39 - Just the `curl` step is sufficient to download the binary as far as running the workflows are concerned.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
40 - Once you have finished the installation, **it is important that you downgrade `micromamba` to version `1.5.9`**.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
41 - First check, if your version is other than `1.5.9` and if not, do the downgrade.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
42
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
43 ```bash
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
44 micromamba --version
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
45 micromamba self-update --version 1.5.9 -c conda-forge
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
46 ```
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
47
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
48 3. Minimum of 10 CPU cores and about 60 GBs for main workflow steps. More memory may be required if your **FASTQ** files are big.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
49
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
50 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
51 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
52
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
53 ## HFP GalaxyTrakr
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
54
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
55 The `nowayout` pipeline **will** be made available for use on the newest version of [Galaxy instance supported by HFP, FDA](https://galaxytrakr.org/) (`version >= 24.x`). Please check this space for announcements in this regard.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
56
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
57 Please note that the pipeline on [HFP GalaxyTrakr](https://galaxytrakr.org) in most cases may be a version older than the one on **GitHub** due to testing prioritization.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
58
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
59 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
60 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
61
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
62 ## Usage and Examples
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
63
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
64 Clone or download this repository and then call `cpipes`.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
65
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
66 ```bash
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
67 cpipes --pipeline nowayout [options]
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
68 ```
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
69
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
70 Alternatively, you can use `nextflow` to directly pull and run the pipeline.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
71
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
72 ```bash
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
73 nextflow pull CFSAN-Biostatistics/nowayout
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
74 nextflow list
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
75 nextflow info CFSAN-Biostatistics/nowayout
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
76 nextflow run CFSAN-Biostatistics/nowayout --pipeline nowayout --help
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
77 ```
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
78
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
79 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
80 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
81
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
82 ### Databases
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
83
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
84 ---
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
85
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
86 The successful run of the workflow requires proper setup of the custom database files:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
87
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
88 - `nowayout_dbs`: [Download](https://cfsan-pub-xfer.s3.amazonaws.com/Kranti.Konganti/nowayout/nowayout_dbs.tar.bz2) (~ 22 GB).
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
89
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
90 Once you have downloaded the databases, uncompress and set the **UNIX symbolic link** to the database folders in [assets](../assets/) folder as follows:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
91
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
92 ```bash
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
93 mkdir assets/dbfiles
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
94 cd assets/dbfiles
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
95 ln -s /path/to/nowayout_dbs/kma kma
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
96 ln -s /path/to/nowayout_dbs/reference reference
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
97 ln -s /path/to/nowayout_dbs/taxonomy taxonomy
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
98 ```
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
99
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
100 That's it!
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
101
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
102 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
103 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
104
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
105 ### Input
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
106
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
107 ---
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
108
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
109 The input to the workflow is a folder containing compressed (`.gz`) FASTQ files of long reads or short reads. Please note that the sample grouping happens automatically by the file name of the FASTQ file. If for example, a single sample is sequenced across multiple sequencing lanes, you can choose to group those FASTQ files into one sample by using the `--fq_filename_delim` and `--fq_filename_delim_idx` options. By default, `--fq_filename_delim` is set to `_` (underscore) and `--fq_filename_delim_idx` is set to 1.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
110
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
111 For example, if the directory contains FASTQ files as shown below:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
112
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
113 - KB-01_apple_L001_R1.fastq.gz
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
114 - KB-01_apple_L001_R2.fastq.gz
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
115 - KB-01_apple_L002_R1.fastq.gz
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
116 - KB-01_apple_L002_R2.fastq.gz
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
117 - KB-02_mango_L001_R1.fastq.gz
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
118 - KB-02_mango_L001_R2.fastq.gz
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
119 - KB-02_mango_L002_R1.fastq.gz
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
120 - KB-02_mango_L002_R2.fastq.gz
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
121
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
122 Then, to create 2 sample groups, `apple` and `mango`, we split the file name by the delimitor (underscore in the case, which is default) and group by the first 2 words (`--fq_filename_delim_idx 2`).
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
123
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
124 This goes without saying that all the FASTQ files should have uniform naming patterns so that `--fq_filename_delim` and `--fq_filename_delim_idx` options do not have any adverse effect in collecting and creating a sample metadata sheet.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
125
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
126 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
127 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
128
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
129 ### Output
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
130
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
131 ---
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
132
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
133 All the outputs for each step are stored inside the folder mentioned with the `--output` option. A `multiqc_report.html` file inside the `nowayout-multiqc` folder can be opened in any browser on your local workstation which contains a consolidated brief report.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
134
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
135 Please note that the percentage relative abundances seen are relative to the total number of mitochondrial reads and not the total number of reads per sample.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
136
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
137 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
138 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
139
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
140 ### Preset filters
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
141
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
142 ---
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
143
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
144 There are three preset threshold filters that are available with the pipeline: `--nowo_thresholds strict`, `--nowo_thresholds mild` and `--nowo_thresholds relax`. Use these options for exploration of results via multiple runs on the same input dataset. The default is `strict` thresholds.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
145
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
146 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
147 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
148
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
149 ### Computational resources
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
150
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
151 ---
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
152
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
153 The workflows `nowayout` require at least a minimum of 10 CPU cores and 60 GBs of memory to successfully finish the workflow.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
154
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
155 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
156 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
157
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
158 ### Runtime profiles
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
159
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
160 ---
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
161
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
162 You can use different run time profiles that suit your specific compute environments i.e., you can run the workflow locally on your machine or in a grid computing infrastructure.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
163
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
164 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
165 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
166
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
167 Example:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
168
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
169 ```bash
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
170 cd /data/scratch/$USER
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
171 mkdir nf-cpipes
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
172 cd nf-cpipes
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
173 cpipes \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
174 --pipeline nowayout \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
175 --input /path/to/fastq_pass_dir \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
176 --output /path/to/where/output/should/go \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
177 -profile your_institution
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
178 ```
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
179
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
180 The above command would run the pipeline and store the output at the location per the `--output` flag and the **NEXTFLOW** reports are always stored in the current working directory from where `cpipes` is run. For example, for the above command, a directory called `CPIPES-nowayout` would hold all the **NEXTFLOW** related logs, reports and trace files.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
181
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
182 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
183 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
184
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
185 ### `your_institution.config`
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
186
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
187 ---
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
188
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
189 In the above example, we can see that we have mentioned the run time profile as `your_institution`. For this to work, add the following lines at the end of [`computeinfra.config`](../conf/computeinfra.config) file which should be located inside the `conf` folder. For example, if your institution uses **SGE** or **UNIVA** for grid computing instead of **SLURM** and has a job queue named `normal.q`, then add these lines:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
190
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
191 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
192 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
193
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
194 ```groovy
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
195 your_institution {
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
196 process.executor = 'sge'
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
197 process.queue = 'normal.q'
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
198 singularity.enabled = false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
199 singularity.autoMounts = true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
200 docker.enabled = false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
201 params.enable_conda = true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
202 conda.enabled = true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
203 conda.useMicromamba = true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
204 params.enable_module = false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
205 }
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
206 ```
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
207
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
208 In the above example, by default, all the software provisioning choices are disabled except `conda`. You can also choose to remove the `process.queue` line altogether and the `nowayout` workflow will request the appropriate memory and number of CPU cores automatically, which ranges from 1 CPU, 1 GB and 1 hour for job completion up to 10 CPU cores, 1 TB and 120 hours for job completion.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
209
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
210 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
211 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
212
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
213 ### Cloud computing
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
214
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
215 ---
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
216
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
217 You can run the workflow in the cloud (works only with proper set up of AWS resources). Add new run time profiles with required parameters per [Nextflow docs](https://www.nextflow.io/docs/latest/executor.html):
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
218
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
219 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
220 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
221
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
222 Example:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
223
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
224 ```groovy
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
225 my_aws_batch {
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
226 executor = 'awsbatch'
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
227 queue = 'my-batch-queue'
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
228 aws.batch.cliPath = '/home/ec2-user/miniconda/bin/aws'
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
229 aws.batch.region = 'us-east-1'
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
230 singularity.enabled = false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
231 singularity.autoMounts = true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
232 docker.enabled = true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
233 params.conda_enabled = false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
234 params.enable_module = false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
235 }
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
236 ```
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
237
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
238 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
239 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
240
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
241 ## Test Run
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
242
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
243 After you make sure that you have all the [minimum requirements](#minimum-requirements) to run the workflow, you can try the `nowayout` on some datasets.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
244
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
245 - Download input reads [from S3](https://cfsan-pub-xfer.s3.amazonaws.com/Kranti.Konganti/nowayout/nowayout_test_reads.tar.bz2) (~ 8 GB).
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
246 - This dataset was part of the research for detecting and identifying insects or insect fragments in food, an essential component of food safety and regulatory monitoring. Insects such as **_Plodia interpunctella_** (Indian meal moth) and _**Tribolium castaneum**_ (red flour beetle) were intentionally spiked into wheat flour at varying concentrations to create benchmark samples. These serve as reference materials to test and validate molecular detection workflows.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
247 - Download pre-formatted databases (**MANDATORY**) [from S3](https://cfsan-pub-xfer.s3.amazonaws.com/Kranti.Konganti/nowayout/nowayout_dbs.tar.bz2) (~ 22 GB).
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
248 - After successful download, untar and add **symbolic links** in [assets](../assets) folder as described in the [Databases](#databases) section.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
249 - It is always a best practice to use absolute UNIX paths and real destinations of symbolic links during pipeline execution. For example, find out the real path(s) of your absolute UNIX path(s) and use that for the `--input` and `--output` options of the pipeline.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
250
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
251 ```bash
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
252 realpath /hpc/scratch/user/input/srr
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
253 ```
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
254
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
255 - Now run the workflow by ignoring quality values since these are simulated base qualities:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
256
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
257 ```bash
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
258 cpipes \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
259 --pipeline nowayout \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
260 --input /path/to/nowayout_test_reads \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
261 --output /path/to/nowayout_test_output \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
262 --fq_single_end true \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
263 -profile stdkondagac \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
264 -resume
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
265 ```
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
266
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
267 - After succesful run of the workflow, your **MultiQC** report should look something like [this](https://cfsan-pub-xfer.s3.us-east-1.amazonaws.com/Kranti.Konganti/nowayout/CPIPES-Report_multiqc_report.html).
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
268
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
269 - `nowayout` also automatically generates [Krona](https://github.com/marbl/Krona) charts. The **Krona** chart for the above test run should look something like [this](https://cfsan-pub-xfer.s3.us-east-1.amazonaws.com/Kranti.Konganti/nowayout/CPIPES_nowayout_krona.html)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
270
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
271 Please note that the run time profile `stdkondagac` will run jobs locally using `micromamba` for software provisioning. The first time you run the command, a new folder called `kondagac_cache` will be created and subsequent runs should use this `conda` cache.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
272
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
273 \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
274 &nbsp;
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
275
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
276 ## `nowayout` CLI Help
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
277
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
278 ```text
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
279 cpipes --pipeline nowayout --help
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
280
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
281 N E X T F L O W ~ version 24.10.4
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
282
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
283 Launching `/home/user/nowayout/cpipes` [sleepy_pauling] DSL2 - revision: 55d6f63710
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
284
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
285 ================================================================================
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
286 (o)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
287 ___ _ __ _ _ __ ___ ___
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
288 / __|| '_ \ | || '_ \ / _ \/ __|
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
289 | (__ | |_) || || |_) || __/\__ \
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
290 \___|| .__/ |_|| .__/ \___||___/
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
291 | | | |
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
292 |_| |_|
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
293 --------------------------------------------------------------------------------
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
294 A collection of modular pipelines at CFSAN, FDA.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
295 --------------------------------------------------------------------------------
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
296 Name : CPIPES
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
297 Author : Kranti.Konganti@fda.hhs.gov
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
298 Version : 0.8.0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
299 Center : CFSAN, FDA.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
300 ================================================================================
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
301
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
302 Workflow : nowayout
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
303
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
304 Author : Kranti Konganti
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
305
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
306 Version : 0.5.0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
307
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
308
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
309 Usage : cpipes --pipeline nowayout [options]
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
310
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
311
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
312 Required :
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
313
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
314 --input : Absolute path to directory containing FASTQ
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
315 files. The directory should contain only
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
316 FASTQ files as all the files within the
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
317 mentioned directory will be read. Ex: --
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
318 input /path/to/fastq_pass
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
319
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
320 --output : Absolute path to directory where all the
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
321 pipeline outputs should be stored. Ex: --
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
322 output /path/to/output
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
323
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
324 Other options :
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
325
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
326 --metadata : Absolute path to metadata CSV file
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
327 containing five mandatory columns: sample,
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
328 fq1,fq2,strandedness,single_end. The fq1
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
329 and fq2 columns contain absolute paths to
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
330 the FASTQ files. This option can be used in
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
331 place of --input option. This is rare. Ex
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
332 : --metadata samplesheet.csv
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
333
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
334 --fq_suffix : The suffix of FASTQ files (Unpaired reads
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
335 or R1 reads or Long reads) if an input
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
336 directory is mentioned via --input option.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
337 Default: _R1_001.fastq.gz
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
338
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
339 --fq2_suffix : The suffix of FASTQ files (Paired-end reads
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
340 or R2 reads) if an input directory is
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
341 mentioned via --input option. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
342 _R2_001.fastq.gz
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
343
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
344 --fq_filter_by_len : Remove FASTQ reads that are less than this
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
345 many bases. Default: 0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
346
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
347 --fq_strandedness : The strandedness of the sequencing run.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
348 This is mostly needed if your sequencing
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
349 run is RNA-SEQ. For most of the other runs
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
350 , it is probably safe to use unstranded for
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
351 the option. Default: unstranded
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
352
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
353 --fq_single_end : SINGLE-END information will be auto-
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
354 detected but this option forces PAIRED-END
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
355 FASTQ files to be treated as SINGLE-END so
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
356 only read 1 information is included in auto
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
357 -generated samplesheet. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
358
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
359 --fq_filename_delim : Delimiter by which the file name is split
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
360 to obtain sample name. Default: _
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
361
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
362 --fq_filename_delim_idx : After splitting FASTQ file name by using
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
363 the --fq_filename_delim option, all
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
364 elements before this index (1-based) will
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
365 be joined to create final sample name.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
366 Default: 1
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
367
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
368 --fastp_run : Run fastp tool. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
369
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
370 --fastp_failed_out : Specify whether to store reads that cannot
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
371 pass the filters. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
372
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
373 --fastp_merged_out : Specify whether to store merged output or
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
374 not. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
375
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
376 --fastp_overlapped_out : For each read pair, output the overlapped
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
377 region if it has no mismatched base.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
378 Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
379
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
380 --fastp_6 : Indicate that the input is using phred64
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
381 scoring (it'll be converted to phred33, so
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
382 the output will still be phred33). Default
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
383 : false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
384
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
385 --fastp_reads_to_process : Specify how many reads/pairs are to be
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
386 processed. Default value 0 means process
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
387 all reads. Default: 0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
388
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
389 --fastp_fix_mgi_id : The MGI FASTQ ID format is not compatible
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
390 with many BAM operation tools, enable this
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
391 option to fix it. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
392
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
393 --fastp_A : Disable adapter trimming. On by default.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
394 Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
395
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
396 --fastp_adapter_fasta : Specify a FASTA file to trim both read1 and
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
397 read2 (if PE) by all the sequences in this
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
398 FASTA file. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
399
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
400 --fastp_f : Trim how many bases in front of read1.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
401 Default: 0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
402
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
403 --fastp_t : Trim how many bases at the end of read1.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
404 Default: 0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
405
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
406 --fastp_b : Max length of read1 after trimming. Default
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
407 : 0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
408
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
409 --fastp_F : Trim how many bases in front of read2.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
410 Default: 0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
411
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
412 --fastp_T : Trim how many bases at the end of read2.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
413 Default: 0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
414
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
415 --fastp_B : Max length of read2 after trimming. Default
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
416 : 0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
417
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
418 --fastp_dedup : Enable deduplication to drop the duplicated
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
419 reads/pairs. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
420
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
421 --fastp_dup_calc_accuracy : Accuracy level to calculate duplication (1~
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
422 6), higher level uses more memory (1G, 2G,
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
423 4G, 8G, 16G, 24G). Default 1 for no-dedup
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
424 mode, and 3 for dedup mode. Default: 6
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
425
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
426 --fastp_poly_g_min_len : The minimum length to detect polyG in the
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
427 read tail. Default: 10
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
428
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
429 --fastp_G : Disable polyG tail trimming. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
430
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
431 --fastp_x : Enable polyX trimming in 3' ends. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
432 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
433
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
434 --fastp_poly_x_min_len : The minimum length to detect polyX in the
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
435 read tail. Default: 10
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
436
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
437 --fastp_cut_front : Move a sliding window from front (5') to
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
438 tail, drop the bases in the window if its
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
439 mean quality < threshold, stop otherwise.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
440 Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
441
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
442 --fastp_cut_tail : Move a sliding window from tail (3') to
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
443 front, drop the bases in the window if its
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
444 mean quality < threshold, stop otherwise.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
445 Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
446
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
447 --fastp_cut_right : Move a sliding window from tail, drop the
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
448 bases in the window and the right part if
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
449 its mean quality < threshold, and then stop
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
450 . Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
451
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
452 --fastp_W : Sliding window size shared by --
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
453 fastp_cut_front, --fastp_cut_tail and --
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
454 fastp_cut_right. Default: 20
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
455
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
456 --fastp_M : The mean quality requirement shared by --
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
457 fastp_cut_front, --fastp_cut_tail and --
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
458 fastp_cut_right. Default: 30
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
459
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
460 --fastp_q : The quality value below which a base should
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
461 is not qualified. Default: 30
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
462
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
463 --fastp_u : What percent of bases are allowed to be
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
464 unqualified. Default: 40
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
465
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
466 --fastp_n : How many N's can a read have. Default: 5
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
467
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
468 --fastp_e : If the full reads' average quality is below
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
469 this value, then it is discarded. Default
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
470 : 0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
471
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
472 --fastp_l : Reads shorter than this length will be
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
473 discarded. Default: 35
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
474
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
475 --fastp_max_len : Reads longer than this length will be
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
476 discarded. Default: 0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
477
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
478 --fastp_y : Enable low complexity filter. The
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
479 complexity is defined as the percentage of
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
480 bases that are different from its next base
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
481 (base[i] != base[i+1]). Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
482
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
483 --fastp_Y : The threshold for low complexity filter (0~
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
484 100). Ex: A value of 30 means 30%
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
485 complexity is required. Default: 30
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
486
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
487 --fastp_U : Enable Unique Molecular Identifier (UMI)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
488 pre-processing. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
489
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
490 --fastp_umi_loc : Specify the location of UMI, can be one of
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
491 index1/index2/read1/read2/per_index/
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
492 per_read. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
493
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
494 --fastp_umi_len : If the UMI is in read1 or read2, its length
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
495 should be provided. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
496
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
497 --fastp_umi_prefix : If specified, an underline will be used to
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
498 connect prefix and UMI (i.e. prefix=UMI,
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
499 UMI=AATTCG, final=UMI_AATTCG). Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
500 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
501
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
502 --fastp_umi_skip : If the UMI is in read1 or read2, fastp can
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
503 skip several bases following the UMI.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
504 Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
505
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
506 --fastp_p : Enable overrepresented sequence analysis.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
507 Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
508
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
509 --fastp_P : One in this many number of reads will be
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
510 computed for overrepresentation analysis (1
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
511 ~10000), smaller is slower. Default: 20
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
512
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
513 --kmaalign_run : Run kma tool. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
514
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
515 --kmaalign_int : Input file has interleaved reads. Default
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
516 : false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
517
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
518 --kmaalign_ef : Output additional features. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
519
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
520 --kmaalign_vcf : Output vcf file. 2 to apply FT. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
521 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
522
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
523 --kmaalign_sam : Output SAM, 4/2096 for mapped/aligned.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
524 Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
525
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
526 --kmaalign_nc : No consensus file. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
527
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
528 --kmaalign_na : No aln file. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
529
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
530 --kmaalign_nf : No frag file. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
531
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
532 --kmaalign_a : Output all template mappings. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
533 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
534
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
535 --kmaalign_and : Use both -mrs and p-value on consensus.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
536 Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
537
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
538 --kmaalign_oa : Use neither -mrs or p-value on consensus.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
539 Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
540
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
541 --kmaalign_bc : Minimum support to call bases. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
542 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
543
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
544 --kmaalign_bcNano : Altered indel calling for ONT data. Default
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
545 : false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
546
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
547 --kmaalign_bcd : Minimum depth to call bases. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
548
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
549 --kmaalign_bcg : Maintain insignificant gaps. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
550
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
551 --kmaalign_ID : Minimum consensus ID. Default: 85.0
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
552
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
553 --kmaalign_md : Minimum depth. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
554
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
555 --kmaalign_dense : Skip insertion in consensus. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
556
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
557 --kmaalign_ref_fsa : Use Ns on indels. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
558
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
559 --kmaalign_Mt1 : Map everything to one template. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
560 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
561
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
562 --kmaalign_1t1 : Map one query to one template. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
563 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
564
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
565 --kmaalign_mrs : Minimum relative alignment score. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
566 0.99
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
567
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
568 --kmaalign_mrc : Minimum query coverage. Default: 0.99
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
569
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
570 --kmaalign_mp : Minimum phred score of trailing and leading
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
571 bases. Default: 30
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
572
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
573 --kmaalign_mq : Set the minimum mapping quality. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
574 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
575
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
576 --kmaalign_eq : Minimum average quality score. Default: 30
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
577
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
578 --kmaalign_5p : Trim 5 prime by this many bases. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
579 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
580
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
581 --kmaalign_3p : Trim 3 prime by this many bases Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
582 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
583
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
584 --kmaalign_apm : Sets both -pm and -fpm Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
585
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
586 --kmaalign_cge : Set CGE penalties and rewards Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
587 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
588
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
589 --seqkit_grep_run : Run the seqkit `grep` tool. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
590
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
591 --seqkit_grep_n : Match by full name instead of just ID.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
592 Default: undefined
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
593
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
594 --seqkit_grep_s : Search subseq on seq, both positive and
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
595 negative strand are searched, and mismatch
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
596 allowed using flag --seqkit_grep_m. Default
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
597 : undefined
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
598
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
599 --seqkit_grep_c : Input is circular genome Default: undefined
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
600
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
601 --seqkit_grep_C : Just print a count of matching records.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
602 With the --seqkit_grep_v flag, count non-
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
603 matching records. Default: undefined
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
604
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
605 --seqkit_grep_i : Ignore case while using seqkit grep.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
606 Default: undefined
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
607
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
608 --seqkit_grep_v : Invert the match i.e. select non-matching
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
609 records. Default: undefined
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
610
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
611 --seqkit_grep_m : Maximum mismatches when matching by
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
612 sequence. Default: undefined
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
613
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
614 --seqkit_grep_r : Input patters are regular expressions.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
615 Default: undefined
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
616
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
617 --salmonidx_run : Run `salmon index` tool. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
618
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
619 --salmonidx_k : The size of k-mers that should be used for
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
620 the quasi index. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
621
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
622 --salmonidx_gencode : This flag will expect the input transcript
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
623 FASTA to be in GENCODE format, and will
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
624 split the transcript name at the first `|`
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
625 character. These reduced names will be used
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
626 in the output and when looking for these
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
627 transcripts in a gene to transcript GTF.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
628 Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
629
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
630 --salmonidx_features : This flag will expect the input reference
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
631 to be in the tsv file format, and will
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
632 split the feature name at the first `tab`
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
633 character. These reduced names will be used
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
634 in the output and when looking for the
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
635 sequence of the features. GTF. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
636 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
637
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
638 --salmonidx_keepDuplicates : This flag will disable the default indexing
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
639 behavior of discarding sequence-identical
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
640 duplicate transcripts. If this flag is
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
641 passed then duplicate transcripts that
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
642 appear in the input will be retained and
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
643 quantified separately. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
644
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
645 --salmonidx_keepFixedFasta : Retain the fixed fasta file (without short
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
646 transcripts and duplicates, clipped, etc.)
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
647 generated during indexing. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
648
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
649 --salmonidx_filterSize : The size of the Bloom filter that will be
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
650 used by TwoPaCo during indexing. The filter
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
651 will be of size 2^{filterSize}. A value of
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
652 -1 means that the filter size will be
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
653 automatically set based on the number of
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
654 distinct k-mers in the input, as estimated
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
655 by nthll. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
656
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
657 --salmonidx_sparse : Build the index using a sparse sampling of
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
658 k-mer positions This will require less
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
659 memory (especially during quantification),
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
660 but will take longer to constructand can
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
661 slow down mapping / alignment. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
662 false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
663
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
664 --salmonidx_n : Do not clip poly-A tails from the ends of
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
665 target sequences. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
666
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
667 --sourmashsketch_run : Run `sourmash sketch dna` tool. Default:
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
668 true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
669
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
670 --sourmashsketch_mode : Select which type of signatures to be
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
671 created: dna, protein, fromfile or
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
672 translate. Default: dna
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
673
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
674 --sourmashsketch_p : Signature parameters to use. Default: '
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
675 abund,scaled=100,k=71
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
676
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
677 --sourmashsketch_file : <path> A text file containing a list of
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
678 sequence files to load. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
679
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
680 --sourmashsketch_f : Recompute signatures even if the file
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
681 exists. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
682
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
683 --sourmashsketch_name : Name the signature generated from each file
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
684 after the first record in the file.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
685 Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
686
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
687 --sourmashsketch_randomize : Shuffle the list of input files randomly.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
688 Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
689
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
690 --sourmashgather_run : Run `sourmash gather` tool. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
691
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
692 --sourmashgather_n : Number of results to report. By default,
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
693 will terminate at --sourmashgather_thr_bp
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
694 value. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
695
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
696 --sourmashgather_thr_bp : Reporting threshold (in bp) for estimated
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
697 overlap with remaining query. Default: 100
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
698
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
699 --sourmashgather_ani_ci : Output confidence intervals for ANI
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
700 estimates. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
701
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
702 --sourmashgather_k : The k-mer size to select. Default: 71
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
703
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
704 --sourmashgather_dna : Choose DNA signature. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
705
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
706 --sourmashgather_rna : Choose RNA signature. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
707
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
708 --sourmashgather_nuc : Choose Nucleotide signature. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
709
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
710 --sourmashgather_scaled : Scaled value should be between 100 and 1e6
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
711 . Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
712
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
713 --sourmashgather_inc_pat : Search only signatures that match this
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
714 pattern in name, filename, or md5. Default
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
715 : false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
716
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
717 --sourmashgather_exc_pat : Search only signatures that do not match
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
718 this pattern in name, filename, or md5.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
719 Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
720
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
721 --sfhpy_run : Run the sourmash_filter_hits.py script.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
722 Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
723
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
724 --sfhpy_fcn : Column name by which filtering of rows
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
725 should be applied. Default: f_match
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
726
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
727 --sfhpy_fcv : Remove genomes whose match with the query
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
728 FASTQ is less than this much. Default: 0.8
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
729
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
730 --sfhpy_gt : Apply greather than or equal to condition
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
731 on numeric values of --sfhpy_fcn column.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
732 Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
733
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
734 --sfhpy_lt : Apply less than or equal to condition on
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
735 numeric values of --sfhpy_fcn column.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
736 Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
737
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
738 --sfhpy_all : Instead of just the column value, print
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
739 entire row. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
740
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
741 --gsalkronapy_run : Run the `gen_salmon_tph_and_krona_tsv.py`
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
742 script. Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
743
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
744 --gsalkronapy_sf : Set the scaling factor by which TPM values
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
745 are scaled down. Default: 10000
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
746
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
747 --gsalkronapy_smres_suffix : Find the `sourmash gather` result files
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
748 ending in this suffix. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
749
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
750 --gsalkronapy_failed_suffix : Find the sample names which failed
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
751 classification stored inside the files
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
752 ending in this suffix. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
753
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
754 --gsalkronapy_num_lin_cols : Number of columns expected in the lineages
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
755 CSV file. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
756
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
757 --gsalkronapy_lin_regex : Number of columns expected in the lineages
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
758 CSV file. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
759
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
760 --krona_ktIT_run : Run the ktImportText (ktIT) from krona.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
761 Default: true
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
762
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
763 --krona_ktIT_n : Name of the highest level. Default: all
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
764
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
765 --krona_ktIT_q : Input file(s) do not have a field for
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
766 quantity. Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
767
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
768 --krona_ktIT_c : Combine data from each file, rather than
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
769 creating separate datasets within the chart
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
770 . Default: false
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
771
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
772 Help options :
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
773
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
774 --help : Display this message.
3c767f9cfd88 planemo upload
galaxytrakr
parents:
diff changeset
775 ```