cfsan_centriflaken: cfsan_centriflaken.xml comparison

comparison cfsan_centriflaken.xml @ 47:30191f39a957

"planemo upload"

author	kkonganti
date	Mon, 11 Jul 2022 12:19:48 -0400
parents	0dd0ebe5cddf
children	8202a3cedcc4

comparison

equal deleted inserted replaced

-:0dd0ebe5cddf
+:30191f39a957
 --fq_filter_by_len $fq_filter_by_len
 #end if
 	--fq_filename_delim '${fq_filename_delim}'
 	--fq_filename_delim_idx $fq_filename_delim_idx
 	--centrifuge_extract_bug '${centrifuge_extract_bug}'
-	-profile $runtime_profile
+	-profile kondagac;
--resume;
 mv './cpipes-output/${pipeline}-multiqc/multiqc_report.html' './multiqc_report.html';
 mv './cpipes-output/${pipeline}-results/kraken2_extract_contigs' kraken2_extract_contigs;
 rm -rf ./cpipes-output;
 rm -rf ./work
 ]]></command>
 <option value="pacbio_raw">PacBio regular CLR reads (&lt;20% error)</option>
 <option value="pacbio_corr">PacBio reads that were corrected with other methods (&lt;3% error)</option>
 <option value="pacbio_hifi">PacBio HiFi reads (&lt;1% error)</option>
 </param>
 <param name="fq_suffix" value=".fastq.gz" type="text" label="Suffix of the R1 FASTQ or Unpaired FASTQ"/>
-<param name="fq2_suffix" value="_R2_001.fastq.gz" type="text" label="Suffix of the R2 FASTQ"/>
+<param name="fq2_suffix" value="_R2_001.fastq.gz" type="text" label="Suffix of the R2 FASTQ"
+help="THIS OPTION IS IGNORED IF THE INPUT READS ARE UNPAIRED/LONG READS."/>
 <param name="fq_filter_by_len" optional="true" value="" type="integer" label="Enter minimum read length to retain before starting the analysis"
 help="Keep this option empty to use default values. Default for centriflaken (long reads) is 4000 bp and for centriflaken_hy (short reads) is 75 bp)"/>
 <param name="fq_filename_delim" type="text" value="_" label="File name delimitor by which samples are grouped together (--fq_filename_delim)"
 help="This is the delimitor by which samples are grouped together to display in the final MultiQC report. For example, if your input data sets are mango_replicate1.fastq.gz, mango_replicate2.fastq.gz, orange_replicate1_maryland.fastq.gz, orange_replicate2_maryland.fastq.gz, then to create 2 samples mango and orange, the value for --fq_filename_delim would be _ (underscore) and the value for --fq_filename_delim_idx would be 1, since you want to group by the first word (i.e. mango or orange) after splitting the filename based on _ (underscore)"/>
 <param name="fq_filename_delim_idx" type="integer" value="1" label="File name delimitor index (--fq_filename_delim_idx)" />
 <param name="centrifuge_extract_bug" type="text" value="Escherichia coli" label="Reads belonging to this taxa are extracted and a MAG is generated to allow for serotyping"/>
 <param name="genome_size" type="text" optional="true" value="5.5m" label="Estimated genome size" help="For example, 5m or 2.6g.">
 <validator type="regex" message="Genome size must be a float  or integer, optionally followed by the a unit prefix (kmg)">^([0-9]*[.])?[0-9]+[kmg]?$</validator>
 </param>
-<param name="runtime_profile" type="select" label="Run time profile">
+<!-- <param name="runtime_profile" type="select" label="Run time profile">
 <option value="kondagac" selected="true">conda</option>
 <option value="cingularitygac">singularity</option>
-</param>
+</param> -->
 </inputs>
 <outputs>
 <data name="multiqc_report" format="html" label="${pipeline}: MultiQC Report on ${on_string}" from_work_dir="multiqc_report.html"/>
 <collection name="assembled_mags" type="list" label="${pipeline}: Assembled MAGs on ${on_string}">
 <discover_datasets pattern="(?P&lt;name&gt;.*)\.assembly_filtered_contigs\.fasta" ext="fasta" directory="kraken2_extract_contigs"/>
 .. class:: infomark
 **Testing and Validation**
-The pipeline has been wrapped to make it work in Galaxy. It takes in either paired or unpaired short reads or long reads, generates MAGs and performs
+The CPIPES - Centriflaken Nextflow pipeline has been wrapped to make it work in Galaxy. It takes in either paired or unpaired short reads or long reads, generates MAGs and performs
 in silico-based analysis (i.e., virulence gene finding). Additionally, AMR gene finding analysis is also included in Centriflaken and performed on MAGs
 of interest. The final summary plots and tables can be downloaded from the provided MultiQC HTML report generated as part of the pipeline.
 The Centriflaken pipeline was validated with data from our previously published method (Maguire et al, 2021) and was able to replicate the detection
-and classification of STECs for each sample. We tested the pipeline with nanopore data obtained from 21 additional enriched samples from
+and classification of STECs for each sample. We tested the pipeline with Nanopore data obtained from 21 additional enriched samples from
 irrigation water and was able to perform the entire precision metagenomics analysis in less than 5 hours for all of them. All the original testing and validation was
 done on the command line on the CFSAN Raven2 HPC Cluster.
 ----

Mercurial > repos > kkonganti > cfsan_centriflaken

comparison cfsan_centriflaken.xml @ 47:30191f39a957