# HG changeset patch # User galaxytrakr # Date 1774290766 0 # Node ID 3529ce6bcde7be21855ad8162224513e4ddbc58e # Parent 8c60cd4c0ca778e069361697cf43f474984fa4d3 planemo upload for repository https://github.com/CFSAN-Biostatistics/galaxytrakr-tools commit 74cea66080315c44e81547de5fe02f046bcaa1f0 diff -r 8c60cd4c0ca7 -r 3529ce6bcde7 aws_sra.xml --- a/aws_sra.xml Mon Mar 23 18:17:47 2026 +0000 +++ b/aws_sra.xml Mon Mar 23 18:32:46 2026 +0000 @@ -1,4 +1,4 @@ - + Fetch SRA data files from NCBI's public AWS S3 buckets @@ -49,27 +49,24 @@ #elif $action.mode == 'fastq_dump' #set $acc = $action.accession.strip() - ## 1. Download the .sra file from S3 + ## 1. Download the file from S3 mkdir -p sra_cache && - aws s3 cp - --no-sign-request - '${s3_base}/sra/${acc}/${acc}.sra' - ./sra_cache/${acc}.sra && + aws s3 cp \ + --no-sign-request \ + '${s3_base}/sra/${acc}/${acc}' \ + ./sra_cache/${acc} && - ## 2. Convert with fasterq-dump --split-files. - ## Paired runs → _1.fastq + _2.fastq - ## Single runs → .fastq (no _1/_2 suffix) - ## We always use --split-files; single-end runs simply produce one file. + ## 2. Convert with fasterq-dump (CORRECTED: uses extensionless file) mkdir -p fastq_out && - fasterq-dump - ./sra_cache/${acc}.sra - --outdir ./fastq_out - --temp . - --threads \${GALAXY_SLOTS:-4} + fasterq-dump \ + ./sra_cache/${acc} \ + --outdir ./fastq_out \ + --temp . \ + --threads \${GALAXY_SLOTS:-4} \ --split-files && - ## 3. Compress with pigz (fasterq-dump does not gzip natively) + ## 3. Compress with pigz pigz -p \${GALAXY_SLOTS:-4} ./fastq_out/*.fastq && ## 4. Stage outputs @@ -147,7 +144,7 @@ - ^\[SED\]RR\[0-9\]+$ + ^[SED]RR[0-9]+$