annotate 0.2.1/modules/centrifuge/process/README.md @ 34:4f9f145d1946

"planemo upload"
author kkonganti
date Wed, 29 Jun 2022 15:27:50 -0400
parents 77494b0fa3c7
children
rev   line source
kkonganti@0 1 # NextFlow DSL2 Module
kkonganti@0 2
kkonganti@0 3 ```bash
kkonganti@0 4 CENTRIFUGE_PROCESS
kkonganti@0 5 ```
kkonganti@0 6
kkonganti@0 7 ## Description
kkonganti@0 8
kkonganti@0 9 Extract FASTQ reads given a FASTQ file originally used with `centrifuge` tool and a taxa of interest. This specific module uses a `python` script to generate the FASTQ read ids and as such requires a `bin` folder with `process_centrifuge_output.py` to be present where the NextFlow script will be executed from. See also `CENTRIFUGE_EXTRACT` module which uses only GNU Coreutils to create a list of FASTQ read ids that need to be extracted.
kkonganti@0 10
kkonganti@0 11 \
kkonganti@0 12  
kkonganti@0 13
kkonganti@0 14 ### `input:`
kkonganti@0 15
kkonganti@0 16 ___
kkonganti@0 17
kkonganti@0 18 Type: `tuple`
kkonganti@0 19
kkonganti@0 20 Takes in a tuple in order of metadata (`meta`), a `path` (`centrifuge_report`) type and another `path` (`centrifuge_report`) per sample (`id:`).
kkonganti@0 21
kkonganti@0 22 Ex:
kkonganti@0 23
kkonganti@0 24 ```groovy
kkonganti@0 25 [
kkonganti@0 26 [ id: 'FAL00870',
kkonganti@0 27 strandedness: 'unstranded',
kkonganti@0 28 single_end: true,
kkonganti@0 29 centrifuge_x: '/hpc/db/centrifuge/2022-04-12/ab'
kkonganti@0 30 ],
kkonganti@0 31 '/hpc/scratch/test/FAL000870/f1.merged.cent_out.report.txt',
kkonganti@0 32 '/hpc/scratch/test/FAL000870/f1.merged.cent_out.output.txt'
kkonganti@0 33 ]
kkonganti@0 34 ```
kkonganti@0 35
kkonganti@0 36 \
kkonganti@0 37  
kkonganti@0 38
kkonganti@0 39 #### `meta`
kkonganti@0 40
kkonganti@0 41 Type: Groovy Map
kkonganti@0 42
kkonganti@0 43 A Groovy Map containing the metadata about the FASTQ file.
kkonganti@0 44
kkonganti@0 45 Ex:
kkonganti@0 46
kkonganti@0 47 ```groovy
kkonganti@0 48 [
kkonganti@0 49 id: 'FAL00870',
kkonganti@0 50 strandedness: 'unstranded',
kkonganti@0 51 single_end: true,
kkonganti@0 52 centrifuge_x: '/hpc/db/centrifuge/2022-04-12/ab'
kkonganti@0 53 ]
kkonganti@0 54 ```
kkonganti@0 55
kkonganti@0 56 \
kkonganti@0 57  
kkonganti@0 58
kkonganti@0 59 #### `centrifuge_report`
kkonganti@0 60
kkonganti@0 61 Type: `path`
kkonganti@0 62
kkonganti@0 63 NextFlow input type of `path` pointing to `centrifuge` report file generated using `--report-file` option of `centrifuge` tool.
kkonganti@0 64
kkonganti@0 65 \
kkonganti@0 66  
kkonganti@0 67
kkonganti@0 68 #### `centrifuge_output`
kkonganti@0 69
kkonganti@0 70 Type: `path`
kkonganti@0 71
kkonganti@0 72 NextFlow input type of `path` pointing to `centrifuge` output file generated using `-S` option of `centrifuge` tool.
kkonganti@0 73
kkonganti@0 74 \
kkonganti@0 75  
kkonganti@0 76
kkonganti@0 77 ### `output:`
kkonganti@0 78
kkonganti@0 79 ___
kkonganti@0 80
kkonganti@0 81 Type: `tuple`
kkonganti@0 82
kkonganti@0 83 Outputs a tuple of metadata (`meta` from `input:`) and list of extracted FASTQ read ids.
kkonganti@0 84
kkonganti@0 85 \
kkonganti@0 86  
kkonganti@0 87
kkonganti@0 88 #### `extracted`
kkonganti@0 89
kkonganti@0 90 Type: `path`
kkonganti@0 91
kkonganti@0 92 NextFlow output type of `path` pointing to the extracted FASTQ read ids belonging to a particular taxa (`*.extract-centrifuge-bug-ids.txt`).
kkonganti@0 93
kkonganti@0 94 \
kkonganti@0 95  
kkonganti@0 96
kkonganti@0 97 #### `versions`
kkonganti@0 98
kkonganti@0 99 Type: `path`
kkonganti@0 100
kkonganti@0 101 NextFlow output type of `path` pointing to the `.yml` file storing software versions for this process.