annotate 0.4.2/modules/centrifuge/process/README.md @ 143:620bffa66bbb tip

planemo upload
author kkonganti
date Thu, 11 Jul 2024 14:48:07 -0400
parents 52045ea4679d
children
rev   line source
kkonganti@105 1 # NextFlow DSL2 Module
kkonganti@105 2
kkonganti@105 3 ```bash
kkonganti@105 4 CENTRIFUGE_PROCESS
kkonganti@105 5 ```
kkonganti@105 6
kkonganti@105 7 ## Description
kkonganti@105 8
kkonganti@105 9 Extract FASTQ reads given a FASTQ file originally used with `centrifuge` tool and a taxa of interest. This specific module uses a `python` script to generate the FASTQ read ids and as such requires a `bin` folder with `process_centrifuge_output.py` to be present where the NextFlow script will be executed from. See also `CENTRIFUGE_EXTRACT` module which uses only GNU Coreutils to create a list of FASTQ read ids that need to be extracted.
kkonganti@105 10
kkonganti@105 11 \
kkonganti@105 12  
kkonganti@105 13
kkonganti@105 14 ### `input:`
kkonganti@105 15
kkonganti@105 16 ___
kkonganti@105 17
kkonganti@105 18 Type: `tuple`
kkonganti@105 19
kkonganti@105 20 Takes in a tuple in order of metadata (`meta`), a `path` (`centrifuge_report`) type and another `path` (`centrifuge_report`) per sample (`id:`).
kkonganti@105 21
kkonganti@105 22 Ex:
kkonganti@105 23
kkonganti@105 24 ```groovy
kkonganti@105 25 [
kkonganti@105 26 [ id: 'FAL00870',
kkonganti@105 27 strandedness: 'unstranded',
kkonganti@105 28 single_end: true,
kkonganti@105 29 centrifuge_x: '/hpc/db/centrifuge/2022-04-12/ab'
kkonganti@105 30 ],
kkonganti@105 31 '/hpc/scratch/test/FAL000870/f1.merged.cent_out.report.txt',
kkonganti@105 32 '/hpc/scratch/test/FAL000870/f1.merged.cent_out.output.txt'
kkonganti@105 33 ]
kkonganti@105 34 ```
kkonganti@105 35
kkonganti@105 36 \
kkonganti@105 37  
kkonganti@105 38
kkonganti@105 39 #### `meta`
kkonganti@105 40
kkonganti@105 41 Type: Groovy Map
kkonganti@105 42
kkonganti@105 43 A Groovy Map containing the metadata about the FASTQ file.
kkonganti@105 44
kkonganti@105 45 Ex:
kkonganti@105 46
kkonganti@105 47 ```groovy
kkonganti@105 48 [
kkonganti@105 49 id: 'FAL00870',
kkonganti@105 50 strandedness: 'unstranded',
kkonganti@105 51 single_end: true,
kkonganti@105 52 centrifuge_x: '/hpc/db/centrifuge/2022-04-12/ab'
kkonganti@105 53 ]
kkonganti@105 54 ```
kkonganti@105 55
kkonganti@105 56 \
kkonganti@105 57  
kkonganti@105 58
kkonganti@105 59 #### `centrifuge_report`
kkonganti@105 60
kkonganti@105 61 Type: `path`
kkonganti@105 62
kkonganti@105 63 NextFlow input type of `path` pointing to `centrifuge` report file generated using `--report-file` option of `centrifuge` tool.
kkonganti@105 64
kkonganti@105 65 \
kkonganti@105 66  
kkonganti@105 67
kkonganti@105 68 #### `centrifuge_output`
kkonganti@105 69
kkonganti@105 70 Type: `path`
kkonganti@105 71
kkonganti@105 72 NextFlow input type of `path` pointing to `centrifuge` output file generated using `-S` option of `centrifuge` tool.
kkonganti@105 73
kkonganti@105 74 \
kkonganti@105 75  
kkonganti@105 76
kkonganti@105 77 ### `output:`
kkonganti@105 78
kkonganti@105 79 ___
kkonganti@105 80
kkonganti@105 81 Type: `tuple`
kkonganti@105 82
kkonganti@105 83 Outputs a tuple of metadata (`meta` from `input:`) and list of extracted FASTQ read ids.
kkonganti@105 84
kkonganti@105 85 \
kkonganti@105 86  
kkonganti@105 87
kkonganti@105 88 #### `extracted`
kkonganti@105 89
kkonganti@105 90 Type: `path`
kkonganti@105 91
kkonganti@105 92 NextFlow output type of `path` pointing to the extracted FASTQ read ids belonging to a particular taxa (`*.extract-centrifuge-bug-ids.txt`).
kkonganti@105 93
kkonganti@105 94 \
kkonganti@105 95  
kkonganti@105 96
kkonganti@105 97 #### `versions`
kkonganti@105 98
kkonganti@105 99 Type: `path`
kkonganti@105 100
kkonganti@105 101 NextFlow output type of `path` pointing to the `.yml` file storing software versions for this process.