kkonganti@105: # NextFlow DSL2 Module kkonganti@105: kkonganti@105: ```bash kkonganti@105: CENTRIFUGE_PROCESS kkonganti@105: ``` kkonganti@105: kkonganti@105: ## Description kkonganti@105: kkonganti@105: Extract FASTQ reads given a FASTQ file originally used with `centrifuge` tool and a taxa of interest. This specific module uses a `python` script to generate the FASTQ read ids and as such requires a `bin` folder with `process_centrifuge_output.py` to be present where the NextFlow script will be executed from. See also `CENTRIFUGE_EXTRACT` module which uses only GNU Coreutils to create a list of FASTQ read ids that need to be extracted. kkonganti@105: kkonganti@105: \ kkonganti@105:   kkonganti@105: kkonganti@105: ### `input:` kkonganti@105: kkonganti@105: ___ kkonganti@105: kkonganti@105: Type: `tuple` kkonganti@105: kkonganti@105: Takes in a tuple in order of metadata (`meta`), a `path` (`centrifuge_report`) type and another `path` (`centrifuge_report`) per sample (`id:`). kkonganti@105: kkonganti@105: Ex: kkonganti@105: kkonganti@105: ```groovy kkonganti@105: [ kkonganti@105: [ id: 'FAL00870', kkonganti@105: strandedness: 'unstranded', kkonganti@105: single_end: true, kkonganti@105: centrifuge_x: '/hpc/db/centrifuge/2022-04-12/ab' kkonganti@105: ], kkonganti@105: '/hpc/scratch/test/FAL000870/f1.merged.cent_out.report.txt', kkonganti@105: '/hpc/scratch/test/FAL000870/f1.merged.cent_out.output.txt' kkonganti@105: ] kkonganti@105: ``` kkonganti@105: kkonganti@105: \ kkonganti@105:   kkonganti@105: kkonganti@105: #### `meta` kkonganti@105: kkonganti@105: Type: Groovy Map kkonganti@105: kkonganti@105: A Groovy Map containing the metadata about the FASTQ file. kkonganti@105: kkonganti@105: Ex: kkonganti@105: kkonganti@105: ```groovy kkonganti@105: [ kkonganti@105: id: 'FAL00870', kkonganti@105: strandedness: 'unstranded', kkonganti@105: single_end: true, kkonganti@105: centrifuge_x: '/hpc/db/centrifuge/2022-04-12/ab' kkonganti@105: ] kkonganti@105: ``` kkonganti@105: kkonganti@105: \ kkonganti@105:   kkonganti@105: kkonganti@105: #### `centrifuge_report` kkonganti@105: kkonganti@105: Type: `path` kkonganti@105: kkonganti@105: NextFlow input type of `path` pointing to `centrifuge` report file generated using `--report-file` option of `centrifuge` tool. kkonganti@105: kkonganti@105: \ kkonganti@105:   kkonganti@105: kkonganti@105: #### `centrifuge_output` kkonganti@105: kkonganti@105: Type: `path` kkonganti@105: kkonganti@105: NextFlow input type of `path` pointing to `centrifuge` output file generated using `-S` option of `centrifuge` tool. kkonganti@105: kkonganti@105: \ kkonganti@105:   kkonganti@105: kkonganti@105: ### `output:` kkonganti@105: kkonganti@105: ___ kkonganti@105: kkonganti@105: Type: `tuple` kkonganti@105: kkonganti@105: Outputs a tuple of metadata (`meta` from `input:`) and list of extracted FASTQ read ids. kkonganti@105: kkonganti@105: \ kkonganti@105:   kkonganti@105: kkonganti@105: #### `extracted` kkonganti@105: kkonganti@105: Type: `path` kkonganti@105: kkonganti@105: NextFlow output type of `path` pointing to the extracted FASTQ read ids belonging to a particular taxa (`*.extract-centrifuge-bug-ids.txt`). kkonganti@105: kkonganti@105: \ kkonganti@105:   kkonganti@105: kkonganti@105: #### `versions` kkonganti@105: kkonganti@105: Type: `path` kkonganti@105: kkonganti@105: NextFlow output type of `path` pointing to the `.yml` file storing software versions for this process.