annotate 0.4.2/modules/centrifuge/process/README.md @ 0:082e0091e813 draft default tip

planemo upload
author galaxytrakr
date Fri, 29 May 2026 13:27:47 +0000
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
1 # NextFlow DSL2 Module
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
2
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
3 ```bash
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
4 CENTRIFUGE_PROCESS
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
5 ```
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
6
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
7 ## Description
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
8
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
9 Extract FASTQ reads given a FASTQ file originally used with `centrifuge` tool and a taxa of interest. This specific module uses a `python` script to generate the FASTQ read ids and as such requires a `bin` folder with `process_centrifuge_output.py` to be present where the NextFlow script will be executed from. See also `CENTRIFUGE_EXTRACT` module which uses only GNU Coreutils to create a list of FASTQ read ids that need to be extracted.
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
10
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
11 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
12  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
13
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
14 ### `input:`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
15
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
16 ___
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
17
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
18 Type: `tuple`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
19
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
20 Takes in a tuple in order of metadata (`meta`), a `path` (`centrifuge_report`) type and another `path` (`centrifuge_report`) per sample (`id:`).
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
21
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
22 Ex:
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
23
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
24 ```groovy
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
25 [
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
26 [ id: 'FAL00870',
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
27 strandedness: 'unstranded',
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
28 single_end: true,
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
29 centrifuge_x: '/hpc/db/centrifuge/2022-04-12/ab'
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
30 ],
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
31 '/hpc/scratch/test/FAL000870/f1.merged.cent_out.report.txt',
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
32 '/hpc/scratch/test/FAL000870/f1.merged.cent_out.output.txt'
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
33 ]
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
34 ```
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
35
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
36 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
37  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
38
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
39 #### `meta`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
40
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
41 Type: Groovy Map
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
42
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
43 A Groovy Map containing the metadata about the FASTQ file.
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
44
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
45 Ex:
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
46
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
47 ```groovy
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
48 [
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
49 id: 'FAL00870',
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
50 strandedness: 'unstranded',
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
51 single_end: true,
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
52 centrifuge_x: '/hpc/db/centrifuge/2022-04-12/ab'
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
53 ]
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
54 ```
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
55
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
56 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
57  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
58
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
59 #### `centrifuge_report`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
60
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
61 Type: `path`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
62
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
63 NextFlow input type of `path` pointing to `centrifuge` report file generated using `--report-file` option of `centrifuge` tool.
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
64
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
65 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
66  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
67
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
68 #### `centrifuge_output`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
69
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
70 Type: `path`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
71
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
72 NextFlow input type of `path` pointing to `centrifuge` output file generated using `-S` option of `centrifuge` tool.
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
73
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
74 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
75  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
76
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
77 ### `output:`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
78
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
79 ___
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
80
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
81 Type: `tuple`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
82
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
83 Outputs a tuple of metadata (`meta` from `input:`) and list of extracted FASTQ read ids.
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
84
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
85 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
86  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
87
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
88 #### `extracted`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
89
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
90 Type: `path`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
91
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
92 NextFlow output type of `path` pointing to the extracted FASTQ read ids belonging to a particular taxa (`*.extract-centrifuge-bug-ids.txt`).
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
93
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
94 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
95  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
96
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
97 #### `versions`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
98
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
99 Type: `path`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
100
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
101 NextFlow output type of `path` pointing to the `.yml` file storing software versions for this process.