annotate 0.4.2/modules/centrifuge/extract/README.md @ 0:082e0091e813 draft default tip

planemo upload
author galaxytrakr
date Fri, 29 May 2026 13:27:47 +0000
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
1 # NextFlow DSL2 Module
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
2
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
3 ```bash
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
4 CENTRIFUGE_EXTRACT
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
5 ```
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
6
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
7 ## Description
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
8
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
9 Extract FASTQ reads given a FASTQ file originally used with `centrifuge` tool and a taxa of interest. This specific module uses only GNU Coreutils to create a list of FASTQ read ids that need to be extract. See also `CENTRIFUGE_PROCESS` module which uses a `python` script to generate the FASTQ read ids.
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
10
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
11 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
12  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
13
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
14 ### `input:`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
15
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
16 ___
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
17
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
18 Type: `tuple`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
19
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
20 Takes in the following 2 tuples:
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
21
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
22 - A tuple of metadata (`meta`) and of type `path` (`centrifuge_output`) per sample (`id:`).
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
23
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
24 - A tuple of metadata (`meta`) and of type `path` (`centrifuge_report`) per sample (`id:`).
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
25
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
26 Ex:
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
27
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
28 ```groovy
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
29 [
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
30 [ id: 'FAL00870',
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
31 strandedness: 'unstranded',
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
32 single_end: true,
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
33 centrifuge_x: '/hpc/db/centrifuge/2022-04-12/ab'
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
34 ],
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
35 '/hpc/scratch/test/FAL000870/f1.merged.cent_out.output.txt'
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
36 ]
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
37
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
38 [
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
39 [ id: 'FAL00870',
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
40 strandedness: 'unstranded',
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
41 single_end: true,
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
42 centrifuge_x: '/hpc/db/centrifuge/2022-04-12/ab'
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
43 ],
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
44 '/hpc/scratch/test/FAL000870/f1.merged.cent_out.report.txt'
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
45 ]
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
46 ```
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
47
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
48 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
49  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
50
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
51 #### `meta`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
52
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
53 Type: Groovy Map
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
54
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
55 A Groovy Map containing the metadata about the FASTQ file.
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
56
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
57 Ex:
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
58
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
59 ```groovy
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
60 [
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
61 id: 'FAL00870',
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
62 strandedness: 'unstranded',
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
63 single_end: true,
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
64 centrifuge_x: '/hpc/db/centrifuge/2022-04-12/ab'
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
65 ]
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
66 ```
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
67
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
68 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
69  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
70
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
71 #### `centrifuge_report`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
72
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
73 Type: `path`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
74
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
75 NextFlow input type of `path` pointing to `centrifuge` report file generated using `--report-file` option of `centrifuge` tool.
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
76
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
77 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
78  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
79
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
80 #### `centrifuge_output`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
81
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
82 Type: `path`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
83
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
84 NextFlow input type of `path` pointing to `centrifuge` output file generated using `-S` option of `centrifuge` tool.
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
85
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
86 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
87  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
88
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
89 ### `output:`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
90
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
91 ___
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
92
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
93 Type: `tuple`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
94
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
95 Outputs a tuple of metadata (`meta` from `input:`) and list of extracted FASTQ read ids.
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
96
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
97 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
98  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
99
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
100 #### `extracted`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
101
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
102 Type: `path`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
103
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
104 NextFlow output type of `path` pointing to the extracted FASTQ read ids belonging to a particular taxa (`*.extract-centrifuge-bug-ids.txt`).
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
105
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
106 \
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
107  
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
108
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
109 #### `versions`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
110
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
111 Type: `path`
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
112
082e0091e813 planemo upload
galaxytrakr
parents:
diff changeset
113 NextFlow output type of `path` pointing to the `.yml` file storing software versions for this process.