view 0.4.2/modules/seqkit/rmdup/README.md @ 105:52045ea4679d

"planemo upload"
author kkonganti
date Thu, 27 Jun 2024 14:17:26 -0400
parents
children
line wrap: on
line source
# NextFlow DSL2 Module

```bash
SEQKIT_RMDUP
```

## Description

Run `seqkit rmdup` command on reads in FASTQ format. Produces a filtered FASTQ file without duplicate sequences as per the strategy set using `ext.args` within the process scope.

\
 

### `input:`

___

Type: `tuple`

Takes in the following tuple of metadata (`meta`) and a list of reads of type `path` (`reads`) per sample (`id:`).

Ex:

```groovy
[
    [ id: 'FAL00870',
       strandedness: 'unstranded',
       single_end: true,
       centrifuge_x: '/hpc/db/centrifuge/2022-04-12/ab'
    ],
    '/hpc/scratch/test/FAL000870/f1.merged.fq.gz'
]
```

\
 

#### `meta`

Type: Groovy Map

A Groovy Map containing the metadata about the FASTQ file.

Ex:

```groovy
[
    id: 'FAL00870',
    strandedness: 'unstranded',
    single_end: true
]
```

\
 

#### `reads`

Type: `path`

NextFlow input type of `path` pointing to FASTQ files on which `seqkit rmdup` should be run.

\
 

#### `args`

Type: Groovy String

String of optional command-line arguments to be passed to the tool. This can be mentioned in `process` scope within `withName:process_name` block using `ext.args` option within your `nextflow.config` file.

Ex:

```groovy
withName: 'SEQKIT_DUP' {
    ext.args = '-t dna'
}
```

### `output:`

___

Type: `tuple`

Outputs a tuple of metadata (`meta` from `input:`) and filtered gzipped FASTQ file.

\
 

#### `fastx`

Type: `path`

NextFlow output type of `path` pointing to the FASTQ format filtered gzipped file per sample (`id:`).

\
 

#### `versions`

Type: `path`

NextFlow output type of `path` pointing to the `.yml` file storing software versions for this process.