jpayne@68
|
1 '\" t
|
jpayne@68
|
2 .\" Title: mash-screen
|
jpayne@68
|
3 .\" Author: [see the "AUTHOR(S)" section]
|
jpayne@68
|
4 .\" Generator: Asciidoctor 2.0.10
|
jpayne@68
|
5 .\" Date: 2019-12-13
|
jpayne@68
|
6 .\" Manual: \ \&
|
jpayne@68
|
7 .\" Source: \ \&
|
jpayne@68
|
8 .\" Language: English
|
jpayne@68
|
9 .\"
|
jpayne@68
|
10 .TH "MASH\-SCREEN" "1" "2019-12-13" "\ \&" "\ \&"
|
jpayne@68
|
11 .ie \n(.g .ds Aq \(aq
|
jpayne@68
|
12 .el .ds Aq '
|
jpayne@68
|
13 .ss \n[.ss] 0
|
jpayne@68
|
14 .nh
|
jpayne@68
|
15 .ad l
|
jpayne@68
|
16 .de URL
|
jpayne@68
|
17 \fI\\$2\fP <\\$1>\\$3
|
jpayne@68
|
18 ..
|
jpayne@68
|
19 .als MTO URL
|
jpayne@68
|
20 .if \n[.g] \{\
|
jpayne@68
|
21 . mso www.tmac
|
jpayne@68
|
22 . am URL
|
jpayne@68
|
23 . ad l
|
jpayne@68
|
24 . .
|
jpayne@68
|
25 . am MTO
|
jpayne@68
|
26 . ad l
|
jpayne@68
|
27 . .
|
jpayne@68
|
28 . LINKSTYLE blue R < >
|
jpayne@68
|
29 .\}
|
jpayne@68
|
30 .SH "NAME"
|
jpayne@68
|
31 mash\-screen \- determine whether query sequences are within a larger pool of sequences
|
jpayne@68
|
32 .SH "SYNOPSIS"
|
jpayne@68
|
33 .sp
|
jpayne@68
|
34 \fBmash screen\fP [options] <queries>.msh <pool> [<pool>] ...
|
jpayne@68
|
35 .SH "DESCRIPTION"
|
jpayne@68
|
36 .sp
|
jpayne@68
|
37 Determine how well query sequences are contained within a pool of sequences.
|
jpayne@68
|
38 The queries must be formatted as a single Mash sketch file (.msh), created
|
jpayne@68
|
39 with the \f(CRmash sketch\fP command. The <pool> files can be contigs or reads, in
|
jpayne@68
|
40 fasta or fastq, gzipped or not, and "\-" can be given for <pool> to read from
|
jpayne@68
|
41 standard input. The <pool> sequences are assumed to be nucleotides, and will
|
jpayne@68
|
42 be 6\-frame translated if the <queries> are amino acids. The output fields are
|
jpayne@68
|
43 [identity, shared\-hashes, median\-multiplicity, p\-value, query\-ID, query\-comment],
|
jpayne@68
|
44 where median\-multiplicity is computed for shared hashes, based on the number of
|
jpayne@68
|
45 observations of those hashes within the pool.
|
jpayne@68
|
46 .SH "OPTIONS"
|
jpayne@68
|
47 .sp
|
jpayne@68
|
48 \fB\-h\fP
|
jpayne@68
|
49 .RS 4
|
jpayne@68
|
50 Help
|
jpayne@68
|
51 .RE
|
jpayne@68
|
52 .sp
|
jpayne@68
|
53 \fB\-p\fP <int>
|
jpayne@68
|
54 .RS 4
|
jpayne@68
|
55 Parallelism. This many threads will be spawned for processing.
|
jpayne@68
|
56 .RE
|
jpayne@68
|
57 .sp
|
jpayne@68
|
58 \fB\-w\fP
|
jpayne@68
|
59 .RS 4
|
jpayne@68
|
60 Winner\-takes\-all strategy for identity estimates. After counting
|
jpayne@68
|
61 hashes for each query, hashes that appear in multiple queries will
|
jpayne@68
|
62 be removed from all except the one with the best identity (ties
|
jpayne@68
|
63 broken by larger query), and other identities will be reduced. This
|
jpayne@68
|
64 removes output redundancy, providing a rough compositional outline.
|
jpayne@68
|
65 .RE
|
jpayne@68
|
66 .SS "Output"
|
jpayne@68
|
67 .sp
|
jpayne@68
|
68 \fB\-i\fP <num>
|
jpayne@68
|
69 .RS 4
|
jpayne@68
|
70 Minimum identity to report. Inclusive unless set to zero, in which
|
jpayne@68
|
71 case only identities greater than zero (i.e. with at least one
|
jpayne@68
|
72 shared hash) will be reported. Set to \-1 to output everything.
|
jpayne@68
|
73 .RE
|
jpayne@68
|
74 .sp
|
jpayne@68
|
75 \fB\-v\fP <num>
|
jpayne@68
|
76 .RS 4
|
jpayne@68
|
77 Maximum p\-value to report.
|
jpayne@68
|
78 .RE
|
jpayne@68
|
79 .SH "SEE ALSO"
|
jpayne@68
|
80 .sp
|
jpayne@68
|
81 mash(1) |