comparison CSP2/CSP2_env/env-d9b9114564458d9d-741b3de822f2aaca6c6caa4325c4afce/opt/mummer-3.23/docs/run-mummer3.README @ 69:33d812a61356

planemo upload commit 2e9511a184a1ca667c7be0c6321a36dc4e3d116d
author jpayne
date Tue, 18 Mar 2025 17:55:14 -0400
parents
children
comparison
equal deleted inserted replaced
67:0e9998148a16 69:33d812a61356
1 -=- run-mummer3 (MUMmer3.0) README -=-
2
3 ** NOTE **
4 This manual is outdated, please refer to the HTML documentation included in
5 this distribution or at:
6
7 http://mummer.sourceforge.net
8 http://mummer.sourceforge.net/manual
9 http://mummer.sourceforge.net/examples
10
11 If any of this code is used in any publication, please cite the following:
12
13 Fast algorithms for large-scale genome alignment and comparison.
14 A.L. Delcher. A. Phillippy, J. Carlton, and S.L. Salzberg. Nucleic
15 Acids Research 30:11 (2002), 2478-2483.
16
17
18 USAGE: ./run-mummer3 <reference> <query> <file prefix>
19
20 <reference> specifies the file with the first sequence in FastA
21 format. No more than on sequence is allowed.
22 <query> specifies the multi-fasta sequence FastA file that contains
23 the query sequences, to be aligned to the reference.
24 <file prefix> specifies the file prefix for the output files.
25
26 NOTE:
27 Coordinates from this script will be relative to their respective
28 strand. Thus reverse matches will have coordinates that reference the
29 reverse complemented query sequence! If this coordinate system is
30 confusing, use nucmer instead.
31
32
33 MATCHING PARAMETERS:
34 It is important to customize the command line parameters of the matching
35 and clustering algorithms to reflect the users desired alignment results.
36 All of the programs in the run-mummer3 script (mummer, mgaps, and combineMUMs)
37 can be passed various parameters to alter their performance and output. To view
38 these options, run each program from the command line with the "-h" option to
39 view their definable parameters. Then to make a permanent change to the script,
40 simply add the desired parameters to the script using a standard text editor.
41
42 NOTE: For SNP hunters the -D option to combineMUMs will be very handy for
43 locating and parsing the difference positions.
44
45
46 MATCHING ALGORITHMS:
47 It is also possible to change the matching algorithm used in the alignment
48 generation. type 'mummer -help' to see the algorithm switches.
49
50
51 OUTPUT FILES:
52 The four output files of this script are:
53 <prefix>.out
54 <prefix>.gaps
55 <prefix>.errorsgaps
56 <prefix>.align
57
58 + <prefix>.out
59 This file lists the coordinates of the matches found and the length
60 of each match found. A group of matches to a specific sequence in
61 the query file is headed by the FastA tag of that sequence. The first
62 and second columns list the start coordinates in the reference and
63 query sequences respectively. The final column is the length of the
64 match.
65
66 + <prefix>.gaps
67 The headers and 1st - 3rd columns of this file are exactly like the
68 <prefix>.out file above. However, matches are now sorted and clustered
69 according to their position in each of the sequences. Clusters of
70 matches are separated by a "#" character and matches from different
71 sequences in the query file are still separated by the FastA header
72 for that sequence.
73 The additional columns in this file (4th - 6th) describe
74 the gaps between adjacent matches. The 4th column represents the
75 number of overlapping characters between the current match and the
76 previous match. The 5th column displays the number of characters
77 between the beginning of this match in the reference and the end
78 of the previous match in the reference. Finally, the 6th column
79 displays the number of characters between the beginning of this
80 match in the query sequence and the end of the previous match in
81 the query sequence.
82
83 + <prefix>.errorsgaps
84 This file is identical to the <prefix>.gaps file, except for the
85 addition of a 7th column that displays the number of errors in the
86 gap described by columns 5 and 6. This is perhaps the most helpful
87 output file of the script, as it is easy to parse and interpret.
88
89 + <prefix>.align
90 This file also expands on the <prefix>.gaps file, but in a different
91 way than the witherrors.gaps file. This file intersperses the lines
92 of the <prefix>.gaps file with an actual alignment of the gap between
93 the previous two matches. Additionally, wherever there was a "#"
94 character in the <prefix>.gaps file, the <prefix>.align file adds
95 a line that lists the encompassing start and stop coordinates of the
96 previous alignment region, a error ratio, and an error percentage.
97
98
99 Email questions, comments or bug reports to: <mummer-help@lists.sourceforge.net>