annotate test-data/README.md @ 7:de2011569700 draft default tip

planemo upload commit 50e7d8610c04e06eb15f09131d6a2d1b204de6a9
author galaxytrakr
date Fri, 13 Mar 2026 15:43:54 +0000
parents 3e93d920163b
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
6
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
1 # Test data for AMRFinderPlus
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
2
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
3 ## Create `test-db`
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
4
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
5 1. Download ARMFinderPlus database from NCBI using the data-manager Python script after modifying the `main` function to keep only `amrfinderplus_download.download_amrfinderplus_db()`
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
6 2. Remove `AMR_DNA` files that are not `Enterococcus`
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
7 3. Remove lines not related to `Enterococcus` in `taxgroup.tab`
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
8 4. Keep in `AMR_CDS`, `AMRProt`, and `ReferenceGeneCatalog.txt` only sequences listed in `to_keep_from_test-db`
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
9 5. Keep in `amr_targets.fa` only sequences listed in `amr_targets_to_keep_from_test-db`
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
10 6. Copy the `AMR.LIB` from a previous version
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
11 7. Run the data-manager Python script after modifying `main` function like this:
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
12
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
13 ```
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
14 amrfinderplus_download.download_amrfinderplus_db()
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
15 amrfinderplus_download.amrfinderplus_db_path = f'{amrfinderplus_download._output_dir}/{amrfinderplus_download._db_name}'
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
16 amrfinderplus_download.make_hmm_profile()
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
17 amrfinderplus_download.make_blastdb()
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
18 ```
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
19
3e93d920163b planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
estrain
parents:
diff changeset
20 6. Move the `amrfinderplus-db` folder content to `test-db`