diff test-data/README.md @ 6:3e93d920163b draft

planemo upload commit 3773c71b3fac75f58b5af497e19d54dca0d81dfd
author estrain
date Thu, 12 Mar 2026 20:38:44 +0000
parents
children
line wrap: on
line diff
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/test-data/README.md	Thu Mar 12 20:38:44 2026 +0000
@@ -0,0 +1,20 @@
+# Test data for AMRFinderPlus
+
+## Create `test-db` 
+
+1. Download ARMFinderPlus database from NCBI using the data-manager Python script after modifying the `main` function to keep only `amrfinderplus_download.download_amrfinderplus_db()`
+2. Remove `AMR_DNA` files that are not `Enterococcus`
+3. Remove lines not related to `Enterococcus` in `taxgroup.tab`
+4. Keep in `AMR_CDS`, `AMRProt`, and `ReferenceGeneCatalog.txt` only sequences listed in `to_keep_from_test-db`
+5. Keep in `amr_targets.fa` only sequences listed in `amr_targets_to_keep_from_test-db`
+6. Copy the `AMR.LIB` from a previous version
+7. Run the data-manager Python script after modifying `main` function like this:
+
+    ```
+    amrfinderplus_download.download_amrfinderplus_db()
+    amrfinderplus_download.amrfinderplus_db_path = f'{amrfinderplus_download._output_dir}/{amrfinderplus_download._db_name}'
+    amrfinderplus_download.make_hmm_profile()
+    amrfinderplus_download.make_blastdb()
+    ```
+
+6. Move the `amrfinderplus-db` folder content to `test-db`
\ No newline at end of file