Database creation

Workflow db creation

Workflow for database creation

1) Download plasmid sequences available in NCBI refseqarrow-up-right.

2) Extract fasta from tar.gz.

3) Download and extract NCBI taxonomyarrow-up-right, which will be fed to pATLAS.

4) Clone this repository:

5) Install its dependenciesarrow-up-right

6) Configure the database:

7) run MASHix.pyarrow-up-right - the output will include a filtered. fasta file (master_fasta_*.fas).

8) run ABRicatearrow-up-right, with CARD, ResFinder, PlasmidFinder, VFDB databases.

9) Download the card indexarrow-up-right necessary for the abricate2db.py script (aro_index.csv).

10) Update the git submodules (git submodule update --init --recursive) and run abricate2db.pyarrow-up-right - using all the previous tsv as input.

11) dump database to a sql file.

Automation of this steps

This steps are fully automated in the nextflow pipeline pATLAS-db-creationarrow-up-right.

Creating a custom version of pATLAS

If you require to add your own plasmids to pATLAS database without asking to add them to pATLAS websitearrow-up-right, you can provide custom fasta files when building the database using the -i option of MASHix.pyarrow-up-right. Then follow the steps described above.

Last updated