Skip to content

Pre‐sketched databases

Jim Shaw edited this page Mar 31, 2024 · 3 revisions

Pre-sketched databases for skani

Pre-sketched databases for skani are available for download. These databases can be

  • searched against in seconds with skani search
  • sketched files can also be used with skani triangle and skani dist if desired

Sketched GTDB-R214 database (85,205 bacteria/archaea species representative genomes)

The database is 23 GB compressed and 51 GB uncompressed.

Parameters:

  • Default: -c 125, -m 1000

Download: https://storage.googleapis.com/skani_files/skani-gtdb-r214-sketch-v0.2.tar.gz

wget https://storage.googleapis.com/skani_files/skani-gtdb-r214-sketch-v0.2.tar.gz 
tar -zxvf skani-gtdb-r214-sketch-v0.2.tar.gz

skani search my_genome.fa -d skani-gtdb-r214-sketch-v0.2 -o results.tsv