Database report

Published

Tuesday, the 9th of July, 2024

Database updated from 2022.03 to 20240628

  Updated genbank-2022.03-archaea-k21.zip to create genbank-20240628-archaea-k21.zip 

  Updated genbank-2022.03-archaea-k31.zip to create genbank-20240628-archaea-k31.zip 

  Updated genbank-2022.03-archaea-k51.zip to create genbank-20240628-archaea-k51.zip 

Sourmash summary of the first k-value in each database

The manifest for genbank-2022.03-archaea-k21.zip:


** loading from '../test/genbank-20240628/data/collect-mf.2022.03-archaea.csv'
path filetype: StandaloneManifestIndex
location: ../test/genbank-20240628/data/collect-mf.2022.03-archaea.csv
is database? yes
has manifest? yes
num signatures: 8749

** examining manifest...
total hashes: 14866188
summary of sketches:
   8749 sketches with DNA, k=21, scaled=1000, abund   14866188 total hashes

The manifest for test/genbank-20240628/genbank-20240628-archaea-k21.zip:


** loading from '../test/genbank-20240628/data/collect-mf.20240628-archaea.csv'
path filetype: StandaloneManifestIndex
location: ../test/genbank-20240628/data/collect-mf.20240628-archaea.csv
is database? yes
has manifest? yes
num signatures: 234

** examining manifest...
total hashes: 367743
summary of sketches:
   234 sketches with DNA, k=21, scaled=1000, abund    367743 total hashes
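The per-group totals in the two summaries above can be recomputed from a manifest CSV directly. The following is a minimal sketch, not the code that produced this report. It assumes the usual sourmash manifest layout: a leading comment line starting with `#`, followed by a header that includes the `ksize`, `moltype`, `scaled`, `with_abundance`, and `n_hashes` columns. The function name `summarize_manifest` and the two sample rows are hypothetical and exist only for illustration.

```python
import csv
import io
from collections import Counter


def summarize_manifest(lines):
    """Tally sketches and hashes per (moltype, ksize, scaled, abund) group."""
    # Assumption: comment lines (such as the manifest version line) start with '#'.
    rows = csv.DictReader(line for line in lines if not line.startswith("#"))
    sketches = Counter()
    hashes = Counter()
    for row in rows:
        # Assumption: abundance tracking is recorded as "1" or "True".
        abund = row["with_abundance"].strip().lower() in {"1", "true"}
        key = (row["moltype"], int(row["ksize"]), int(row["scaled"]), abund)
        sketches[key] += 1
        hashes[key] += int(row["n_hashes"])
    return sketches, hashes


# Illustrative in-memory manifest with made-up rows (not data from this report).
example = io.StringIO(
    "# SOURMASH-MANIFEST-VERSION: 1.0\n"
    "internal_location,md5,md5short,ksize,moltype,num,scaled,n_hashes,with_abundance,name,filename\n"
    "a.sig,aaaa,aaaa,21,DNA,0,1000,1500,1,genome A,a.fa\n"
    "b.sig,bbbb,bbbb,21,DNA,0,1000,2500,1,genome B,b.fa\n"
)
sketches, hashes = summarize_manifest(example)
for key, count in sketches.items():
    moltype, ksize, scaled, abund = key
    suffix = ", abund" if abund else ""
    print(f"{count} sketches with {moltype}, k={ksize}, scaled={scaled}{suffix}  {hashes[key]} total hashes")
```

To run it against a real manifest, pass an open file handle for one of the `collect-mf.*.csv` files in place of the in-memory example.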

Genome report for the updated database

From 8749 in 'test/genbank-20240628/data/collect-mf.2022.03-archaea.csv':
Kept 93 in 'test/genbank-20240628/data/mf-clean.20240628-archaea.csv'.
Removed 8656 total.
Removed 8656 identifiers because the genome is suspected to have been suspended.
Removed 0 because of changed version.
For more details, read ../test/genbank-20240628/data/mf-details.20240628-archaea.txt in the `data` directory.
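The counts in this genome report should balance: the original total minus the kept genomes should equal the sum of the per-reason removals. A minimal sketch of that check, using the figures reported above, follows. The function name `check_tally` and the dictionary keys are hypothetical labels, not part of the reporting tool.

```python
def check_tally(original, kept, removed_by_reason):
    """Confirm that kept plus removed genomes account for the original total."""
    removed = sum(removed_by_reason.values())
    return {
        "removed": removed,
        "consistent": original - kept == removed,
        "kept_fraction": kept / original,
    }


# Figures copied from the genome report above.
report = check_tally(
    original=8749,
    kept=93,
    removed_by_reason={"suspected suspension": 8656, "changed version": 0},
)
print(report["removed"], report["consistent"])
print(f"{report['kept_fraction']:.2%} of the original genomes were kept")
```

A `consistent` value of `False` would indicate that some removals were not attributed to a reason, or were counted twice.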

Genomes that failed to download or sketch

accession,name,moltype,md5sum,download_filename,url
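The listing above contains only the column header, so no failures are recorded. As a minimal sketch, a failure list in this format could be counted as follows. The function name `load_failures` is hypothetical, and the input is the header line exactly as it appears above.

```python
import csv
import io


def load_failures(text):
    """Parse a failure listing into one dict per failed genome."""
    return list(csv.DictReader(io.StringIO(text)))


# Header-only listing, as shown in this report.
failures = load_failures("accession,name,moltype,md5sum,download_filename,url\n")
print(f"{len(failures)} genomes failed to download or sketch")
```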