Database report

Published

Tuesday, the 9th of July, 2024

Database updated from 2022.03 to 20240628

  Updated genbank-2022.03-viral-k21.zip to create genbank-20240628-viral-k21.zip 

  Updated genbank-2022.03-viral-k31.zip to create genbank-20240628-viral-k31.zip 

  Updated genbank-2022.03-viral-k51.zip to create genbank-20240628-viral-k51.zip 

Sourmash summary of first K-value in databases

The manifest for genbank-2022.03-viral-k21.zip:


** loading from '../test/genbank-20240628/data/collect-mf.2022.03-viral.csv'
path filetype: StandaloneManifestIndex
location: ../test/genbank-20240628/data/collect-mf.2022.03-viral.csv
is database? yes
has manifest? yes
num signatures: 47951

** examining manifest...
total hashes: 2083014
summary of sketches:
   47951 sketches with DNA, k=21, scaled=1000, abund  2083014 total hashes

The manifest for test/genbank-20240628/genbank-20240628-viral-k21.zip:


** loading from '../test/genbank-20240628/data/collect-mf.20240628-viral.csv'
path filetype: StandaloneManifestIndex
location: ../test/genbank-20240628/data/collect-mf.20240628-viral.csv
is database? yes
has manifest? yes
num signatures: 1929

** examining manifest...
total hashes: 66903
summary of sketches:
   1929 sketches with DNA, k=21, scaled=1000, abund   66903 total hashes
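
The two summaries above report a sketch count and a total hash count per k-mer size. As a rough cross-check, similar totals can be recomputed from a manifest CSV. The following is only a minimal sketch: the column names `moltype`, `ksize`, `scaled` and `n_hashes`, and the leading `#` version comment line, are assumptions about the manifest layout, and the example rows are made up for illustration rather than taken from this report.

```python
import csv
import io
from collections import defaultdict


def summarize_manifest(handle):
    """Count sketches and total hashes per (moltype, ksize, scaled) group."""
    # Skip leading comment lines (assumed: a '# ...VERSION' line may come first).
    rows = csv.DictReader(line for line in handle if not line.startswith("#"))
    summary = defaultdict(lambda: [0, 0])  # key -> [num sketches, total hashes]
    for row in rows:
        key = (row["moltype"], int(row["ksize"]), int(row["scaled"]))
        summary[key][0] += 1
        summary[key][1] += int(row["n_hashes"])
    return {key: tuple(counts) for key, counts in summary.items()}


# Tiny made-up manifest for illustration only; not data from the report.
example = io.StringIO(
    "# SOURMASH-MANIFEST-VERSION: 1.0\n"
    "name,moltype,ksize,scaled,n_hashes\n"
    "genome A,DNA,21,1000,40\n"
    "genome B,DNA,21,1000,25\n"
    "genome A,DNA,31,1000,38\n"
)
example_summary = summarize_manifest(example)
for (moltype, ksize, scaled), (count, hashes) in sorted(example_summary.items()):
    print(f"{count} sketches with {moltype}, k={ksize}, scaled={scaled}: {hashes} total hashes")
```

To run it against a real manifest, pass an open file handle instead of the in-memory example, after confirming that the column names match.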

Genome report for the updated database

From 47951 in 'test/genbank-20240628/data/collect-mf.2022.03-viral.csv':
Kept 469 in 'test/genbank-20240628/data/mf-clean.20240628-viral.csv'.
Removed 47482 total.
Removed 47482 identifiers because of suspected suspension of the genome.
Removed 0 because of changed version.
For more details, read ../test/genbank-20240628/data/mf-details.20240628-viral.txt in the `data` directory.
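
The counts in this genome report can be checked for internal consistency: kept plus removed should equal the original total, and the per-reason removals should add up to the total removed. A minimal sketch, using only the numbers quoted in the report (the variable names are illustrative):

```python
# Counts copied from the genome report above.
total_before = 47951
kept = 469
removed_total = 47482
removed_suspended = 47482
removed_version_changed = 0

# Every original entry is either kept or removed.
assert kept + removed_total == total_before
# The per-reason removals should account for all removals.
assert removed_suspended + removed_version_changed == removed_total

fraction_removed = removed_total / total_before
print(f"{fraction_removed:.1%} of the original entries were removed")
```

Both assertions pass for the reported numbers, so the report is arithmetically consistent.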

Genomes failed to download or sketch

accession,name,moltype,md5sum,download_filename,url
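
The listing above contains only a header line, which suggests that no genomes failed to download or sketch in this run. A minimal sketch of how such a listing could be checked programmatically, assuming it is plain CSV with exactly the columns shown (the function name `read_failures` is hypothetical):

```python
import csv
import io

EXPECTED_COLUMNS = ["accession", "name", "moltype", "md5sum", "download_filename", "url"]


def read_failures(handle):
    """Return the data rows of a failure CSV, checking the header first."""
    reader = csv.DictReader(handle)
    if reader.fieldnames != EXPECTED_COLUMNS:
        raise ValueError(f"unexpected header: {reader.fieldnames}")
    return list(reader)


# The listing in this report: a header line and no data rows.
report_listing = io.StringIO("accession,name,moltype,md5sum,download_filename,url\n")
failures = read_failures(report_listing)
print(f"{len(failures)} genomes failed to download or sketch")
```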