Database report

Published

Tuesday, the 9th of July, 2024

Database updated from 2022.03 to 20240628

  Updated genbank-2022.03-protozoa-k21.zip to create genbank-20240628-protozoa-k21.zip 

  Updated genbank-2022.03-protozoa-k31.zip to create genbank-20240628-protozoa-k31.zip 

  Updated genbank-2022.03-protozoa-k51.zip to create genbank-20240628-protozoa-k51.zip 

Sourmash summary of first K-value in databases

The manifest for genbank-2022.03-protozoa-k21.zip:


** loading from '../test/genbank-20240628/data/collect-mf.2022.03-protozoa.csv'
path filetype: StandaloneManifestIndex
location: ../test/genbank-20240628/data/collect-mf.2022.03-protozoa.csv
is database? yes
has manifest? yes
num signatures: 1192

** examining manifest...
total hashes: 58309636
summary of sketches:
   1192 sketches with DNA, k=21, scaled=1000, abund   58309636 total hashes

The manifest for test/genbank-20240628/genbank-20240628-protozoa-k21.zip:


** loading from '../test/genbank-20240628/data/collect-mf.20240628-protozoa.csv'
path filetype: StandaloneManifestIndex
location: ../test/genbank-20240628/data/collect-mf.20240628-protozoa.csv
is database? yes
has manifest? yes
num signatures: 22

** examining manifest...
total hashes: 941430
summary of sketches:
   22 sketches with DNA, k=21, scaled=1000, abund     941430 total hashes

Genome report for the updated database

From 1192 in 'test/genbank-20240628/data/collect-mf.2022.03-protozoa.csv':
Kept 5 in 'test/genbank-20240628/data/mf-clean.20240628-protozoa.csv.
Removed 1187 total.
Removed 1186 identifiers because of suspected suspension of the genome.
Removed 1 because of changed version.
For more details, read  ../test/genbank-20240628/data/mf-details.20240628-protozoa.txt 
 in the `data` directory

Genomes failed to download or sketch

accession,name,moltype,md5sum,download_filename,url