Database report

Published

Tuesday, the 9th of July, 2024

Database updated from 2022.03 to 20240628

  Updated genbank-2022.03-fungi-k21.zip to create genbank-20240628-fungi-k21.zip 

  Updated genbank-2022.03-fungi-k31.zip to create genbank-20240628-fungi-k31.zip 

  Updated genbank-2022.03-fungi-k51.zip to create genbank-20240628-fungi-k51.zip 

Sourmash summary of first K-value in databases

The manifest for genbank-2022.03-fungi-k21.zip:


** loading from '../test/genbank-20240628/data/collect-mf.2022.03-fungi.csv'
path filetype: StandaloneManifestIndex
location: ../test/genbank-20240628/data/collect-mf.2022.03-fungi.csv
is database? yes
has manifest? yes
num signatures: 10285

** examining manifest...
total hashes: 327547787
summary of sketches:
   10285 sketches with DNA, k=21, scaled=1000, abund  327547787 total hashes

The manifest for test/genbank-20240628/genbank-20240628-fungi-k21.zip:


** loading from '../test/genbank-20240628/data/collect-mf.20240628-fungi.csv'
path filetype: StandaloneManifestIndex
location: ../test/genbank-20240628/data/collect-mf.20240628-fungi.csv
is database? yes
has manifest? yes
num signatures: 203

** examining manifest...
total hashes: 6251266
summary of sketches:
   203 sketches with DNA, k=21, scaled=1000, abund    6251266 total hashes
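
The per-manifest figures above (sketch count, k-mer size, scaled value, total hashes) can in principle be recomputed from a manifest CSV. The following is a minimal sketch, not the code that produced this report. It assumes the manifest is a sourmash-style CSV with optional leading `#` comment lines and columns named `moltype`, `ksize`, `scaled`, `with_abundance` and `n_hashes`; the function name `summarize_manifest` is hypothetical.

```python
import csv
from collections import Counter


def summarize_manifest(path):
    """Tally sketches and hashes per (moltype, ksize, scaled, abund) group.

    Assumes a sourmash-style manifest CSV: optional leading '#' comment
    lines, then a header row that includes the moltype, ksize, scaled,
    with_abundance and n_hashes columns (an assumption, not verified
    against the files named in this report).
    """
    sketches = Counter()
    hashes = Counter()
    with open(path, newline="") as handle:
        # Drop comment lines (for example a manifest version header)
        # before handing the remaining lines to the CSV parser.
        rows = csv.DictReader(
            line for line in handle if not line.startswith("#")
        )
        for row in rows:
            key = (
                row["moltype"],
                int(row["ksize"]),
                int(row["scaled"]),
                row["with_abundance"] in ("1", "True"),
            )
            sketches[key] += 1
            hashes[key] += int(row["n_hashes"])
    return sketches, hashes
```

Each key in the two returned counters corresponds to one line of a `summary of sketches` listing, and summing the values of the second counter gives the `total hashes` figure.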

Genome report for the updated database

From 10285 in 'test/genbank-20240628/data/collect-mf.2022.03-fungi.csv':
Kept 116 in 'test/genbank-20240628/data/mf-clean.20240628-fungi.csv'.
Removed 10169 total.
Removed 10165 identifiers because of suspected suspension of the genome.
Removed 4 because of changed version.
For more details, read ../test/genbank-20240628/data/mf-details.20240628-fungi.txt in the `data` directory.
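
The counts in this section can be cross-checked with simple arithmetic: kept plus removed should equal the starting total, and the two removal reasons should sum to the removed total. A small sketch using only the numbers reported above (the variable names are illustrative):

```python
# Counts copied from the genome report above.
total_before = 10285       # signatures in the 2022.03 manifest
kept = 116                 # entries kept in the cleaned manifest
removed_total = 10169      # entries removed in total
removed_suspended = 10165  # removed for suspected suspension of the genome
removed_version = 4        # removed because the version changed

# Kept and removed entries should partition the original manifest.
assert kept + removed_total == total_before

# The two removal reasons should account for every removed entry.
assert removed_suspended + removed_version == removed_total

# Fraction of the original manifest that was retained.
kept_fraction = kept / total_before
print(f"kept {kept_fraction:.2%} of the original signatures")
```

Both assertions hold for the figures in this report, so the kept and removed counts are internally consistent.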

Genomes failed to download or sketch

accession,name,moltype,md5sum,download_filename,url