# Final Results ## BUSCO # BUSCO version is: 5.4.4 # The lineage dataset is: eukaryota_odb10 (Creation date: 2020-09-10, number of genomes: 70, number of BUSCOs: 255) # Summarized benchmarking in BUSCO notation for file transcripts.fasta # BUSCO was run in mode: euk_tran ***** Results: ***** C:89.8%[S:24.7%,D:65.1%],F:6.3%,M:3.9%,n:255 229 Complete BUSCOs (C) 63 Complete and single-copy BUSCOs (S) 166 Complete and duplicated BUSCOs (D) 16 Fragmented BUSCOs (F) 10 Missing BUSCOs (M) 255 Total BUSCO groups searched Dependencies and versions: hmmsearch: 3.1 metaeuk: 6.a5d39d9 busco: 5.4.4 Report: In the analysis of the transcriptome assembly using BUSCO, we identified 229 complete BUSCOs, representing 89.8% of the total 255 BUSCO groups searched. Of these complete BUSCOs, 63 were classified as complete and single-copy (S), while 166 were identified as complete and duplicated (D). The analysis also revealed 16 fragmented BUSCOs (F) and 10 missing BUSCOs (M). These results indicate good completeness in terms of expected gene content, but the presence of fragmented and missing BUSCOs suggests that further investigation is needed to improve the quality of the transcriptome assembly. ## Contig metrics n seqs 91092 smallest 223 largest 28459 n bases 126520031 mean len 1388.93 n under 200 0 n over 1k 41012 n over 10k 180 n with orf 43335 mean orf percent 58.08 n90 563 n70 1473 n50 2333 n30 3533 n10 5886 gc 0.54 bases n 250420 Report: The contig metrics reveal that the transcriptome assembly consists of 91,092 contigs, with a total length of 126,520,031 bases and a mean length of 1388.93 bases. The assembly contains 41,012 contigs with lengths over 1,000 bases, including 180 contigs longer than 10,000 bases. About half of the bases in the assembly are GC (0.54). The assembly also contains 43,335 contigs with open reading frames (ORFs), with a mean ORF percent of 58.08%. The N50 of the assembly is 2333, indicating that half of the total assembly length is contained in contigs of at least 2333 bases. The N90 of 563 suggests that 90% of the assembly length is contained in contigs of at least 563 bases. Finally, there were no contigs under 200 bases in length. The presence of a significant number of contigs with ORFs and long contigs suggests that the transcriptome assembly is of good quality. However, the relatively high number of contigs (n=43,335) with ORFs raises the possibility that some of these contigs may be misassembled or represent partial transcripts. Further analysis, such as alignment to a reference genome or functional annotation, would help to clarify the quality and completeness of the assembly.