Search for putative complete and near-complete genomes of DNA-containing viruses in metagenomes obtained from the Lake Baikal

Authors

  • Potapov S.A. 1 ID
  • Tupikin A.E. 2 ID
  • Tikhonova I.V. 1 ID
  • Zhuchenko N.A. 1
  • Belykh O.I. 1 ID
  • 1 Limnological Institute Siberian Branch Russian Academy of Sciences, Ulan-Batorskaya St., 3, Irkutsk, 664033, Russia
    2 Institute of Chemical Biology and Fundamental Medicine, Siberian Branch of the Russian Academy of Sciences (ICBFM SB RAS), Ac. Lavrentieva ave., 8, Novosibirsk, 630090, Russia

DOI:

https://doi.org/10.31951/2658-3518-2024-A-4-1050

Keywords:

metagenomics, bacteriophages, high-throughput sequencing, Lake Baikal, complete genome

Abstract

The paper presents the analysis of putatively complete and near-complete genomes of bacteriophages extracted from metagenomic data obtained from DNA samples isolated from Lake Baikal water using modern bioinformatic programs. A total of 73 sequences with lengths ranging from 13.8 kb to 163.7 kb belonging to phages of the Caudoviricetes class were identified. Two contigs belonging putatively to cyanophages with lengths of 36.8 kb and 163.7 kb were detected, and in the latter one an ORF with a length of 159 amino acid residues similar to the small heat shock protein (Hsp20) was identified. Analysis of the amino acid sequences identified in the assembled bacteriophage genomes using the PHROG database revealed that 27.5% of them have an unknown function, while the majority of those with similarity to known ones (23.7%) belong to the category “DNA, RNA and nucleotide metabolism”. A number of accessory metabolic genes (AMGs) were also detected in the assembled genomes: nadMcysCcobSgalEcobT, etc. Most of the sequences with similarity to sequences from the IMG/VR database (89.6%) corresponded to sequences obtained from freshwater bodies.

Downloads

Additional Files

Published

2024-08-30

Issue

Section

Articles