Complete Genome Sequence of Bradyrhizobium sp. Strain BDV5040, Representative of Widespread Genospecies B in Australia

Christine Oger-Desfeux; Jérôme Briolay; Philippe Oger; Bénédicte Lafay

doi:10.1128/MRA.01326-20

Article Dans Une Revue Microbiology Resource Announcements Année : 2021

Complete Genome Sequence of Bradyrhizobium sp. Strain BDV5040, Representative of Widespread Genospecies B in Australia

(1) , (2) , (3) , (4, 5)

1
2
3
4
5

Christine Oger-Desfeux

Fonction : Auteur

Pôle Rhône-Alpin de BioInformatique [Lyon]

Jérôme Briolay

Fonction : Auteur

Institut Ecologie et Environnement - CNRS Ecologie et Environnement

Philippe Oger

Fonction : Auteur
PersonId : 2259
IdHAL : poger
ORCID : 0000-0001-6298-6870
IdRef : 167411993

Microbiologie, adaptation et pathogénie

Bénédicte Lafay

Fonction : Auteur
PersonId : 21704
IdHAL : benedicte-lafay
ORCID : 0000-0001-5783-5269
IdRef : 155699946

Laboratoire de Biométrie et Biologie Evolutive - UMR 5558

Ampère

Résumé

We report the complete genome sequence of Bradyrhizobium sp. strain BDV5040, representative of Bradyrhizobium genospecies B, which symbiotically associates with legume hosts belonging to all three Fabaceae subfamilies across the Australian continent. The complete genome sequence provides a genetic reference for this Australian genospecies. Bradyrhizobium sp. strain BDV5040 was isolated in 1995 from a root nodule of Bossiaea ensata (Fabaceae, Faboideae, Bossiaeeae) collected in Ben Boyd National Park, New South Wales, Australia (37°12′S, 149°57′E; altitude, 140 m), in the course of a survey of rhizobia associated with native shrubby legumes in southeastern Australia (1). It is a representative of Bradyrhizobium genospecies B, which occurs under different climatic and edaphic conditions across the whole Australian continent and exhibits a broad host range encompassing all three Fabaceae subfamilies (1–4). Strain BDV5040 was grown from a lyophilized stock in 30 ml of yeast extract mannitol broth (5) at 25°C and 200 rpm for 5 days. Genomic DNA was prepared by successive phenol-chloroform extractions as described (6). DNA quantification and quality control were performed using a NanoDrop spectrophotometer, a Qubit 4 fluorometer, and agarose gel electrophoresis. The same DNA was used for Nanopore and Illumina sequencing. Illumina libraries were obtained using the Nextera XT kit following the manufacturer’s instructions, starting with 1 ng of genomic DNA, and were analyzed by paired-end 2 × 300-bp sequencing on a MiSeq instrument. Poor-quality regions (Q < 30) of raw reads were removed using Sickle v1.33 (7). Long reads were obtained with the Oxford Nanopore Technologies MinION FLO-MIN106 flow cell from a library prepared with the SQK-RAD004 kit using 1 μg of DNA and 20-second tagmentation. Base calling was performed using Guppy v3.1.5. Sequence quality was assessed using FastQC v0.11.9 (8) for Illumina reads and MinIONQC.R v1.4.2 (9) for Nanopore reads. The preprocessing of Illumina sequencing data using Trimmomatic v0.39 (10) (parameters: ILLUMINACLIP:TruSeq3-PE-2.fa:2:30:10:2:keepBothReads LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:50) resulted in 3.87 million paired-end reads (mean lengths of 222 bp and 160 bp for forward and reverse reads, respectively, with ∼150× coverage). Nanopore long reads (total of 2.7 Gbp, with an N50 value of 26,211 bp) were filtered and trimmed for length (>1,500 bp) and quality (score of >8) using Nanofilt v2.5.0 (11), and adapters were removed using Porechop v0.2.4 (12). Long reads were further reduced to 800 Mbp as a target quantity using Filtlong v0.2.0 (13) (parameters: --min_length 2000 --keep_percent 90 --target_bases 800000000). Illumina and Nanopore reads were coassembled using Unicycler v0.4.8 (14) with default parameters, resulting in a single component with eight segments and incomplete status (length, 7,622,333 bp; N50, 7,339,313 bp). Completion was obtained by exporting the sequence path from Bandage v0.8.1 (15) and filling a last gap using Pilon v1.23 (16) and by manually comparing the sequence with Unicycler 003_long_read_assembly.fasta. The assembly and complete chromosome sequence were carefully inspected by visualizing the alignment of long and short reads using minimap2 v2.17 (17) and IGV v2.7.2 (18). Finally, the chromosome was rotated to start at dnaA. The circular chromosome is 7,622,528 bp long, with an average G+C content of 63.92%. The sequence was automatically annotated by the NCBI Prokaryote Genome Annotation Pipeline (PGAP) v4.13 (19). The genome consists of 7,092 protein-coding genes, 48 tRNAs, 1 copy each of the 5S, 16S, and 23S rRNA genes, and 88 pseudogenes. Data availability.The genome sequence of Bradyrhizobium genospecies B strain BDV5040 is available in NCBI GenBank under accession number CP061379. The raw sequence reads are available under SRA accession numbers SRX9514896 and SRX9514898 under BioProject number PRJNA662585 and BioSample number SAMN16089659.

Domaines

Bactériologie Génomique, Transcriptomique et Protéomique [q-bio.GN] Interactions entre organismes Systématique, phylogénie et taxonomie Bio-informatique [q-bio.QM]

Fichier principal

2021-Microbiology Resource Announcements-2021-Oger-Desfeux-e01326-20.full.pdf (187.76 Ko)

Origine : Publication financée par une institution

Bénédicte Lafay : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03118246

Soumis le : vendredi 22 janvier 2021-09:26:41

Dernière modification le : jeudi 11 avril 2024-13:10:04

Archivage à long terme le : vendredi 23 avril 2021-19:00:04

Dates et versions

hal-03118246 , version 1 (22-01-2021)

Identifiants

HAL Id : hal-03118246 , version 1
DOI : 10.1128/MRA.01326-20

Citer

Christine Oger-Desfeux, Jérôme Briolay, Philippe Oger, Bénédicte Lafay. Complete Genome Sequence of Bradyrhizobium sp. Strain BDV5040, Representative of Widespread Genospecies B in Australia. Microbiology Resource Announcements, 2021, 10 (3), pp.e01326-20. ⟨10.1128/MRA.01326-20⟩. ⟨hal-03118246⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LYON1 INSA-LYON EC-LYON AMPERE BIOENVIS UMR5240 LBBE INSA-GROUPE UDL INRAE BIOEENVIS

92 Consultations

60 Téléchargements

Complete Genome Sequence of Bradyrhizobium sp. Strain BDV5040, Representative of Widespread Genospecies B in Australia

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager