Short- and long-read metabarcoding of the eukaryotic rRNA operon: Evaluation of primers and comparison to shotgun metagenomics sequencing

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Dokumenter

  • Fulltext

    Forlagets udgivne version, 11,2 MB, PDF-dokument

  • Latz, Meike
  • Vesna Grujcic
  • Sonia Brugel
  • Jenny Lycken
  • Uwe John
  • Bengt Karlson
  • Agneta Andersson
  • Anders F. Andersson

High-throughput sequencing-based analysis of microbial diversity has evolved vastly over the last decade. Currently, the go-to method for studying microbial eukaryotes is short-read metabarcoding of variable regions of the 18S rRNA gene with <500 bp amplicons. However, there is a growing interest in applying long-read sequencing of amplicons covering the rRNA operon for improving taxonomic resolution. For both methods, the choice of primers is crucial. It determines if community members are covered, if they can be identified at a satisfactory taxonomic level, and if the obtained community profile is representative. Here, we designed new primers targeting 18S and 28S rRNA based on 177,934 and 21,072 database sequences, respectively. The primers were evaluated in silico along with published primers on reference sequence databases and marine metagenomics data sets. We further evaluated a subset of the primers for short- and long-read sequencing on environmental samples in vitro and compared the obtained community profile with primer-unbiased metagenomic sequencing. Of the short-read pairs, a new V6-V8 pair and the V4_Balzano pair used with a simplified PCR protocol provided good results in silico and in vitro. Fewer differences were observed between the long-read primer pairs. The long-read amplicons and ITS1 alone provided higher taxonomic resolution than V4. Together, our results represent a reference and guide for selection of robust primers for research on and environmental monitoring of microbial eukaryotes.

OriginalsprogEngelsk
TidsskriftMolecular Ecology Resources
Vol/bind22
Udgave nummer6
Sider (fra-til)2304-2318
Antal sider15
ISSN1755-098X
DOI
StatusUdgivet - 2022

Bibliografisk note

Funding Information:
DNA sequencing was conducted by the Swedish National Genomics Infrastructure (NGI) in Stockholm and Uppsala. Computations were performed on resources provided by the Swedish National Infrastructure for Computing (SNIC) through the Uppsala Multidisciplinary Center for Advanced Computational Science (UPPMAX). We thank Luis Fernando Delgado Zambrano for bioinformatic support. This study was supported by the Swedish Agency for Marine and Water Management and the Swedish Environmental Protection Agency under the grant number NV‐03728‐17. Meike A. C. Latz was additionally supported by a research grant (34442) from VILLUM FONDEN and Anders F. Andersson by a research grant (2017‐00694) from Formas ‐ a Swedish Research Council for SustainableDevelopment.

Publisher Copyright:
© 2022 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd.

ID: 307294312