A priori estimation of sequencing effort in complex microbial metatranscriptomes

dc.contributor.authorMonleón Getino, Toni
dc.contributor.authorFrías-López, Jorge
dc.date.accessioned2020-12-03T18:30:35Z
dc.date.available2020-12-03T18:30:35Z
dc.date.issued2020-11-01
dc.date.updated2020-12-03T18:30:36Z
dc.description.abstractMetatranscriptome analysis or the analysis of the expression profiles of whole microbial communities has the additional challenge of dealing with a complex system with dozens of different organisms expressing genes simultaneously. An underlying issue for virtually all metatranscriptomic sequencing experiments is how to allocate the limited sequencing budget while guaranteeing that the libraries have sufficient depth to cover the breadth of expression of the community. Estimating the required sequencing depth to effectively sample the target metatranscriptome using RNA‐seq is an essential first step to obtain robust results in subsequent analysis and to avoid overexpansion, once the information contained in the library reaches saturation. Here, we present a method to calculate the sequencing effort using a simulated series of metatranscriptomic/metagenomic matrices. This method is based on an extrapolation rarefaction curve using a Weibull growth model to estimate the maximum number of observed genes as a function of sequencing depth. This approach allowed us to compute the effort at different confidence intervals and to obtain an approximate a priori effort based on an initial fraction of sequences. The analytical pipeline presented here may be successfully used for the in‐depth and time‐effective characterization of complex microbial communities, representing a useful tool for the microbiome research community.
dc.format.extent13 p.
dc.format.mimetypeapplication/pdf
dc.identifier.idgrec704350
dc.identifier.issn2045-7758
dc.identifier.pmid33304545
dc.identifier.urihttps://hdl.handle.net/2445/172542
dc.language.isoeng
dc.publisherJohn Wiley & Sons
dc.relation.isformatofReproducció del document publicat a: https://doi.org/10.1002/ece3.6941
dc.relation.ispartofEcology and Evolution, 2020, vol. 10, num. 23, p. 13382-13394
dc.relation.urihttps://doi.org/10.1002/ece3.6941
dc.rightscc-by (c) Monleón Getino, Toni et al., 2020
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es
dc.sourceArticles publicats en revistes (Genètica, Microbiologia i Estadística)
dc.subject.classificationExpressió gènica
dc.subject.classificationRNA
dc.subject.classificationBiodiversitat
dc.subject.otherGene expression
dc.subject.otherRNA
dc.subject.otherBiodiversity
dc.titleA priori estimation of sequencing effort in complex microbial metatranscriptomes
dc.typeinfo:eu-repo/semantics/article
dc.typeinfo:eu-repo/semantics/publishedVersion

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
704350.pdf
Mida:
2.06 MB
Format:
Adobe Portable Document Format