KIT | KIT-Bibliothek | Impressum | Datenschutz

How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner

Vollmers, John; Wiegand, Sandra; Lenk, F.; Kaster, Anne-Kristin 1
1 Institut für Angewandte Biowissenschaften (IAB), Karlsruher Institut für Technologie (KIT)

Abstract:

As of today, the majority of environmental microorganisms remain uncultured and is therefore referred to as ‘microbial dark matter’ (MDM). Hence, genomic insights into these organisms are limited to cultivation-independent approaches such as single-cell- and metagenomics. However, without access to cultured representatives for verifying correct taxon-assignments, MDM genomes may cause potentially misleading conclusions based on misclassified or contaminant contigs, thereby obfuscating our view on the uncultured microbial majority. Moreover, gradual database contaminations by past genome submissions can cause error propagations which affect present as well as future comparative genome analyses. Consequently, strict contamination detection and filtering need to be applied, especially in the case of uncultured MDM genomes. Current genome reporting standards, however, emphasize completeness over purity and the de facto gold standard genome assessment tool, checkM, discriminates against uncultured taxa and fragmented genomes. To tackle these issues, we present a novel contig classification, screening, and filtering workflow and corresponding open-source python implementation called MDMcleaner, which was tested and compared to other tools on mock and real datasets. ... mehr


Verlagsausgabe §
DOI: 10.5445/IR/1000145902
Veröffentlicht am 23.12.2022
Originalveröffentlichung
DOI: 10.1093/nar/gkac294
Scopus
Zitationen: 28
Web of Science
Zitationen: 22
Dimensions
Zitationen: 35
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Angewandte Biowissenschaften (IAB)
Institut für Biologische Grenzflächen (IBG)
Publikationstyp Zeitschriftenaufsatz
Publikationsjahr 2022
Sprache Englisch
Identifikator ISSN: 0305-1048, 1362-4962
KITopen-ID: 1000145902
HGF-Programm 43.33.11 (POF IV, LK 01) Adaptive and Bioinstructive Materials Systems
Erschienen in Nucleic Acids Research
Verlag Oxford University Press (OUP)
Band 50
Heft 13
Seiten e76
Bemerkung zur Veröffentlichung Gefördert durch den KIT-Publikationsfonds
Nachgewiesen in Web of Science
Scopus
Dimensions
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page