Does the choice of nucleotide substitution models matter topologically?

Hoff, Michael; Orf, Stefan; Riehm, Benedikt; Darriba, Diego; Stamatakis, Alexandros

doi:10.1186/s12859-016-0985-x

Does the choice of nucleotide substitution models matter topologically?

Hoff, Michael ¹; Orf, Stefan ¹; Riehm, Benedikt ¹; Darriba, Diego; Stamatakis, Alexandros

¹
¹ Fakultät für Informatik (INFORMATIK), Karlsruher Institut für Technologie (KIT)

Abstract:

Background: In the context of a master level programming practical at the computer science department of the Karlsruhe Institute of Technology, we developed and make available an open-source code for testing all 203 possible nucleotide substitution models in the Maximum Likelihood (ML) setting under the common Akaike, corrected Akaike, and Bayesian information criteria. We address the question if model selection matters topologically, that is, if conducting ML inferences under the optimal, instead of a standard General Time Reversible model, yields different tree topologies. We also assess, to which degree models selected and trees inferred under the three standard criteria (AIC, AICc, BIC) differ. Finally, we assess if the definition of the sample size (#sites versus #sites × #taxa) yields different models and, as a consequence, different tree topologies.
Results: We find that, all three factors (by order of impact: nucleotide model selection, information criterion used, sample size definition) can yield topologically substantially different final tree topologies (topological difference exceeding 10 %) for approximately 5 % of the tree inferences conducted on the 39 empirical datasets used in our study.
... mehr

KITopen-Download

Volltext

DOI: 10.5445/IR/1000062728

Externe Links

Originalveröffentlichung
DOI: 10.1186/s12859-016-0985-x

Scopus
Zitationen: 34

Web of Science
Zitationen: 39

Dimensions
Zitationen: 47

Export

Statistiken

Seitenaufrufe: 245
seit 24.07.2018

Downloads: 246
seit 06.10.2017

Zugehörige Institution(en) am KIT	Fakultät für Informatik (INFORMATIK)
Publikationstyp	Zeitschriftenaufsatz
Publikationsjahr	2016
Sprache	Englisch
Identifikator	ISSN: 1471-2105 urn:nbn:de:swb:90-627281 KITopen-ID: 1000062728
Erschienen in	BMC bioinformatics
Verlag	Springer Fachmedien Wiesbaden
Band	17
Heft	1
Seiten	Art.Nr.: 143
Schlagwörter	Phylogenetics, Nucleotide substitution, Model selection, Information criterion, BIC, AIC
Nachgewiesen in	Web of Science Dimensions OpenAlex Scopus

Repository KITopen

Does the choice of nucleotide substitution models matter topologically?

Abstract: