Classifying bacteria is crucial for understanding their ecology, evolution, and potential roles in health and disease. Two prominent systems for bacterial taxonomy are the National Center for Biotechnology Information (NCBI) taxonomy and the Genome Taxonomy Database (GTDB). While both aim to categorize bacteria, they differ in their underlying philosophies and methodologies, leading to discrepancies in classification. The NCBI taxonomy, a long-standing system, curates classifications based on a combination of phenotypic and genotypic data [1]. However, in recent years, researchers highlighted inconsistencies arising from historical practices and the subjective nature of phenotypic traits [2]. Additionally, the NCBI taxonomy is not strictly rank-normalized, meaning equivalent evolutionary distances can be assigned different taxonomic ranks [1]. GTDB, a newer system, focuses on a phylogenetically consistent and rank-normalized classification based on whole-genome sequences. It utilizes Average Nucleotide Identity (ANI) to delineate species boundaries and Relative Evolutionary Divergence (RED) to define higher taxonomic ranks [2]. This approach aims to provide a more objective and evolutionarily informative classification scheme leading to GTDB often recognizing finer taxonomic levels and proposing novel lineages not present in the NCBI taxonomy . Here’s a breakdown of the key differences between GTDB and NCBI taxonomies:Documentation Index
Fetch the complete documentation index at: https://docs.cosmosid.com/llms.txt
Use this file to discover all available pages before exploring further.
- Philosophy: NCBI - Curator-driven, phenotypic and genotypic data; GTDB - Genome-based, phylogenetically consistent.
- Data source: NCBI Refseq and Genbank; GTDB - NCBI Refseq and Genbank
- Species definition: NCBI - Less stringent, often based on 16S rRNA gene sequence similarity [1]; GTDB - Stricter, based on ANI values (>95% for bacteria) [2].
- Higher-rank classification: NCBI - Traditional Linnaean ranks (phylum, class, order, etc.); GTDB - Ranks are normalized based on evolutionary divergence (phylum level for Bacillota can be Bacillota, Bacillota_A, Bacillota_B; ) [2].
- Schoch, C.L., Ciufo, S., Domrachev, M., Hotton, C.L., Kannan, S., Khovanskaya, R., Leipe, D., McVeigh, R., O’Neill, K., Robbertse, B., Sharma, S., Soussov, V., Sullivan, J.P., Sun, L., Turner, S. and Karsch-Mizrachi, I., 2020. NCBI Taxonomy: a comprehensive update on curation, resources and tools. Database, 2020, baaa062.
- Parks, D.H., Chuvochina, M., Waite, D.W., Rinke, C., Skarshewski, A., Chaumeil, P.-A., and Hugenholtz, P., 2018. A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nature Biotechnology, 36(10), pp.996-1004