Genome Taxonomy Database r214.1

Citation

Parks D, Hugenholtz P (2024). Genome Taxonomy Database r214.1. Version 1.92. The University of Queensland. Checklist dataset https://doi.org/10.15468/dpzg84 accessed via GBIF.org on 2024-08-12.

Description

The Genome Taxonomy Database (GTDB) is an initiative to establish a standardised microbial taxonomy based on genome phylogeny. It is primarily funded by the Australian Research Council via a Laureate Fellowship (FL150100038) and a Discovery Project (DP220100900), with additional strategic funding from The University of Queensland.

The genomes used to construct the phylogeny are obtained from RefSeq and GenBank, and GTDB releases are indexed to RefSeq releases, starting with release 76. Importantly and increasingly, this dataset includes draft genomes of uncultured microorganisms obtained from metagenomes and single cells, which improves genomic representation of the microbial world. All genomes are independently quality controlled using CheckM before inclusion in GTDB (summary statistics are available on the GTDB website).

The GTDB taxonomy is based on genome trees. The bacterial tree is inferred using FastTree from an aligned, concatenated set of 120 single-copy marker proteins. The archaeal tree is inferred using IQ-TREE from a concatenated set of 53 marker proteins (starting with R07-RS207) or 122 marker proteins (prior to R07-RS207). The marker sets are available from the GTDB download page. Additional marker sets, including concatenated ribosomal proteins and ribosomal RNA genes, are also used to cross-validate tree topologies.

NCBI taxonomy was initially used to decorate the genome tree via tax2tree and was subsequently used as a reference source of new taxonomic opinions, including new names. The 16S rRNA-based Greengenes and SILVA taxonomies were initially used to supplement the taxonomy, particularly in regions of the tree with no cultured representatives; however, genome assembly identifiers are now used to create placeholder names for uncultured taxa. LPSN is used as the primary nomenclatural reference for establishing naming priorities and nomenclature types.

All taxonomic ranks except species are normalised using PhyloRank, and the taxonomy is manually curated to remove polyphyletic groups. Polyphyly and rank evenness can be visualised in PhyloRank plots. Species were originally delineated based on phylogeny and rank normalisation, but this was replaced with an ANI-based method (starting with R04-RS89) to enable scalable and automated assignment of genomes to species clusters.

The GTDB taxonomy can be queried and downloaded through a number of tools at https://gtdb.ecogenomic.org/
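
The ANI-based assignment of genomes to species clusters mentioned above can be illustrated with a small sketch. This is not the GTDB pipeline itself: the function name `assign_to_species`, the 95% threshold, the genome identifiers, and the ANI values are all assumptions made for illustration, and the precomputed pairwise ANI table stands in for whatever ANI tool would be used in practice. The sketch only shows the general idea that a genome joins the cluster of its closest representative when the ANI is high enough, and otherwise starts a new cluster.

```python
from typing import Dict, FrozenSet, List, Optional

# Assumed species boundary in percent ANI (illustrative, not taken from the description).
ANI_THRESHOLD = 95.0


def assign_to_species(
    genomes: List[str],
    ani: Dict[FrozenSet[str], float],
    threshold: float = ANI_THRESHOLD,
) -> Dict[str, List[str]]:
    """Greedily assign genomes to species clusters.

    Each genome joins the cluster of the representative it is most similar to,
    provided that ANI >= threshold; otherwise it becomes a new representative.
    Missing pairs in the ANI table are treated as 0.0 (no detectable similarity).
    """
    clusters: Dict[str, List[str]] = {}
    for genome in genomes:
        best_rep: Optional[str] = None
        best_ani = threshold
        for rep in clusters:
            value = ani.get(frozenset((genome, rep)), 0.0)
            if value >= best_ani:
                best_rep, best_ani = rep, value
        if best_rep is None:
            clusters[genome] = [genome]  # genome founds a new species cluster
        else:
            clusters[best_rep].append(genome)
    return clusters


# Hypothetical genomes and ANI values, for illustration only.
example_ani = {
    frozenset(("G1", "G2")): 98.2,
    frozenset(("G1", "G3")): 80.1,
    frozenset(("G2", "G3")): 79.5,
    frozenset(("G1", "G4")): 81.0,
    frozenset(("G3", "G4")): 96.0,
}
example_clusters = assign_to_species(["G1", "G2", "G3", "G4"], example_ani)
print(example_clusters)  # {'G1': ['G1', 'G2'], 'G3': ['G3', 'G4']}
```

Because each new genome is compared only against existing representatives rather than against every other genome, this style of assignment scales more readily than tree-based delineation. In this sketch the result depends on the order in which genomes are processed.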

Taxonomic Coverages

  1. Archaea
    common name: Archaea
    rank: domain

Bibliographic Citations

  1. Parks, D.H., et al. (2020). A complete domain-to-species taxonomy for Bacteria and Archaea. Nature Biotechnology - DOI:10.1038/s41587-020-0501-8
  2. Parks, D.H., et al. (2018). A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nature Biotechnology, 36: 996-1004 - DOI:10.1038/nbt.4229

Contacts

Donovan Parks
roles: originator, metadata author, administrative point of contact
position: Dr
Australian Centre for Ecogenomics
AU
email: donovan.parks@gmail.com
homepage: https://ecogenomic.org/personnel/dr-donovan-parks
userId: http://orcid.org/0000-0001-6662-9010

Phil Hugenholtz
roles: originator, metadata author, administrative point of contact
position: Professor
Australian Centre for Ecogenomics
AU
email: p.hugenholtz@uq.edu.au
homepage: https://ecogenomic.org/personnel/prof-phil-hugenholtz
userId: http://orcid.org/0000-0001-5386-7925

Pierre Chaumeil
roles: user, administrative point of contact
position: Software developer
Australian Centre for Ecogenomics
AU
email: p.chaumeil@qfab.org
homepage: https://ecogenomic.org/personnel/mr-pierre-chaumeil