databases > DOI:10.25504/FAIRsharing.xgcyyn

ready CATH Protein Structure Classification

Abbreviation:CATH domain

General Information
The CATH database is a hierarchical domain classification of protein structures in the Protein Data Bank. Protein structures are classified using a combination of automated and manual procedures. There are four major levels in this hierarchy; Class (secondary structure classification, e.g. mostly alpha), Architecture (classification based on overall shape), Topology (fold family) and Homologous superfamily (protein domains which are thought to share a common ancestor). This collection is concerned with superfamily classification.


Countries that developed this resource United Kingdom

Created in 1996

Taxonomic range


How to cite this record CATH domain; CATH Protein Structure Classification; DOI:; Last edited: Jan. 8, 2019, 1:20 p.m.; Last accessed: Mar 06 2021 11:47 a.m.

Publication for citation  CATH: an expanded resource to predict protein function through structure and sequence. Dawson NL,Lewis TE,Das S,Lees JG,Lee D,Ashford P,Orengo CA,Sillitoe I; Nucleic Acids Res ; 2016; 10.1093/nar/gkw1098;

This record is maintained by isillitoe  ORCID

Record updated: Dec. 21, 2018, 10:52 a.m. by The FAIRsharing Team.

Show edit history



Additional Information

Access / Retrieve Data

Conditions of Use

Applies to: Data use

Data Access


CATH--a hierarchic classification of protein domain structures.

Orengo CA., Michie AD., Jones S., Jones DT., Swindells MB., Thornton JM.,
Structure 1997

View Paper (PubMed)

Functional classification of CATH superfamilies: a domain-based approach for protein function annotation.

Das S,Lee D,Sillitoe I,Dawson NL,Lees JG,Orengo CA
Bioinformatics 2016

View Paper (PubMed) View Publication

The history of the CATH structural classification of protein domains.

Sillitoe I,Dawson N,Thornton J,Orengo C
Biochimie 2015

View Paper (PubMed) View Publication

CATH: comprehensive structural and functional annotations for genome sequences.

Sillitoe I,Lewis TE,Cuff A,Das S,Ashford P,Dawson NL,Furnham N,Laskowski RA,Lee D,Lees JG,Lehtinen S,Studer RA,Thornton J,Orengo CA
Nucleic Acids Res 2014

View Paper (PubMed) View Publication

The CATH protein family database: a resource for structural and functional annotation of genomes.

Orengo CA,Bray JE,Buchan DW,Harrison A,Lee D,Pearl FM,Sillitoe I,Todd AE,Thornton JM
Proteomics 2002

View Paper (PubMed)

CATH: an expanded resource to predict protein function through structure and sequence.

Dawson NL,Lewis TE,Das S,Lees JG,Lee D,Ashford P,Orengo CA,Sillitoe I
Nucleic Acids Res 2016

View Paper (PubMed) View Publication

Gene3D: Extensive prediction of globular domains in proteins.

Lewis TE,Sillitoe I,Dawson N,Lam SD,Clarke T,Lee D,Orengo C,Lees J
Nucleic Acids Res 2017

View Paper (PubMed) View Publication

Related Standards

Reporting Guidelines

No guidelines defined

Terminology Artifacts

Identifier Schemas

No identifier schema standards defined


No metrics standards defined

Implementing Policies

This record is not implemented by any policy.


Record Maintainer



Grant Number(s)

  • 081989/Z/07/Z (The Wellcome Trust, UK)

  • BB/K020013/1 (Biotechnology and Biological Sciences Research Council (BBSRC), UK)