Pfam Protein Families


General Information
The Pfam database is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). Proteins are generally composed of one or more functional regions, commonly termed domains. Different combinations of domains give rise to the diverse range of proteins found in nature. The identification of domains that occur within proteins can therefore provide insights into their function. Pfam also generates higher-level groupings of related entries, known as clans. A clan is a collection of Pfam entries which are related by similarity of sequence, structure or profile-HMM.
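The relationship between proteins, domains (Pfam entries) and clans described above can be sketched as a small data model. This is only an illustration: the identifiers ENTRY_A, CLAN_X, protein_1 and so on are hypothetical placeholders rather than real Pfam accessions, and the helper functions are not part of any Pfam tool or API.

```python
from collections import defaultdict

# Hypothetical identifiers, for illustration only (not real Pfam accessions).
# Each entry maps to the clan it belongs to, or None if it is in no clan.
ENTRY_TO_CLAN = {
    "ENTRY_A": "CLAN_X",
    "ENTRY_B": "CLAN_X",
    "ENTRY_C": None,
}

# Each protein is an ordered list of the domains found along its sequence.
PROTEIN_DOMAINS = {
    "protein_1": ["ENTRY_A", "ENTRY_C"],
    "protein_2": ["ENTRY_B", "ENTRY_C"],
    "protein_3": ["ENTRY_C"],
}


def clan_members(entry_to_clan):
    """Group entries by clan, skipping entries that belong to no clan."""
    clans = defaultdict(list)
    for entry, clan in entry_to_clan.items():
        if clan is not None:
            clans[clan].append(entry)
    return dict(clans)


def clan_architecture(domains, entry_to_clan):
    """Replace each entry by its clan where it has one, else keep the entry."""
    return [entry_to_clan.get(domain) or domain for domain in domains]


if __name__ == "__main__":
    print(clan_members(ENTRY_TO_CLAN))
    for name, domains in PROTEIN_DOMAINS.items():
        print(name, domains, clan_architecture(domains, ENTRY_TO_CLAN))
```

In this sketch, protein_1 and protein_2 carry different entries but share the same clan-level architecture, which is the kind of higher-level grouping that clans are meant to expose.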

Record updated: Aug. 12, 2016, 2:16 p.m. by The FAIRsharing Team.

Access / Retrieve Data

Conditions of Use

Applies to: Data use

Data Access

REST Web Services

Related Standards

Reporting Guidelines

No guidelines defined

Grant Number(s)

  • 108433/Z/15/Z (The Wellcome Trust)

  • BB/L024136/1 (Biotechnology and Biological Sciences Research Council)

  • WT077044/Z/05/Z (The Wellcome Trust)


The Pfam protein families database.

Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K, Holm L, Sonnhammer EL, Eddy SR, Bateman A
Nucleic Acids Res. 2010

The Pfam protein families database: towards a more sustainable future.

Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, Potter SC, Punta M, Qureshi M, Sangrador-Vegas A, Salazar GA, Tate J, Bateman A
Nucleic Acids Res. 2015