Gene Ontology (GO) Gene Association File Format 2.1
Abbreviation: GAF 2.1
This record replaces or incorporates the following deprecated resources:
Countries that developed this resource Worldwide
Created in 2015
How to cite this record FAIRsharing.org: GAF 2.1; Gene Ontology (GO) Gene Association File Format 2.1; DOI: https://doi.org/10.25504/FAIRsharing.FGJ2T8; Last edited: Sept. 20, 2019, 11:52 a.m.; Last accessed: Nov 29 2021 2:30 p.m.
Record added: April 8, 2019, 2:41 p.m.
Record updated: Sept. 20, 2019, 11:52 a.m. by suzi.
No XSD schemas defined
Conditions of UseApplies to: Data use
No guidelines defined
No semantic standards defined
Models and Formats
No identifier schema standards defined
No metrics standards defined
The Aspergillus Genome Database is a resource for genomic sequence data as well as gene and protein information for Aspergilli. This publicly available repository is a central point of access to genome, transcriptome and polymorphism data for the fungal research community.
Genetic, genomic and molecular information pertaining to the model organism Drosophila melanogaster and related sequences. This database also contains information relating to human disease models in Drosophila, the use of transgenic constructs containing sequence from other organisms in Drosophila, and information on where to buy Drosophila strains and constructs.
GeneDB is a genome database for prokaryotic and eukaryotic organisms and provides a portal through which data generated by the "Pathogen Genomics" group at the Wellcome Trust Sanger Institute and other collaborating sequencing centres can be accessed.
Saccharomyces Genome Database
The Saccharomyces Genome Database (SGD) provides comprehensive integrated biological information for the budding yeast Saccharomyces cerevisiae along with search and analysis tools to explore these data, enabling the discovery of functional relationships between sequence and gene products in fungi and higher organisms.
The Arabidopsis Information Resource
The Arabidopsis Information Resource (TAIR) maintains a database of genetic and molecular biology data for the model higher plant Arabidopsis thaliana. Data available from TAIR includes the complete genome sequence along with gene structure, gene product information, gene expression, DNA and seed stocks, genome maps, genetic and physical markers, publications, and information about the Arabidopsis research community. Gene product function data is updated every week from the latest published research literature and community data submissions.
PomBase is a model organism database that provides organization of and access to scientific data for the fission yeast Schizosaccharomyces pombe. PomBase supports genomic sequence and features, genome-wide datasets and manual literature curation as well as providing structural and functional annotation and access to large-scale data sets.
WormBase is an international consortium of biologists and computer scientists dedicated to providing the research community with accurate, current, accessible information concerning the genetics, genomics and biology of C. elegans and related nematodes.
Candida Genome Database
The Candida Genome Database (CGD) provides access to genomic sequence data and manually curated functional information about genes and proteins of the human pathogen Candida albicans. It collects gene names and aliases, and assigns gene ontology terms to describe the molecular function, biological process, and subcellular localization of gene products.
Gramene: A curated, open-source, integrated data resource for comparative functional genomics in plants
Gramene's purpose is to provide added value to plant genomics data sets available within the public sector, which will facilitate researchers' ability to understand the plant genomes and take advantage of genomic sequence known in one species for identifying and understanding corresponding genes, pathways and phenotypes in other plant species. It represents a broad spectrum of species ranging from unicellular photo-autotrophs, algae, monocots, dicots and other important taxonomic clades. Within Plant Reactome, a database portal of Gramene, there are over 60 plant genomes as well as pathways for more than 80 species.
Sol Genomics Network
The Sol Genomics Network (SGN) is a database and website dedicated to the genomic information of the Solanaceae family, which includes species such as tomato, potato, pepper, petunia and eggplant.
Gene Ontology Annotation Database
The GO Annotation Database (GOA) provides Gene Ontology (GO) annotations to proteins in the UniProt Knowledgebase (UniProtKB), RNA molecules from RNACentral and protein complexes from the Complex Portal. GOA files contain a mixture of manual annotation supplied by members of the Gene Onotology Consortium and computationally assigned GO terms describing gene products. Annotation type is clearly indicated by associated evidence codes and there are links to the source data.
Reactome - a curated knowledgebase of biological pathways
The cornerstone of Reactome is a freely available, open source relational database of signaling and metabolic molecules and their relations organized into biological pathways and processes. The core unit of the Reactome data model is the reaction. Entities (nucleic acids, proteins, complexes, vaccines, anti-cancer therapeutics and small molecules) participating in reactions form a network of biological interactions and are grouped into pathways. Examples of biological pathways in Reactome include classical intermediary metabolism, signaling, transcriptional regulation, apoptosis and disease. Inferred orthologous reactions are available for 15 non-human species including mouse, rat, chicken, zebrafish, worm, fly, and yeast.
Rat Genome Database
The Rat Genome Database stores genetic, genomic, phenotype, and disease data generated from rat research. It provides access to corresponding data for eight other species, allowing cross-species comparison. Data curation is performed both manually and via an automated pipeline, giving RGD users integrated access to a wide variety of data to support their research.
Mouse Genome Database - a Mouse Genome Informatics (MGI) Resource
MGI is the international database resource for the laboratory mouse, providing integrated genetic, genomic, and biological data to facilitate the study of human health and disease. Data includes gene characterization, nomenclature, mapping, gene homologies among mammals, sequence links, phenotypes, allelic variants and mutants, and strain data.
EcoCyc E. coli Database
EcoCyc is a model organism database for Escherichia coli K-12 MG1655. EcoCyc curation captures literature-based information on the functions of individual E. coli gene products, metabolic pathways, and regulation of E. coli gene expression. EcoCyc has been curated from 37,000 publications as of 2019. Updates to EcoCyc content continue to improve the comprehensive picture of E. coli biology. The utility of EcoCyc is enhanced by new tools available on the EcoCyc web site, and the development of EcoCyc as a teaching tool is increasing the impact of the knowledge collected in EcoCyc.
The Zebrafish Information Network
The Zebrafish Information Network, ZFIN, serves as the primary community database resource for the laboratory use of zebrafish. We develop and support integrated zebrafish genetic, genomic, developmental and physiological information and link this information extensively to corresponding data in other model organism and human databases.
dictyBase is a single-access database for the complete genome sequence and expression data of four Dictyostelid species providing information on research, genome and annotations. There is also a repository of plasmids and strains held at the Dicty Stock Centre. Relevant literature is integrated into the database, and gene models and functional annotation are manually curated from experimental results and comparative multigenome analyses.
A resource providing data on bioentities and their associated ontology terms for Plant Biology. The database provides access to ontology-based annotations of genes, phenotypes and germplasms from about 90 plant species. A number of internal and external ontologies are used to annotate the biological data available from this resource.
Scroll for more...
This record is not implemented by any policy.