Binary Alignment Map Format
How to cite this record FAIRsharing.org: BAM; Binary Alignment Map Format; DOI: https://doi.org/10.25504/FAIRsharing.hza1ec; Last edited: Jan. 8, 2019, 1:27 p.m.; Last accessed: May 15 2021 7:25 p.m.
Record updated: July 12, 2017, 10:56 a.m. by The FAIRsharing Team.
Edits to 'https://fairsharing.org/FAIRsharing.hza1ec' by 'The FAIRsharing Team' at 10:56, 12 Jul 2017 (approved): 'description' has been modified; the only change was the spelling "index-able" to "indexable". Current description: BAM is the compressed binary version of the Sequence Alignment/Map (SAM) format, a compact and indexable representation of nucleotide sequence alignments. Many next-generation sequencing and analysis tools work with SAM/BAM. For custom track display, the main advantage of indexed BAM over PSL and other human-readable alignment formats is that only the portions of the files needed to display a particular region are transferred to UCSC. This makes it possible to display alignments from files so large that the connection to UCSC would time out if the whole file were uploaded. Both the BAM file and its associated index file remain on your web-accessible server (http, https, or ftp), not on the UCSC server. UCSC temporarily caches the accessed portions of the files to speed up interactive display.
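As a rough illustration of the custom track setup described above, the following Python sketch builds the single track line that points UCSC at a remotely hosted BAM file. It assumes the usual UCSC conventions of a `track type=bam ... bigDataUrl=...` line and an index file stored next to the BAM with `.bai` appended to its name. The helper names `bam_index_url` and `bam_track_line` and the example.org URL are hypothetical, chosen only for this example.

```python
from urllib.parse import urlparse

# Protocols the description lists for the web-accessible server.
ALLOWED_SCHEMES = {"http", "https", "ftp"}


def bam_index_url(bam_url: str) -> str:
    """Return the expected index location.

    Assumption: the index sits next to the BAM file and is named by
    appending '.bai' to the full BAM file name.
    """
    return bam_url + ".bai"


def bam_track_line(bam_url: str, name: str) -> str:
    """Build a custom track line that references a remote BAM file.

    Only the URL is handed to the browser; the BAM and its index stay
    on the hosting server, as the description above states.
    """
    scheme = urlparse(bam_url).scheme
    if scheme not in ALLOWED_SCHEMES:
        raise ValueError(f"unsupported URL scheme: {scheme!r}")
    return f'track type=bam name="{name}" bigDataUrl={bam_url}'


if __name__ == "__main__":
    url = "https://example.org/data/sample.sorted.bam"  # hypothetical URL
    print(bam_track_line(url, "Sample alignments"))
    print(bam_index_url(url))
```

The sketch only formats and checks strings. It does not verify that the BAM file is sorted or that the index actually exists on the server, both of which region-based access depends on.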
Domain list has changed.
Previous values: Deoxyribonucleic acid (DNA) (material, http://purl.obolibrary.org/obo/SO_0000352); General purpose; Sequence (http://edamontology.org/data_2044); Sequence alignment (http://edamontology.org/data_0863).
New values: Deoxyribonucleic acid (DNA) (material, http://purl.obolibrary.org/obo/SO_0000352); Sequence (http://edamontology.org/data_2044); Sequence alignment (http://edamontology.org/data_0863); Life Science (domain).
|contact form||https://genome.ucsc.edu/cgi-bin/hgUserSu ...|
|online documentation||http://genome.ucsc.edu/goldenPath/help/b ...|
No XSD schemas defined
Conditions of Use
No publications available
No guidelines defined
No semantic standards defined
Models and Formats
No identifier schema standards defined
No metrics standards defined
The Encyclopedia of DNA Elements (ENCODE) has created a comprehensive parts list of functional elements in the human genome, including elements that act at the protein and RNA levels, and regulatory elements that control cells and circumstances in which a gene is active. UCSC coordinated data for the ENCODE Consortium from its inception in 2003 (Pilot phase) to the end of the first 5-year phase of whole-genome data production in 2012. All data produced by ENCODE investigators and the results of ENCODE analysis projects from this period are hosted in the UCSC Genome Browser and database.
modMine is an integrated web resource of data and tools to browse and search modENCODE data and experimental details, download results and access the GBrowse genome browser.
UCSC Genome Browser database
Genome assemblies and aligned annotations for a wide range of vertebrates and model organisms, along with an integrated tool set for visualizing, comparing, analyzing and sharing both publicly available and user-generated genomic datasets.
The UCSC Archaeal Genome Browser
The UCSC Archaeal Genome Browser is a window on the biology of more than 100 microbial species from the domain Archaea. Basic gene annotation is derived from NCBI Genbank/RefSeq entries, with overlays of sequence conservation across multiple species, nucleotide and protein motifs, non-coding RNA predictions, operon predictions, and other types of bioinformatic analyses. In addition, we display available gene expression data (microarray or high-throughput RNA sequencing). Direct contributions or notices of publication of functional genomic data or bioinformatic analyses from archaeal research labs are very welcome.
A comprehensive online knowledgebase for the monkey research community.
The European Genome-phenome Archive
The European Genome-phenome Archive (EGA) allows you to explore datasets from genomic studies, provided by a range of data providers. Access to datasets must be approved by the specified Data Access Committee (DAC).
Ensembl creates, integrates and distributes reference datasets and analysis tools that enable genomics. Ensembl is a genome browser that supports research in comparative genomics, evolution, sequence variation and transcriptional regulation. Ensembl annotates genes, computes multiple alignments, predicts regulatory function and collects disease data.
SoyBase, the USDA-ARS soybean genetic database, is a comprehensive repository for professionally curated genetics, genomics and related data resources for soybean. SoyBase contains genetic, physical and genomic sequence maps integrated with qualitative and quantitative traits. The quantitative trait loci (QTL) represent more than 18 years of QTL mapping of more than 90 unique traits. SoyBase also contains the well-annotated 'Williams 82' genomic sequence and associated data mining tools. The genetic and sequence views of the soybean chromosomes and the extensive data on traits and phenotypes are extensively interlinked. This allows entry to the database using almost any kind of available information, such as genetic map symbols, soybean gene names or phenotypic traits. SoyBase is the repository for controlled vocabularies for soybean growth, development and trait terms, which are also linked to the more general plant ontologies.
Database of Genotypes and Phenotypes
The Database of Genotypes and Phenotypes (dbGaP) archives and distributes the results of studies that have investigated the interaction of genotype and phenotype. Such studies include genome-wide association studies, medical sequencing, molecular diagnostic assays, as well as association between genotype and non-clinical traits.
The ENCODE (Encyclopedia of DNA Elements) Consortium is an international collaboration of research groups funded by the National Human Genome Research Institute (NHGRI). The goal of ENCODE is to build a comprehensive parts list of functional elements in the human genome, including elements that act at the protein and RNA levels, and regulatory elements that control cells and circumstances in which a gene is active. ENCODE results from 2007 and later are available from this project. This covers data generated during the two production phases 2007-2012 and 2013-present.
Dog Genome SNP Database
Dog Genome SNP Database (DoGSD) is a data container for the variation information of dog and wolf genomes. It was designed and constructed as a SNP detection and visualization tool to provide the research community with a useful resource for studying dog populations, evolution, phenotypes and life habits.
The International Genome Sample Resource
The International Genome Sample Resource (IGSR) was established to ensure the ongoing usability of data generated by the 1000 Genomes Project and to extend the data set. The 1000 Genomes Project ran between 2008 and 2015, creating the largest public catalogue of human variation and genotype data. As the project ended, the Data Coordination Centre at EMBL-EBI has received continued funding from the Wellcome Trust to maintain and expand the resource. IGSR was set up to do this and has the following aims: ensure the future access to and usability of the 1000 Genomes reference data; incorporate additional published genomic data on the 1000 Genomes samples; and expand the data collection to include new populations not represented in the 1000 Genomes Project.
4DNucleome Data Portal
The 4D Nucleome Data Portal (4DN) hosts data generated by the 4DN Network and other reference nucleomics data sets, and an expanding tool set for open data processing and visualization. It is a platform to search, visualize, and download nucleomics data.
Virtual Chinese Genome Database
Virtual Chinese Genome Database (VCGDB) is a genome database of the Chinese population based on the whole genome sequencing data of 194 individuals. We are unsure when this database was last updated, and as such we have marked this record as Uncertain. Please contact us if you have any information on its current status.
National Cancer Institute's Genomic Data Commons
The National Cancer Institute's (NCI's) Genomic Data Commons (GDC) was created to promote precision medicine in oncology. It supports the import and standardization of genomic and clinical data from cancer research programs. The GDC contains NCI-generated data from some of the largest and most comprehensive cancer genomic datasets, including The Cancer Genome Atlas (TCGA) and Therapeutically Applicable Research to Generate Effective Therapies (TARGET). For the first time, these datasets have been harmonized using a common set of bioinformatics pipelines, so that the data can be directly compared. As a growing knowledge system for cancer, the GDC also enables researchers to submit data, and harmonizes these data for import into the GDC.
National Institute of Mental Health Data Archive
The National Institute of Mental Health Data Archive (NDA), originally established to support autism research, is an informatics platform and data repository that facilitates data sharing across all of mental health and other research communities, combining data from each of these repositories into a single resource with a single process for gaining access to all shared data. While querying and browsing data is publicly available, all subject-level data is available behind a login.
This record is not implemented by any policy.