Minimum Information about any (x) Sequence
In the following recommendations:
How to cite this record FAIRsharing.org: MIxS; Minimum Information about any (x) Sequence; DOI: https://doi.org/10.25504/FAIRsharing.9aa0zp; Last edited: Jan. 8, 2019, 1:37 p.m.; Last accessed: Apr 11 2021 2:05 a.m.
Record updated: July 12, 2018, 9:46 p.m. by The FAIRsharing Team.
Edits to 'https://fairsharing.org/FAIRsharing.9aa0zp' by 'The FAIRsharing Team' at 21:46, 12 Jul 2018 (approved): 'miriam_url' has been modified: Before: None After:
Edits to 'https://fairsharing.org/FAIRsharing.9aa0zp' by 'pyilmaz' at 13:27, 27 Apr 2017 (approved): 'homepage' has been modified: Before: http://wiki.gensc.org/index.php?title=MIxS After: http://gensc.org/mixs/
No XSD schemas defined
Conditions of Use
Models and Formats
No identifier schema standards defined
No metrics standards defined
GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences. The complete release notes for the current version of GenBank are available on the NCBI ftp site. A new release is made every two months. GenBank growth statistics for both the traditional GenBank divisions and the WGS division are available from each release. GenBank is part of the International Nucleotide Sequence Database Collaboration (INSDC), which comprises the DNA DataBank of Japan (DDBJ), the European Molecular Biology Laboratory (EMBL), and GenBank at the NCBI. These three organizations exchange data on a daily basis.
DNA Data Bank of Japan
An annotated collection of all publicly available nucleotide and protein sequences. DDBJ collects sequence data mainly from Japanese researchers, as well as researchers in any other countries. DDBJ is part of the International Nucleotide Sequence Database Collaboration (INSDC), which comprises the DNA DataBank of Japan (DDBJ), the European Molecular Biology Laboratory (EMBL), and GenBank at the NCBI. These three organizations exchange data on a daily basis.
Genomes OnLine Database
The Genomes Online Database provides access to information regarding genome and metagenome sequencing projects, and their associated metadata, around the world. Information in GOLD is organized into four levels: Study, Biosample/Organism, Sequencing Project and Analysis Project.
European Nucleotide Archive
The European Nucleotide Archive (ENA) is a globally comprehensive data resource for nucleotide sequence, spanning raw data, alignments and assemblies, functional and taxonomic annotation and rich contextual data relating to sequenced samples and experimental design. Serving both as the database of record for the output of the world's sequencing activity and as a platform for the management, sharing and publication of sequence data, the ENA provides a portfolio of services for submission, data management, search and retrieval across web and programmatic interfaces. The ENA is part of the International Nucleotide Sequence Database Collaboration (INSDC), which comprises the DNA DataBank of Japan (DDBJ), the European Molecular Biology Laboratory (EMBL), and GenBank at the NCBI. These three organizations exchange data on a daily basis.
The European Genome-phenome Archive
The European Genome-phenome Archive (EGA) allows you to explore datasets from genomic studies, provided by a range of data providers. Access to datasets must be approved by the specified Data Access Committee (DAC).
EBI Metagenomics has changed its name to MGnify to reflect a change in scope. This is a free-to-use resource aiming at supporting all metagenomics researchers. The service is an automated pipeline for the analysis and archiving of metagenomic data that aims to provide insights into the phylogenetic diversity as well as the functional and metabolic potential of a sample. You can freely browse all the public data in the repository.
Large-scale genomic data for the peanut have only become available in the last few years, with the advent of low-cost sequencing technologies. To make the data accessible to researchers and to integrate across diverse types of data, the International Peanut Genomics Consortium funded the development of PeanutBase. This database provides access to genetic maps and markers, locations of quantitative trait loci (QTLs), genome sequences, gene locations and sequences, gene families and correspondences with genes in other species, and descriptions of traits and growth characteristics. It also provides tools for exploration and analysis, including sequence of genomic and genic sequences, and keyword searches of genes, gene families, and QTL studies. These resources should facilitate breeding advancements in peanut, helping improve crop productivity and there are a variety of resources for peanut research around the web, ranging from tools for basic plant biology to information for growers and various sectors of the peanut industry to resources for plant breeders.
CyVerse Data Common Repository
The Data Commons provides services to manage, organize, preserve, publish, discover, and reuse data. Using our pipelines, you can easily publish data to the NCBI or directly to the CyVerse Data Commons. CyVerse Curated Data are stable and have DOIs. Community Released Data are maintained by community members and may not be permanent.
The MAR databases is a collection of manually curated marine microbial contextual and sequence databases, based at the Marine Metagenomics Portal. This was developed as a part of the ELIXIR EXCELERATE project in 2017 and is maintained by The Center for Bioinformatics (SfB) at the UiT The Arctic University of Norway. SfB is hosting the UiT node of ELIXIR Norway. The MarRef, MarDb, MarFun and MarCat contextual databases are built by compiling data from a number of public available sequence, taxonomy and literature databases in a semi-automatic fashion.
Genomic Observatories Meta-Database
The Genomic Observatories Meta-Database (GEOME) is a web-based database that captures the who, what, where, and when of biological samples and associated genetic sequences. GEOME helps users with the following goals: ensure the metadata from your biological samples is findable, accessible, interoperable, and reusable; improve the quality of your data and comply with global data standards; and integrate with R, ease publication to NCBI's sequence read archive, and work with an associated LIMS. The initial use case for GEOME came from the Diversity of the Indo-Pacific Network (DIPnet) resource.
National Microbiome Data Collaborative
The National Microbiome Data Collaborative seeks to address fundamental roadblocks in microbiome data science and gaps in transdisciplinary collaboration. The NMDC's Phase I Pilot was launched in July 2019 funded by the Department of Energy’s (DOE) Office of Science, Biological and Environmental Research program, with two strategic priorities — infrastructure and engagement. The infrastructure goal is to democratize microbiome data science by providing access to multidisciplinary, multi-omics data (e.g., metagenomics and metaproteomics) through distributed data resources, with seamless integration to platforms that support reproducible, cross- study analyses. The engagement goal is to enable community-driven shared ownership of microbiome data that supports open science across research teams, funders, publishers, and societies. Currently, no data are available, as this project is still in the development stage. Please sign up for our Quarterly newsletter via the website (starting in Sep 2020) for updates, feature releases, and ways to get involved.
Scroll for more...
National Science Foundation (NSF), USA (Government body)
Natural Environment Research Council (NERC), UK (Government body)
The Gordon and Betty Moore Foundation, USA (Charitable foundation)
Genomic Standards Consortium (Consortium)