Digital Object Identifier
In the following recommendations:
How to cite this record FAIRsharing.org: DOI; Digital Object Identifier; DOI: https://doi.org/10.25504/FAIRsharing.hFLKCn; Last edited: Feb. 24, 2020, 5:20 p.m.; Last accessed: Apr 17 2021 7:20 a.m.
Record added: April 23, 2018, 3:22 p.m.
Record updated: Feb. 24, 2020, 8:54 a.m. by The FAIRsharing Team.
Edits to 'https://fairsharing.org/FAIRsharing.hFLKCn' by 'The FAIRsharing Team' at 11:08, 24 Apr 2018 (approved): 'description' has been modified: Before: The digital object identifier [DOI] system originated in a joint initiative of three trade associations in the publishing industry (International Publishers Association; International Association of Scientific, Technical and Medical Publishers; Association of American Publishers). The system was announced at the Frankfurt Book Fair 1997. The International DOI Foundation (IDF) was created to develop and manage the DOI system, also in 1997. The DOI system was adopted as International Standard ISO 26324 in 2012. The DOI system implements the Handle System and adds a number of new features. The DOI system provides an infrastructure for persistent unique identification of objects of any type. The DOI system is designed to work over the Internet. A DOI name is permanently assigned to an object to provide a resolvable persistent network link to current information about that object, including where the object, or information about it, can be found on the Internet. While information about an object can change over time, its DOI name will not change. A DOI name can be resolved within the DOI system to values of one or more types of data relating to the object identified by that DOI name, such as a URL, an e-mail address, other identifiers and descriptive metadata. The DOI system enables the construction of automated services and transactions. Applications of the DOI system include but are not limited to managing information and documentation location and access; managing metadata; facilitating electronic transactions; persistent unique identification of any form of any data; and commercial and non-commercial transactions. The content of an object associated with a DOI name is described unambiguously by DOI metadata|based on a structured extensible data model that enables the object to be associated with metadata of any desired degree of precision and granularity to support description and services. The data model supports interoperability between DOI applications. The scope of the DOI system is not defined by reference to the type of content (format|etc.) of the referent|but by reference to the functionalities it provides and the context of use. The DOI system provides|within networks of DOI applications|for unique identification|persistence|resolution|metadata and semantic interoperability. After: The digital object identifier (DOI) system originated in a joint initiative of three trade associations in the publishing industry (International Publishers Association; International Association of Scientific, Technical and Medical Publishers; Association of American Publishers). The system was announced at the Frankfurt Book Fair 1997. The International DOI Foundation (IDF) was created to develop and manage the DOI system, also in 1997. The DOI system was adopted as International Standard ISO 26324 in 2012. The DOI system implements the Handle System and adds a number of new features. The DOI system provides an infrastructure for persistent unique identification of objects of any type. The DOI system is designed to work over the Internet. A DOI name is permanently assigned to an object to provide a resolvable persistent network link to current information about that object, including where the object, or information about it, can be found on the Internet. While information about an object can change over time, its DOI name will not change. A DOI name can be resolved within the DOI system to values of one or more types of data relating to the object identified by that DOI name, such as a URL, an e-mail address, other identifiers and descriptive metadata. The DOI system enables the construction of automated services and transactions. Applications of the DOI system include but are not limited to managing information and documentation location and access; managing metadata; facilitating electronic transactions; persistent unique identification of any form of any data; and commercial and non-commercial transactions. The content of an object associated with a DOI name is described unambiguously by DOI metadata|based on a structured extensible data model that enables the object to be associated with metadata of any desired degree of precision and granularity to support description and services. The data model supports interoperability between DOI applications. The scope of the DOI system is not defined by reference to the type of content (format|etc.) of the referent|but by reference to the functionalities it provides and the context of use. The DOI system provides|within networks of DOI applications|for unique identification|persistence|resolution|metadata and semantic interoperability.
Models and Formats
No syntax standards defined
No identifier schema standards defined
No metrics standards defined
figshare is a data publishing platform that is free for all researchers. Some of figshare’s core beliefs are: academic research outputs should be as open as possible, as closed as necessary; academic research outputs should never be behind a paywall; academic research outputs should be human and machine readable/query-able; academic infrastructure should be interchangeable; academic researchers should never have to put the same information into multiple systems at the same institution; identifiers for everything; and the impact of research is independent of where it is published and what type of output it is. figshare supports embargoing and managed access, and will embargo data while undergoing peer review. Metadata in figshare is licenced under is CC0. All files and metadata can be accessed from docs.figshare.com. figshare has also partnered with DuraSpace and Chronopolis to offer further assurances that public data will be archived under the stewardship of Chronopolis. In the highly unlikely event of multiple AWS S3 failures, figshare can restore public user content from Chronopolis. figshare is supported through Institutional, Funder, and Governmental service subscriptions.
Reactome - a curated knowledgebase of biological pathways
The cornerstone of Reactome is a freely available, open source relational database of signaling and metabolic molecules and their relations organized into biological pathways and processes. The core unit of the Reactome data model is the reaction. Entities (nucleic acids, proteins, complexes, vaccines, anti-cancer therapeutics and small molecules) participating in reactions form a network of biological interactions and are grouped into pathways. Examples of biological pathways in Reactome include classical intermediary metabolism, signaling, transcriptional regulation, apoptosis and disease. Inferred orthologous reactions are available for 15 non-human species including mouse, rat, chicken, zebrafish, worm, fly, and yeast.
Dryad is an open-source, community-led data curation, publishing, and preservation platform for CC0 publicly available research data. Dryad has a long-term data preservation strategy, and is a Core Trust Seal Certified Merritt repository with storage in US and EU at the San Diego Supercomputing Center, DANS, and Zenodo. While data is undergoing peer review, it is embargoed if the related journal requires / allows this. Dryad is an independent non-profit that works directly with: researchers to publish datasets utilising best practices for discovery and reuse; publishers to support the integration of data availability statements and data citations into their workflows; and institutions to enable scalable campus support for research data management best practices at low cost. Costs are covered by institutional, publisher, and funder members, otherwise a one-time fee of $120 for authors to cover cost of curation and preservation. Dryad also receives direct funder support through grants.
National Addiction & HIV Data Archive Program
NAHDAP acquires, preserves and disseminates data relevant to drug addiction and HIV research. The scope of the data housed at NAHDAP covers a wide range of legal and illicit drugs (alcohol, tobacco, marijuana, cocaine, synthetic drugs, and others) and the trajectories, patterns, and consequences of drug use as well as related predictors and outcomes.
Knowledge Network for Biocomplexity
The Knowledge Network for Biocomplexity (KNB) is an international repository intended to facilitate ecological and environmental research. KNB was created to share, discover, access and interpret complex ecological data. Contextual information provided with KNB data allows scientists to integrate and analyze data. The data originate from a highly-distributed set of field stations, laboratories, research sites, and individual researchers. The foundation of the KNB is the detailed metadata provided by researchers, which promotes both automated and manual integration of data into new projects. As part of the KNB effort, data management software is developed in a free and open source manner, so other groups can build upon the tools. The KNB is powered by the Metacat data management system, and is optimized for handling data sets described using the Ecological Metadata Language, but can store any XML-based metadata document.
PANGAEA - Data Publisher for Earth and Environmental Science
The information system PANGAEA is operated as an Open Access library aimed at archiving, publishing and distributing georeferenced data from earth system research. PANGAEA is a member of the ICSU World Data System (WDS).
Environmental Data Initiative Data Repository
The Environmental Data Initiative is an NSF-funded project meant to accelerate curation and archive of environmental data. With mature repository functionality and data curators on staff it supports the large and varied community of environmental researchers. The repository is open to all environmental research data and hosts data that provide a context to evaluate the nature and pace of ecological change, to interpret its effects, and to forecast the range of future biological responses to change. DOIs are assigned making all datasets first class, citable research objects and strict version control allows for immutability while also supporting updates to long-term datasets.
Open Science Framework
The Open Science Framework (OSF) is a free and open source project management tool that supports the entire research lifecycle: planning, execution, reporting, archiving, and discovery. Features include automated versioning, logging of all actions, collaboration support, free and unlimited file storage, registrations, and connections to other tools/services (ie. Dropbox, figshare, Amazon S3, Dataverse, GitHub). It is 100% free to researchers, open source, and intended for use in all domain areas. OSF has an open, public API to support broad indexing, as well as a partnership with Internet Archive for long-term preservation with a $250k preservation fund and an IMLS grant for transfer to Internet Archive (currently in progress). The OSF supports embargoing during peer review via a view-only link with the ability to anonymize contributor list. It also provides managed access by allowing access requests and private sharing settings. OSF is a non-profit with direct funder support through grants, government contracts, and community memberships.
Structural Biology Data Grid
The Structural Biology Data Grid (SBGrid-DG) community-driven repository to preserve primary experimental datasets that support scientific publications. The SBGrid Data Bank is an open source research data management system enabling Structural Biologists to preserve x-ray diffraction data and to make it accessible to the broad research community.
Sea scientific open data publication
Seanoe (SEA scieNtific Open data Edition) is a publisher of scientific data in the field of marine sciences. It is operated by Sismer within the framework of the Pôle Océan. Data published by SEANOE are available free. They can be used in accordance with the terms of the Creative Commons license selected by the author of data. Seance contributes to Open Access / Open Science movement for a free access for everyone to all scientific data financed by public funds for the benefit of research. An embargo limited to 2 years on a set of data is possible; for example to restrict access to data of a publication under scientific review. Each data set published by SEANOE has a DOI which enables it to be cited in a publication in a reliable and sustainable way. The long-term preservation of data filed in SEANOE is ensured by Ifremer infrastructure.
The Model Archive provides a stable archive for computational macro-molecular models published in the scientific literature. The model archive provides a unique stable accession code (DOI) for each deposited model, which can be directly referenced in the corresponding manuscripts.
Advanced Ecological Knowledge and Observation System
The Advanced Ecological Knowledge and Observation System from the Terrestrial Ecosystem Research Network (TERN AEKOS) focuses on ecological plot data that are generated by land-based ecosystem projects that use survey, monitoring and experimental studies to collect raw data on plants, animals (including traits) and their immediate environments at the same location. Plot data are observations sampled by using scientific techniques such as quadrants, grids, transects, animal trap arrays and other structured sampling techniques.
Zenodo is a generalist research data repository built and developed by OpenAIRE and CERN. It was developed to aid Open Science and is built on open source code. Zenodo helps researchers receive credit by making the research results citable and through OpenAIRE integrates them into existing reporting lines to funding agencies like the European Commission. Citation information is also passed to DataCite and onto the scholarly aggregators. Content is available publicly under any one of 400 open licences (from opendefinition.org and spdx.org). Restricted and Closed content is also supported. Free for researchers below 50 GB/dataset. Content is both online on disk and offline on tape as part of a long-term preservation policy. Zenodo supports managed access (with an access request workflow) as well as embargoing generally and during peer review. The base infrastructure of Zenodo is provided by CERN, a non-profit IGO. Projects are funded through grants.
Atmospheric Radiation Measurement Data Archive
The Atmospheric Radiation Measurement (ARM) Data Archive stores continuous measurements—supplemented by field campaigns—and provides data products that promote the advancement of climate models. ARM data include routine data products, value-added products (VAPs), field campaign data, complementary external data products from collaborating programs, and data contributed by ARM principal investigators for use by the scientific community. Data quality reports, graphical displays of data availability/quality, and data plots are also available from the ARM Data Center. These data are available free of charge to the public.
National Geoscience Data Centre Data Archive
The National Geoscience Data Centre (NGDC) collects and preserves geoscientific data and information, making them available to a wide range of users and communities. NGDC is recognised as the NERC Environmental Data Centre for geoscience data.
EarthChem develops and maintains databases, software, and services that support the preservation, discovery, access and analysis of geochemical data, and facilitate their integration with the broad array of other available earth science parameters. The EarthChem Portal stores geochemical data, provides a variety of search, visualization, and output formatting options, and is interoperable with the LEPR database and the MELTS software. EarthChem Library is a data repository that archives, publishes and makes accessible data and other digital content from geoscience research.
UK Polar Data Centre Data Archive
The UK Polar Data Centre (UK PDC) is the Natural Environment Research Council's (NERC) Designated Data Centre for polar science. It is the focal point for Arctic and Antarctic environmental data management in the UK.
The National Science Foundation funded OpenTopography facilitates community access to high-resolution, Earth science-oriented, topography data, and related tools and resources.
UNAVCO provides access to data that the community of geodetic scientists can use for quantifying the motions of rock, ice and water that are monitored by a variety of sensor types at or near the Earth's surface. The data types include GPS/GNSS, imaging data such as from SAR and TLS, strain and seismic borehole data, and meteorological data. Most of these can be accessed via web services. In addition, GPS/GNSS datasets, TLS datasets, and InSAR products are assigned digital object identifiers.
NERC British Oceanographic Data Centre Data Archive
BODC makes available biological, chemical, physical and geophysical data on the marine environment.
NERC EDS Environmental Information Data Centre
The Environmental Information Data Centre (EIDC) is a NERC Data Centre hosted by the UK Centre for Ecology & Hydrology (UKCEH). They manage nationally-important datasets concerned with the terrestrial and freshwater sciences.
Harvard Dataverse is a research data repository running on the open source web application Dataverse. Harvard Dataverse is fully open to the public, and allows upload and browsing of data from all fields of research, and is free for all researchers worldwide (up to 1 TB). Links to related grants, authors, software and research products are provided. Harvard Dataverse supports managed access (with an access request workflow) as well as embargoing generally and during peer review. Dataverse allows users to share, preserve, cite, explore, and analyse research data. It facilitates making data available to others, and allows you to replicate others' work more easily. Researchers, data authors, publishers, data distributors, and affiliated institutions all receive academic credit and web visibility. The Harvard Database receives support from Harvard University, public and private grants, and an emergent consortium model.
Mendeley Data is a multidisciplinary, free-to-use open repository specialized for research data. Data files of up to 10GB can be uploaded and shared. Search more than 20+ million datasets indexed from 1000s of data repositories and collect and share datasets with the research community following the FAIR data principles. Links are available to related authors, software, grants and research. Each version of a dataset is given a unique DOI, and dark archived with DANS (Data Archiving and Networking Services), ensuring that every dataset and citation will be valid in perpetuity. Metadata is licensed CC0, and datasets are and will continue to be free access. Mendeley Data will shortly support managed access, and currently supports embargoing of data both generally and while undergoing peer review. It is funded by a subscription model for Academic & Government entities.
Inter-university Consortium for Political and Social Research
ICPSR is a data archive of behavioral and social science research data. An international consortium of more than 750 academic institutions and research organizations, ICPSR provides leadership and training in data access, curation, and methods of analysis for the social science research community. ICPSR is a CoreTrustSeal core certified repository and was a 2019 United States National Medal for Museum and Library Service recipient, the nation’s highest honor given to museums and libraries that make significant and exceptional contributions to their communities.
Health and Medical Care Archive
The Health and Medical Care Archive (HMCA) is the data archive of the Robert Wood Johnson Foundation (RWJF), the largest philanthropy devoted exclusively to health and health care in the United States. Operated by the Inter-university Consortium for Political and Social Research (ICPSR) at the University of Michigan with funding from RWJF, HMCA preserves and disseminates data collected by selected research projects funded by RWJF and facilitates secondary analyses of the data. The data collections in HMCA primarily includes large-scale surveys of the American public about public health, attitudes towards health reform, and access to medical care; surveys of health care professionals and organizations, public health professionals, and nurses; evaluations of innovative programs for the delivery of health care, and many other topics and populations of interest. Our goal is to build a culture of health by increasing the understanding of health and health care and the factors that contribute to health in the United States through secondary analysis of RWJF-supported data collections.
World Data Center for Climate at DRKZ
The World Data Center Climate (WDCC) at the German Climate Computing Center (DKRZ: Deutsches Klimarechenzentrum GmbH) is part of an international effort to set up domain specific data portals as a service for the scientific community. The WDCC provides a long-term archiving service for large climate or Earth System research data sets. This service includes archiving and retrieval capability of data for time periods of 10 years or longer. The WDCC focuses on climate and climate-related data products, specifically those resulting from climate simulations. Metadata and research data in WDCC is curated in collaboration with the data providers. Data in WDCC is citable and can be directly integrated into scientific publications.
High Energy Physics Data Repository
The Durham High Energy Physics Database (HEPData) has been built up over the past four decades as a unique open-access repository for scattering data from experimental particle physics. It currently comprises the data points from plots and tables related to several thousand publications including those from the Large Hadron Collider (LHC). HEPData is funded by a grant from the UK STFC and is based at the IPPP at Durham University.
UK Solar System Data Centre Data Archive
The UK Solar System Data Centre (UKSSDC) is the Natural Environment Research Council's (NERC) Designated Data Centre for the Solar terrestrial physics and chemistry. It provides a central archive and data centre facility for Solar System science in the UK. The UKSSDC supports data archives for the whole UK solar system community encompassing solar, inter-planetary, magnetospheric, ionospheric and geomagnetic science. The UKSSDC is part of RAL Space based at the STFC run Rutherford Appleton Laboratory in Oxfordshire.
FAIRsharing is a FAIR-supporting resource that provides an informative and educational registry on data standards, databases, repositories and policy, alongside search and visualization tools and services that interoperate with other FAIR-enabling resources. FAIRsharing guides consumers to discover, select and use standards, databases, repositories and policy with confidence, and producers to make their resources more discoverable, more widely adopted and cited. Each record in FAIRsharing is curated in collaboration with the maintainers of the resource themselves, ensuring that the metadata in the FAIRsharing registry is accurate and timely. Every record is manually reviewed at least once a year. Records can be collated into collections, based on a project, society or organisation, or Recommendations, where they are collated around a policy, such as a journal or funder data policy.
G-Node Data Infrastructure Services
The German Neuroinformatics Node's data infrastructure (GIN) services provide a platform for comprehensive and reproducible management and sharing of neuroscience data. Building on well established versioning technology, GIN offers the power of a web based repository management service combined with a distributed file storage. The service addresses the range of research data workflows starting from data analysis on the local workstation to remote collaboration and data publication.
NASA GeneLab expands scientists’ access to experiments that explore the molecular response of terrestrial biology to spaceflight environments. The vast amounts of raw data generated by experiments aboard the International Space Station are being made available to a worldwide community of scientists and computational researchers.
4TU.ResearchData is an international data repository for science, engineering and design, open to anyone in the world to upload and download data. Its services include curation, sharing, long-term access and preservation of research datasets. These services are available to anyone around the world. In addition, 4TU.ResearchData also offers training and resources to researchers to support them in making research data findable, accessible, interoperable and reproducible (FAIR).
Institut Laue-Langevin Data Portal
The Institut Laue-Langevin (ILL) Data Portal is a repository for storage and access to experimental data obtained at the ILL. While individual DOIs for data are accessible, full access to the database, including searching and browsing, requires a login.
UK Data Archive
The UK Data Archive is an internationally acknowledged centre of expertise in acquiring, curating and providing access to social science and humanities data. We were founded in 1967, at the University of Essex, with the support of the then Social Science Research Council, with the aim of curating high-quality research data for analysis and reuse. The Economic and Social Research Council (ESRC) has continued to provide long-term commitment to the Data Archive and we are now a significant part of their UK data infrastructure. Since 2005 we have been designated a Place of Deposit by the National Archives, allowing us to curate public records. The UK Data Archive is the lead organisation of the UK Data Service, which provides unified access to the UK's largest collection of social, economic and population data. Funded by the ESRC, the UK Data Service provides access to regional, national and international social and economic data, support for policy-relevant research and guidance and training for the development of skills in data use.
Kaggle Datasets is a public open data platform which combines data, users, discussions, and software. Datasets can be searched, bookmarked and explored via Kernels and discussions. All types of data are allowed. Individual users and organizations can register. Datasets can be published either privately or shared publicly.
Ag Data Commons
The Ag Data Commons is a public, government, scientific research data catalog and repository available to help the agricultural research community share and discover research data funded by the United States Department of Agriculture and meet Federal open access requirements. Ag Data Commons holds data files managed directly by NAL and also links to datasets and resources located on other websites. The Ag Data Commons provides access to a wide variety of open data relevant to agricultural research and related domains. These may include subjects such as agronomy, genomics, hydrology, soils, agro-ecosystems, sustainability science, and economic statistics. Ag Data Commons was created to foster innovative data re-use, integration, and visualization to support bigger, better science and policy.
Norwegian Centre for Research Data
The Norwegian Centre for Research Data (NSD) is one of the largest archives for research data of its kind and provides data to researchers and students in Norway and abroad. Additionally, the NSD is a resource centre which assists researchers with regard to data gathering, data analysis, and issues of methodology, privacy and research ethics. The NSD is a certified archive (certified with the Core Trust Seal), and a member of DataCite. The NSD is also the data processor, data distributor and archive for the ESS – European Social Survey (http://www.europeansocialsurvey.org). The NSD is the Norwegian Service Provider for CESSDA (http://cessda.eu)
Electron Microscope Public Image Archive
EMPIAR, the Electron Microscopy Public Image Archive, is a public resource for raw electron microscopy images. Here, you can browse, upload and download the raw images used to build a 3D structure. The purpose of EMPIAR is to provide easy access to state-of-the-art raw data to facilitate methods development and validation, which will lead to better 3D structures. It complements the Electron Microscopy Data Bank (EMDB), where 3D volumes are stored.
Environmental Data Portal
EnviDat is the environmental data portal and repository developed by the Swiss Federal Research Institute WSL. The portal provides unified and managed access to environmental monitoring and research data. The portal has the capability to host and publish data sets. While sharing of data is centrally facilitated, data management remains decentralised and the know-how and responsibility to curate research data remains with the original data providers.
CyVerse Data Common Repository
The Data Commons provides services to manage, organize, preserve, publish, discover, and reuse data. Using our pipelines, you can easily publish data to the NCBI or directly to the CyVerse Data Commons. CyVerse Curated Data are stable and have DOIs. Community Released Data are maintained by community members and may not be permanent.
Imperial College Research Data Repository
A lightweight digital repository for data based on the concepts of collections of filesets. Both the collection and the fileset are assigned a DOI by the DataCite organisation which can be quoted in articles
Vivli Center for Global Clinical Research Data
The Vivli data repository provides a global data-sharing and analytics platform serving all elements of the international research community. It is focused on sharing individual participant-level data from completed clinical trials to serve the international research community. Vivli acts as a neutral broker between data contributor and data user and the wider data sharing community. Vivli is a non-profit organization focused on data sharing and analysis. Vivli provides managed access for human subject data. It provides a no-charge period for data only available within their secure research environment. There are costs after the no-charge time period ends. Vivli supports managed access as well as embargoing generally and during peer review. It has data preservation funding and assurances from Microsoft that it will maintain and archive data for the lifetime of its use, up to 20 years. Vivli is funded via grants and member fees.
Centre for Environmental Data Analysis Archive
The CEDA Archive operates the atmospheric and earth observation data centre functions on behalf of NERC for the UK atmospheric science and earth observation communities. It covers climate, composition, observations and NWP data as well as various earth observation datasets, including airborne and satellite data and imagery.
The Chemotion-repository is a repository for chemistry research data. The repository was founded at the Karlsruhe Institute of Technology (KIT) in Germany as a project funded by the German Research Foundation. The repository is domain specific, offering diverse functions to save and work with chemical datasets and analyse files. The DOI generation is provided by DataCite. The repository supports the storage of data related to molecules or reactions, with a focus on data from synthetic and analytic work. Data can be uploaded directly to the repository or can be added via a connector that mediates the transfer of data from the chemotion ELN (available as an Open Source) to the repository.
Signaling Pathways Project
The Signaling Pathways Project is an integrated 'omics knowledgebase based upon public, manually curated transcriptomic and cistromic (ChIP-Seq) datasets involving genetic and small molecule manipulations of cellular receptors, enzymes and transcription factors. Our goal is to create a resource where scientists can routinely generate research hypotheses or validate bench data relevant to cellular signaling pathways.
heiDATA is Heidelberg University’s research data repository. It is managed by the Competence Centre for Research Data, a joint institution of the University Library and the Computing Centre. All researchers affiliated with Heidelberg University can use this service for archiving and publishing their data. heiDATA runs on software from the Dataverse Project.
ZBW Journal Data Archive
The ZBW Journal Data Archive is a service for editors of journals in economics and management. The aim of this newly established web service is to offer scholarly journals an easy to handle infrastructure for managing and storing data sets of published articles and to link the data sets to their corresponding publication. The service is free of charge for academic journals.
Blackfynn Discover is a public resource for accessing large public Neuroscience datasets. Blackfynn Discover was developed through grants from the NIH NIDA, NIH CommonFund, DARPA, and others to provide a sustainable solution for fostering collaboration in the Neurosciences.
Duke Research Data Repository
The Research Data Repository is a service of the Duke University Libraries that provides curation, access, and preservation of research data produced by the Duke community. All data go through a curation process to help ensure data are well described, in a structure and format that supports long-term preservation, and generally meet the FAIR Guiding Principles for data (i.e., Findable, Accessible, Interoperable, and Reusable). All data and documentation are openly accessible for download unless under embargo.
The BonaRes Repository stores soil and agricultural research data from research projects and long-term field experiments which contribute significantly to the analysis of changes of soil and soil functions over the long term. Research data are described by the metadata following the BonaRes Metadata Schema (DOI: 10.20387/bonares-5pgg-8yrp) which combines international recognized standards for the description of geospatial data and research data. Metadata includes AGROVOC keywords.
Iowa State University's DataShare
Iowa State University’s DataShare is an open access repository for sharing, publishing, and archiving research data created by Iowa State University scholars and researchers.
UNC Dataverse is a research data repository hosted by the Odum Institute at the University of North Carolina at Chapel Hill. UNC Dataverse is an open access repository that accepts data deposits from individual researchers, research groups, institutions, journals, and other entities from all disciplinary domains. UNC Dataverse offers value-added features that support archiving, discovery, and sharing of research data that align with FAIR principles for findable, accessible, interoperable, reusable data.
National Archive of Criminal Justice Data
Established in 1978, the National Archive of Criminal Justice Data (NACJD) archives and disseminates data on crime and justice for secondary analysis. The archive contains data from over 2,700 curated studies or statistical data series. NACJD is home to several large-scale and well known datasets, including the National Crime Victimization Survey (NCVS), the FBI's Uniform Crime Reports (UCR), the FBI's National Incident-Based Reporting System (NIBRS), and the Project on Human Development in Chicago Neighborhoods (PHDCN). In addition to making data available, NACJD curates and preserves data to ensure that they are accessible now and in the future. Located within the Inter-University Consortium for Political and Social Research (ICPSR), a research center of the Institute for Social Research (ISR) at the University of Michigan, NACJD is sponsored by the Bureau of Justice Statistics (BJS), the National Institute of Justice (NIJ), and the Office of Juvenile Justice and Delinquency Prevention (OJJDP) of the United States Department of Justice.
Civic Learning, Engagement, and Action Data Sharing
Researchers from a wide variety of disciplines study civic education, civic action, and the many relationships between the two. Civic Learning, Engagement, and Action Data Sharing (CivicLEADS) provides infrastructure for researchers to share and access high-quality datasets which can be used to study civic education and involvement. Funded by a grant from the Spencer Foundation and part of ICPSR's Education and Child Care Data Archives at the University of Michigan, CivicLEADS provides a centralized repository for this multi-disciplinary research area, with data being created across education, political science, developmental sciences, and other disciplines. Researchers can access quantitative and qualitative data on a broad range of topics for secondary analysis as well as sharing their own primary research data. CivicLEADS includes datasets from other ICPSR archives and seeks out emerging data collected by studies still in the field. Beyond facilitating the sharing and discovery of data, CivicLEADS seeks to create a learning community around civic education and engagement research. We strive to form relationships with and between investigators and researchers at every level—from students to emeriti faculty. Data shared in this archive have been documented with thorough metadata, and tools drawing upon these metadata allow researchers to explore and compare variables both within and between studies. By providing tutorials, webinars, and in-person training, CivicLEADS connects researchers with data and facilitates the future of civic education and civic action research.
Biodiversity Heritage Library
The Biodiversity Heritage Library (BHL) is the world’s largest open access digital library for biodiversity literature and archives. Operating as an international library consortium, BHL provides free access to millions of pages, representing over 500 years of scientific research, alongside services like data exports, APIs, and taxonomic intelligence tools to facilitate discovery and reuse of data and collections.
The Tromsø Repository of Language and Linguistics
The Tromsø Repository of Language and Linguistics (TROLLing) is a repository of data, code, and other related materials used in linguistic research. The repository is open access, which means that all information is available to everyone. All postings are accompanied by searchable metadata that identify the researchers, the languages and linguistic phenomena involved, the statistical methods applied, and scholarly publications based on the data (where relevant). DataverseNO is aligned with the FAIR Guiding Principles for scientific data management and stewardship. Being part of DataverseNO, TROLLing is CoreTrustSeal certified.
DataverseNO (https://dataverse.no/) is a national, generic repository for open research data, owned and operated by UiT The Arctic University of Norway. DataverseNO is aligned with the FAIR Guiding Principles for scientific data management and stewardship. The technical infrastructure of the repository is based on the open source application Dataverse, which is developed by an international developer and user community led by Harvard University. DataverseNO is CoreTrustSeal certified.
NSF Arctic Data Center
The Arctic Data Center is the primary data and software repository for the Arctic section of NSF Polar Programs. The Center helps the research community to reproducibly preserve and discover all products of NSF-funded research in the Arctic, including data, metadata, software, documents, and provenance that links these together. The repository is open to contributions from NSF Arctic investigators, and data are released under an open license (CC-BY, CC0, depending on the choice of the contributor). All science, engineering, and education research supported by the NSF Arctic research program are included, such as Natural Sciences (Geoscience, Earth Science, Oceanography, Ecology, Atmospheric Science, Biology, etc.) and Social Sciences (Archeology, Anthropology, Social Science, etc.).
NASA Socioeconomic Data and Applications Center
The Socioeconomic Data and Applications Center (SEDAC) is a regular member of the World Data System and focuses on human interactions in the environment. Its mission is to develop and operate applications that support the integration of socioeconomic and Earth science data and to serve as an "Information Gateway" between the Earth and social sciences. The SEDAC is one of the Earth Observing System Data and Information System (EOSDIS) Distributed Active Archive Centers (DAACs) that archive and distribute earth science data, managed by NASA's Earth Science Data and Information System Project (ESDIS), as part of the Earth Science Data Systems (ESDS) Program.
Repository & Open Science Access Portal
ROSA P is the United States Department of Transportation (USDOT) National Transportation Library's (NTL) Repository and Open Science Access Portal. The name ROSA P was chosen to honor the role public transportation played in the civil rights movement, along with one of the important figures, Rosa Parks. Founded as an all-digital library program, NTL’s collections in ROSA P are full-text digital publications, datasets, and other resources. Legacy print materials that have been digitized are collected if they have historic, technical, or national significance. The repository is also designated as the full-text repository for USDOT-funded research under the USDOT Public Access Plan. Collections in ROSA P are available without restriction to transportation researchers, statistical organizations, the media, and the general public. NTL collects resources across all modes of transportation and related disciplines, with specific focus on information produced by USDOT, state DOTs, and other transportation organizations. Content types found in ROSA P include textual works, datasets, still image works, moving image works, other multimedia, and maps. All resources in ROSA P are in the public domain and/or explicit permission has been provided by the rights holder to NTL to make their materials available for free over the web.
Crossref is a central reference linking service, providing cross-publisher citation links via an API and visualization tools. Links to funders and other related events and outputs are also available. There are additional services for members such as content registration and reference linking.
The Funder Registry is an open registry of persistent identifiers for grant-giving organizations around the world. FundRef is the result of collaboration between funding agencies and publishers that correlates grants and other funding with the scholarly output of that support. Publishers participating in FundRef add funding data to the bibliographic metadata they already provide to Crossref for reference linking. FundRef data includes the name of the funder and a grant or award number. Metadata includes funder names, alternate names, and abbreviations.
Atmospheric Science Data Center
The Atmospheric Science Data Center (ASDC) at NASA Langley Research Center is responsible for processing, archiving, and distribution of NASA Earth science data in the areas of radiation budget, clouds, aerosols, and tropospheric chemistry. The ASDC specializes in atmospheric data important to understanding the causes and processes of global climate change and the consequences of human activities on the climate. The ASDC is one of the Earth Observing System Data and Information System (EOSDIS) Distributed Active Archive Centers (DAACs) that archive and distribute earth science data, managed by NASA's Earth Science Data and Information System Project (ESDIS), as part of the Earth Science Data Systems (ESDS) Program.
Archive of Data on Disability to Enable Policy and research
The Archive of Data on Disability to Enable Policy and research (ADDEP) is a repository of data on disability and rehabilitation. ADDEP was created to provide access to disability and rehabilitation data; provide technical assistance for data users; and build a connection between research data, community outreach, and activism. Types of data in ADDEP include disability status, health care, rehabilitation services and medicine, disability employment, income, education, and disability policies.
HydroShare is a system operated by The Consortium of Universities for the Advancement of Hydrologic Science Inc. (CUAHSI) that enables users to share and publish water-related data and models in a variety of flexible formats, and to make this information available in a citable, shareable and discoverable manner. HydroShare includes a repository for data and models, and tools (web apps) that can act on content in HydroShare providing users with a gateway to high performance computing and computing in the cloud. With HydroShare you can: share data and models with colleagues; manage access to shared content; share, access, visualize, and manipulate a broad set of hydrologic data types and models; publish data and models and obtain a citable digital object identifier (DOI); aggregate resources into collections; discover and access data and models published by others; use the web services application programming interface (API) to programmatically access resources; and use integrated web applications to visualize, analyze and run models with data in HydroShare.
National Tibetan Plateau/Third Pole Environment Data Center
The National Tibetan Plateau/Third Pole Environment Data Center (TPDC) stores research data concerning the Tibetan Plateau and surrounding regions. TPDC provides online and offline sharing protocols for data users with bilingual data sharing in Chinese and English. There are more than 2400 datasets shared by TPDC, covering geography, atmospheric science, cryospheric science, hydrology, ecology, geology, geophysics, natural resource science, social economy, and other fields. TPDC aims to provide data according to the FAIR ( findable, accessible, interoperable, and reusable) data sharing principles. Digital Object Identifiers (DOI) are used for scientific data access, tracking, and citation.
Research Vocabularies Australia
Research Vocabularies Australia (RVA) helps users access and reuse vocabularies for all areas of research. Some vocabularies are hosted by the Australian Research Data Commons (ARDC) and can be accessed directly through the RVA, otherwise links are provided to the vocabulary owner’s web page. Research Vocabularies Australia also allows you to create and/or publish a vocabulary as well as integrate an existing vocabulary into your own system.
Global Terrestrial Network for Permafrost - Database
The Global Terrestrial Network for Permafrost (GTN-P) database stores information regarding essential climate variables (ECV) as part of the Global Terrestrial Network on Permafrost (GTN-P) programme. The database aims to provide information on the relationships between ground temperature, gas fluxes and the Earth’s climate system. GTN-P contains time series for borehole temperatures and grids of active layer thickness (TSP, CALM) plus air and surface temperature and moisture (DUE Permafrost, MODIS) measured in the terrestrial Panarctic, Antarctic and Mountainous realms.
Astromaterials Data System
The Astromaterials Data System is a NASA funded data infrastructure that supports the preservation, discovery, access and analysis of laboratory analytical data generated by the study of astromaterials samples. The Astromaterials Data System operates the Astromaterials Data Repository (AstroRepo), which archives, publishes and makes accessible data generated by the study of lunar samples (samples collected during the Apollo missions), meteorites, interstellar dust, solar wind, presolar grains, and other extraterrestrial materials. AstroRepo follows international best practices for trusted data repositories. The Astromaterials Data System is operated at the Lamont-Doherty Earth Observatory of Columbia University as part of a suite of data systems for Earth and planetary sample data (chemical, mineralogical, and petrological observations), including EarthChem, the System for Earth Sample Registration, and the Library of Experimental Phase Relations.
The Dimensions database is a scholarly database containing research articles, citations, books, chapters, and conference proceedings, as well as awarded grants, patents, clinical trials, policy documents, datasets and altmetric information. The free version includes a searchable publications index and links to all the other different entities. The subscription version includes further faceting, further analytical capabilities, and searchable indices of the non-publication content.
Global Collaboration Engine
GLOBE (Global Collaboration Engine) is an online collaborative environment that enables land change researchers to share, compare and integrate local and regional studies with global data to assess the global relevance of their work.
Open Research Data Online
Open Research Data Online (ORDO) is The Open University's research data repository. Based upon the Figshare platform, ORDO can be used for the storage of live research data, but is particularly useful for archiving and publishing research data supporting a publication or once a project is completed.
University of Tasmania Research Data Portal
The University of Tasmania Research Data Portal (RDP) enables UTAS researchers to securely store and publish their datasets. Datasets published in RDP are publicly available through the Research Data Australia Search Portal (https://researchdata.edu.au/).
Integrated Carbon Observation System Data Portal
ICOS Data Portal provides observational data and elaborated products on greenhouse gases. Data sets can be visualised and downloaded fully and/or partially. ICOS data meets the global standards for atmospheric, ecosystem flux and marine observations of greenhouse gases. All information on observations and the metadata is stored in persistent long-term repositories.
Bridges is a collaborative digital repository for Monash University researchers. It allows researchers to manage, store, share and selectively publish their research files as citable research outputs with a persistent URL. Researchers maintain control over who can see their private files and all uploads are stored at Monash University.
Data Integration and Analysis System
The Data Integration and Analysis System (DIAS) was created to collect and store earth observation data; to analyze such data in combination with socio-economic data, and convert data into information useful for crisis management with respect to global-scale environmental disasters, and other threats; and to make this information available within Japan and overseas.
National Institute of Mental Health Data Archive
The National Institute of Mental Health Data Archive (NDA), originally established to support autism research, is an informatics platform and data repository that facilitates data sharing across all of mental health and other research communities, combining data from each of these repositories into a single resource with a single process for gaining access to all shared data. While querying and browsing data is publicly available, all subject-level data is available behind a login.
GEOFON seeks to facilitate cooperation in seismological research and earthquake and tsunami risk mitigation by providing rapid open transnational access to seismological data and source parameters of large earthquakes, and keeping these data accessible in the long term. It pursues these aims by operating and maintaining its own global network of permanent seismic stations in cooperation with local partners, facilitating real-time access to data from this network and those of many partner networks and plate boundary observatories, and providing a FAIR archive for seismological data. It also serves as the permanent archive for the Geophysical Instrument Pool at Potsdam (GIPP). Using real-time data streams, GEOFON provides rapid automatic earthquake parametric information to the scientific community, tsunami warning centres worldwide, organizations involved in rescue operations, news media and the general public. GEOFON is one of the nodes of the European Integrated Data Archive (EIDA) and a key nodal member of the European-Mediterranean Seismological Centre (EMSC).
Leibniz Centre for Agricultural Landscape Research Open Research Data
The Leibniz Centre for Agricultural Landscape Research Open Research Data (ZALF Open Research Data) repository contains freely accessible primary research data primarily created at ZALF, although data from other institutions involved in landscape research may also be submitted. The portal provides free access to data and their descriptive metadata, including information about research context, instruments, methods, research areas and data management aspects. Data registered with this resource will be given a DOI. This resource provides data in the areas of climate change, food security, biodiversity conservation, resource scarcity, crop systems, and sustainability.
Database of Individual Seismogenic Sources
The Database of Individual Seismogenic Sources (DISS) is a georeferenced repository of tectonic, fault, and paleoseismological information expressly devoted, but not limited, to potential applications in the assessment of seismic hazard at regional and national scale. DISS represents faults in 3D and contains fully-parameterized records. The core objects of DISS are: individual seismogenic sources, composite seismogenic sources, debated seismogenic sources and subduction zones.
DataStream is an open data platform for sharing information about freshwater health. It allows users to access, visualize and download full water quality datasets collected by communities and researchers throughout three regional hubs in the Mackenzie River Basin, Lake Winnipeg watershed and across Atlantic Canada.
International Crops Research Institute for the Semi-Arid Tropics Dataverse
The International Crops Research Institute for the Semi-Arid Tropics (ICRISAT) Dataverse stores data relating to agricultural research in Asia and sub-Saharan Africa. The goal of ICRISAT and its partners is to use this research to overcome poverty, hunger and a degraded environment through better agriculture.
U.S. Antarctic Program Data Center
The U.S. Antarctic Program Data Center (USAP-DC) provides a central USAP Project Catalog for all projects funded by the NSF Antarctic program and a Data Repository for research datasets derived from these projects. Data managed include the NSF research related glaciology data collection formerly managed through NSIDC. Datasets are registered in the Antarctic Master Directory to comply with the Antarctic Treaty and represent the U.S. in Scientific Committee on Antarctic Research (SCAR) activities.
Human Sciences Research Council Research Data Service
The HSRC Research Data Service provides a digital repository facility for the HSRC’s research data in support of evidence-based human and social development in South Africa and the broader region. The mission is to make research data accessible and to ensure its future survival and usability, with the aim to develop a quantitative and qualitative social science research data collection of high re-use value that speaks to the priorities of a developing country, such as the South African Social Attitudes Survey (SASAS) and the South African National HIV Prevalence, HIV Incidence, Behaviour and Communication Survey (SABSSM) amongst others. Access to data is dependent on ethical requirements for protecting research participants, as well as on legal agreements with the owners, funders, or in the case of data owned by the HSRC, the requirements of the depositors of the data. All data sets are provided free of charge. Metadata records that describe data sets, as well as related documentation including Read me files, user guides, code books, questionnaires, and other related information are provided such that informed decisions can be made about whether the data will be useful for an intended purpose.
Bicocca Open Archive Research Data
BOARD (Bicocca Open Archive Research Data) is the institutional data repository of the University of Milano-Bicocca. BOARD is an open, free-to-use research data repository, which enables members of University of Milano-Bicocca to make their research data publicly available. By depositing their research data in BOARD researchers can make their research data citable, share their data privately or publicly, ensure long-term storage for their data, keep access to all versions and link their article to their data.
arthistoricum.net@heiDATA is the research data repository of arthistoricum.net (Specialized Information Service Art - Photography - Design). It provides art historians with the opportunity to permanently publish and archive research data in the field of art history in connection with an open access online publication (e.g. article, ejournal, ebook) hosted by Heidelberg University Library. All research data e.g. images, videos, audio files, tables, graphics etc. receive a DOI (Digital Object Identifier). The data publications can be cited, viewed and permanently linked to as distinct academic output.
Open Data Commons for Spinal Cord Injury
The Open Data Commons for Spinal Cord Injury is a cloud-based community-driven repository to store, share, and publish spinal cord injury research data. There are several challenges for scientific reproducibility and bench-to-bedside translation. For example, only research and data that are published actually get disseminated, a phenomenon known as publication bias. Published research reflects to only a small fraction of all data collected, and data that do not lead to publication are largely ignored, hidden away in filing cabinets and hard drives. This results in an abundance of inaccessible scientific data known as “dark data”. Even when research is disseminated, it is usually in the form of summary reports of aggregated data (e.g. averages across individual subjects) such as scientific articles. The fact that the individual subject-level data are inaccessible further contributes to dark data. The spinal cord injury (SCI) community created the ODC-SCI to mitigate dark data in SCI research. The ODC-SCI also aims to increase transparency with individual-level data, enhance collaboration, facilitate advanced analytics, and conform to increasing mandates by funders and publishers to make data accessible. Members of the ODC-SCI have access to a private digital lab space managed by the PI or multi-PIs for dataset storage and sharing. The PIs can share their labs’ datasets with the registered members of the ODC-SCI community and make their datasets public and citable. The ODC-SCI implements stewardship principles that scientific data be made FAIR (Findable, Accessible, Interoperable and Reusable) and has been widely adopted by the international SCI research community.
Propylaeum@heiDATA is the research data repository of Propylaeum (Specialized Information Service Classics). It provides classical scholars with the opportunity to archive their research data permanently in connection with an open access online publication (e.g. article, ejournal, ebook) hosted by Heidelberg University Library. All research data e.g. images, videos, audio files, tables, graphics receive a DOI (Digital Object Identifier). The data publications can be cited, viewed and permanently linked to as distinct academic output.
Scroll for more...