The SCHARE platform is built on a set of established components (Google Cloud Platform, Terra and GitHub) used in flagship scientific projects at NIH.
SCHARE’s cloud-based platform contains:
Registration is required to access the SCHARE platform (learn more and register).
STATUS: The SCHARE Datasets collection is accessible to all SCHARE-registered researchers. New datasets are being actively added.
On SCHARE, researchers can access, link, analyze, and export a wealth of datasets relevant to research in health disparities and health care outcomes, including:
Datasets are grouped by these categories:
Learn more about SCHARE Datasets and access the SCHARE platform (registration required).
STATUS: The SCHARE/PhenX Core Common Data Elements are available to all researchers through the National Library of Medicine.
Endorsed by the National Institutes of Health, the SCHARE/PhenX Core Common Data Elements (CCDEs) are standardized questions and responses that can be used across different studies to ensure consistent data collection and facilitate interoperability. CCDEs enable researchers to efficiently design data collection, management, and analysis plans; link data from different sources; and harmonize data to generate large datasets for AI use.
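As a rough illustration of how shared data elements make linking and harmonization possible, the sketch below pools records from two studies that asked about education in different ways. Everything in it is hypothetical: the element name `education_level`, the response codes, the study variable names, and the records are invented for this example and are not taken from the actual SCHARE/PhenX CCDEs.

```python
# Minimal sketch of harmonizing two studies onto one common data element.
# All names, codes, and records below are hypothetical, for illustration only.

COMMON_ELEMENT = "education_level"

# Hypothetical standardized response codes for the common element.
COMMON_RESPONSES = {
    1: "Less than high school",
    2: "High school or equivalent",
    3: "College or higher",
}

# Each study documents how its own variable and codes map to the common element.
STUDY_A_MAPPING = {"variable": "educ", "codes": {"LT_HS": 1, "HS": 2, "COLLEGE": 3}}
STUDY_B_MAPPING = {"variable": "q12_schooling", "codes": {0: 1, 1: 2, 2: 3}}


def harmonize(records, mapping, study_id):
    """Translate study-specific records into rows keyed by the common element."""
    rows = []
    for record in records:
        raw_value = record[mapping["variable"]]
        rows.append({
            "study": study_id,
            "participant": record["id"],
            # Unmapped responses become None rather than being guessed.
            COMMON_ELEMENT: mapping["codes"].get(raw_value),
        })
    return rows


study_a = [{"id": "A1", "educ": "HS"}, {"id": "A2", "educ": "COLLEGE"}]
study_b = [{"id": "B1", "q12_schooling": 0}, {"id": "B2", "q12_schooling": 2}]

# Once both studies use the same element and codes, the rows can be pooled directly.
pooled = harmonize(study_a, STUDY_A_MAPPING, "A") + harmonize(study_b, STUDY_B_MAPPING, "B")

for row in pooled:
    label = COMMON_RESPONSES.get(row[COMMON_ELEMENT], "Unknown")
    print(row["study"], row["participant"], label)
```

A study that collects the common element directly, using the standardized question and responses from the start, would not need the mapping step at all, which is the main practical benefit of adopting CCDEs at the design stage.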
Learn more about the SCHARE Core Common Data Elements.
STATUS: The SCHARE Data Repository is available to SCHARE-registered researchers.
The SCHARE Repository enables researchers to meet the requirements of the NIH Data Management and Sharing policy, which requires the hosting, management, and sharing of data generated by NIH-funded research programs. SCHARE provides a repository for projects focused on population science topics, such as health disparities and public health outcomes. All SCHARE-registered users, including NIH-based researchers, external researchers, and members of the public, can access data within the repository at varying privacy and security levels through the controlled-access process. The SCHARE Repository uses core common data elements to facilitate data aggregation for AI development that optimizes public health scientific knowledge discoveries and generates tools to monitor and improve health outcomes.
Access the SCHARE Data Repository.
STATUS: The SCHARE Collaborative Workspaces are available to all SCHARE-registered researchers.
SCHARE is powered by Terra, an open-source data analysis platform based on Google Cloud Platform. Terra was developed by the Broad Institute of MIT and Harvard in collaboration with Microsoft and Verily.
Using SCHARE’s Terra resources, researchers and their collaborators can access and cross-link the same publicly available or controlled-access data. They can also create secure online spaces for collaboratively running large-scale analyses and sharing reproducible results and resources.
SCHARE supports interactive analysis tools such as Jupyter notebooks. Jupyter notebooks are human-readable executable documents that can be run to perform advanced data analyses, including artificial intelligence and machine learning tasks, using coding languages such as Python and R. The platform also supports Dockstore as a repository for Docker-based analysis workflows that allow users to automate basic steps in their analyses.
STATUS: The SCHARE-HEAN NAIRR Pilot Project is active.
The National Artificial Intelligence Research Resource (NAIRR) is a vision for a shared national research infrastructure for responsible discovery and innovation in AI. The NAIRR pilot will run for two years, beginning January 24, 2024. The pilot broadly supports fundamental, translational and use-inspired AI-related research with particular emphasis on societal challenges and use adoption.
To support these efforts, the SCHARE-HEAN (a.k.a. Multiple Chronic Diseases Disparities Research Consortium) Pilot Project forms a unique collaborative relationship between community partners, academia, and SCHARE to use big data and cloud computing data science analytics to improve the prevention, treatment, and management of multiple chronic diseases, such as diabetes, obesity, hypertension, coronary heart disease, congestive heart failure, chronic kidney disease, stroke, and certain cancers. The data warehouse includes chronic disease, ascribed and acquired attributes, and relevant environmental and living conditions data, which are mapped to the SCHARE common data elements to increase data interoperability and are highlighted in Think-a-Thons to democratize data use adoption.
Learn more about the SCHARE-HEAN NAIRR Pilot Project.
Page updated March 12, 2025 | created Jan. 18, 2023