The GA4GH Cloud Work Stream facilitates large-scale genomic data analysis by bringing “compute to the data”

19 Aug 2025

GA4GH has developed an animation to demonstrate how GA4GH standards support genomic data analysis across multiple sources, regardless of where or how the data is stored.

Researcher conducting federated analysis across multiple data sources

Advancements in technology have decreased genomic sequencing costs and allowed for the integration of genetic testing into clinical care. This has led to the generation of tens of millions of genomic samples around the world.. When studied in aggregate, this trove of information can lead to valuable insights on the causes of human health and disease.

Yet, analysing datasets from multiple sources all over the world poses technical, jurisdictional, and privacy concerns. Institutions manage data differently, leading to challenges in standardising data analysis approaches across multiple sites. In addition, local data protection regulations can limit access to and movement of data to safeguard patient privacy.

The Global Alliance for Genomics and Health (GA4GH) Cloud Work Stream aims to address these challenges. A newly released animation project shows how GA4GH Cloud standards help researchers conduct large-scale genomic data analysis across multiple sources, regardless of where or how the data is stored.

The animation also depicts the Cloud Work Stream’s guiding philosophy to “bring compute to the data” by defining, sharing, and executing portable workflows across any compute environment. This approach allows for secure analysis of the data in its protected place of origin.

Within the animation, Ana — a fictional cancer researcher — is able to conduct her analysis by remotely “visiting” various datasets with the help of GA4GH Cloud standards. She picks an analysis tool from the Tool Registry Service (TRS), retrieves datasets through the Data Repository Service (DRS), and executes her analysis using the Workflow Execution Service (WES) and Task Execution Service (TES). By using previously hard-to-access datasets to supplement her analysis, Ana is able to learn new insights into cancer.

 

 

To celebrate the launch of the animation, GA4GH hosted a Cloud Work Stream showcase across its social media accounts from 12 to 15 August 2025 to demonstrate the impact of the Work Stream’s standards.

 

 

Interested individuals can learn more and get involved with the Cloud Work Stream. Please reach out to Work Stream Manager Reggan Thomas with any additional questions.

Related Work Streams

Latest News

Researcher conducting federated analysis across multiple data sources
19 Aug 2025
The GA4GH Cloud Work Stream facilitates large-scale genomic data analysis by bringing “compute to the data”
See more
Two doctors studying a patient's DNA
22 Jul 2025
The perceived risks of sharing genomic data with researchers
See more
24 Jun 2025
GA4GH and CRDSA agree to a Strategic Partnership
See more