Learn how GA4GH helps expand responsible genomic data use to benefit human health.
Learn how GA4GH helps expand responsible genomic data use to benefit human health.
Our Strategic Road Map defines strategies, standards, and policy frameworks to support responsible global use of genomic and related health data.
Discover how a meeting of 50 leaders in genomics and medicine led to an alliance uniting more than 5,000 individuals and organisations to benefit human health.
GA4GH Inc. is a not-for-profit organisation that supports the global GA4GH community.
To guide our collaborative, globe-spanning alliance, GA4GH relies on a Standards Steering Committee and an Executive Committee.
The Funders Forum brings together organisations that offer both financial support and strategic guidance.
The EDI Advisory Group responds to issues raised in the GA4GH community, finding equitable, inclusive ways to build products that benefit diverse groups.
Distributed across four Host Institutions, our staff team supports the mission and operations of GA4GH.
Curious who we are? Meet the people and organisations across six continents who make up GA4GH.
More than 500 organisations connected to genomics — in healthcare, research, patient advocacy, industry, and beyond — have signed onto the mission and vision of GA4GH as Organisational Members.
These core Organisational Members are genomic data initiatives that have committed resources to guide GA4GH work and pilot our products.
This subset of Organisational Members whose networks or infrastructure align with GA4GH priorities has made a long-term commitment to engaging with our community.
Local and national organisations assign experts to spend at least 30% of their time building GA4GH products.
Anyone working in genomics and related fields is invited to participate in our inclusive community by creating and using new products.
Wondering what GA4GH does? Learn how we find and overcome challenges to expanding responsible genomic data use for the benefit of human health.
Study Groups define needs. Participants survey the landscape of the genomics and health community and determine whether GA4GH can help.
Work Streams create products. Community members join together to develop technical standards, policy frameworks, and policy tools that overcome hurdles to international genomic data use.
GIF solves problems. Organisations in the forum pilot GA4GH products in real-world situations. Along the way, they troubleshoot products, suggest updates, and flag additional needs.
NIF finds challenges and opportunities in genomics at a global scale. National programmes meet to share best practices, avoid incompatabilities, and help translate genomics into benefits for human health.
Communities of Interest find challenges and opportunities in areas such as rare disease, cancer, and infectious disease. Participants pinpoint real-world problems that would benefit from broad data use.
See all our products — always free and open-source. Do you work on cloud genomics, data discovery, user access, data security or regulatory policy and ethics? Need to represent genomic, phenotypic, or clinical data? We’ve got a solution for you.
All GA4GH standards, frameworks, and tools follow the Product Development and Approval Process before being officially adopted.
Learn how other organisations have implemented GA4GH products to solve real-world problems.
Help us transform the future of genomic data use! See how GA4GH can benefit you — whether you’re using our products, writing our standards, subscribing to a newsletter, or more.
Help create new global standards and frameworks for responsible genomic data use.
Align your organisation with the GA4GH mission and vision.
Solve your real-world data problems with support from this valuable network of global institutions.
Work with like-minded groups committed to better data use in areas like rare disease, cancer, and infectious disease.
Share your thoughts on all GA4GH products currently open for public comment.
Solve real problems by aligning your organisation with the world’s genomics standards. We offer software dvelopers both customisable and out-of-the-box solutions to help you get started.
Learn more about upcoming GA4GH events. See reports and recordings from our past events.
Speak directly to the global genomics and health community while supporting GA4GH strategy.
Be the first to hear about the latest GA4GH products, upcoming meetings, new initiatives, and more.
Questions? We would love to hear from you.
Read news, stories, and insights from the forefront of genomic and clinical data use.
Attend an upcoming GA4GH event, or view meeting reports from past events.
See new projects, updates, and calls for support from the Work Streams.
Read academic papers coauthored by GA4GH contributors.
Listen to our podcast OmicsXchange, featuring discussions from leaders in the world of genomics, health, and data sharing.
Check out our videos, then subscribe to our YouTube channel for more content.
View the latest GA4GH updates, Genomics and Health News, Implementation Notes, GDPR Briefs, and more.
Discover all things GA4GH: explore our news, events, videos, podcasts, announcements, publications, and newsletters.
26 Oct 2018
The GA4GH Cloud Work Stream has announced version 1 of its Workflow Execution Service (WES) API — a protocol for running the same genomic data analysis in multiple cloud environments.
The GA4GH Cloud Work Stream has announced version 1 of its Workflow Execution Service (WES) API — a protocol for running the same genomic data analysis in multiple cloud environments. The announcement was made at the GA4GH 6th Plenary Meeting in Basel, Switzerland earlier this month.
“WES enables users to define workflows in a standard way, package them up, and then hand them to workflow engines that live in many different places,” said David Glazer, Engineering Director at Verily Life Sciences and co-lead of the Cloud Work Stream. “You should be able to run your workflow wherever you want on whatever data you have and be confident you’ll get the same answer — WES allows you to do just that.”
WES is part of a larger framework to seamlessly bring algorithms to genomic data rather than attempting to transfer that data across national and institutional bounds, which is cumbersome, resource intensive, and limited by regulatory constraints. “Our task as a work stream is to come up with the standards to facilitate the definition of these portable workflows, the sharing of them, and the execution of them,” said Glazer.
Taken as a whole, this framework will have positive impacts for both tool developers and tool users — developers will only need to package their tools once to make them available to the broad community, and researchers will have ready access to more tools as well as the ability to run compatible analyses across data in many places. WES addresses that last step in the framework — the execution of portable workflows in a preferred compute environment using data in a preferred storage environment.
In addition to developing WES, the Cloud Work Stream has also been spending its time since it was founded twelve months ago developing an interoperability testbed to show its framework works. “We want not only to demonstrate that the workflows are interoperable, but also to confirm they are actually successful,” said Brian O’Connor, Consulting Director at the UCSC Computational Genomics Platform and co-lead of the Cloud Work Stream.
The testbed depends on an “orchestrator” that sits between a library of workflows and a series of cloud environments where those workflows can be run. In this role, it is responsible for directly testing workflows in different WES environments. It selects a workflow and then selects a cloud environment that has implemented WES where the analysis will be run. The orchestrator monitors the workflow along the way, spitting out details used by the Work Stream to confirm WES is working in each environment.
To date, four organizations have made their implementations of WES available to the testbed: Veritas Genetics (running on the Microsoft Azure cloud), the Human Cell Atlas (running on the Amazon Web Services cloud), the Broad Institute of MIT and Harvard (running on the Google Cloud Platform), and Illumina. The interoperability testbed shows that WES is working in all four cases by running test workflows from three GA4GH Driver Projects and collaborators — Human Cell Atlas, Australian Genomics, and TOPMed — along with another contributed by the PCAWG project. These workflows were coupled with the test data where the results of the analysis are already known, making it easy to verify workflow portability across sites.
“I can say, ‘Here’s a test set of inputs to run the workflow on; if successful, you should get these outputs’,” said O’Connor. “The orchestrator runs every test against every environment and we can see how well each combination fares.”
Over the coming year, the Cloud Work Stream will work to publish standards that complement WES in making it possible to run any tool on any data: an API for selecting tools (Tool Registry Service, or TRS) and an API for accessing data across multiple clouds (Data Repository Service, or DRS).