Learn how GA4GH helps expand responsible genomic data use to benefit human health.
Learn how GA4GH helps expand responsible genomic data use to benefit human health.
Our Strategic Road Map defines strategies, standards, and policy frameworks to support responsible global use of genomic and related health data.
Discover how a meeting of 50 leaders in genomics and medicine led to an alliance uniting more than 5,000 individuals and organisations to benefit human health.
GA4GH Inc. is a not-for-profit organisation that supports the global GA4GH community.
To guide our collaborative, globe-spanning alliance, GA4GH relies on a Standards Steering Committee and an Executive Committee.
The Funders Forum brings together organisations that offer both financial support and strategic guidance.
The EDI Advisory Group responds to issues raised in the GA4GH community, finding equitable, inclusive ways to build products that benefit diverse groups.
Distributed across four Host Institutions, our staff team supports the mission and operations of GA4GH.
Curious who we are? Meet the people and organisations across six continents who make up GA4GH.
More than 500 organisations connected to genomics — in healthcare, research, patient advocacy, industry, and beyond — have signed onto the mission and vision of GA4GH as Organisational Members.
These core Organisational Members are genomic data initiatives that have committed resources to guide GA4GH work and pilot our products.
This subset of Organisational Members whose networks or infrastructure align with GA4GH priorities has made a long-term commitment to engaging with our community.
Local and national organisations assign experts to spend at least 30% of their time building GA4GH products.
Anyone working in genomics and related fields is invited to participate in our inclusive community by creating and using new products.
Wondering what GA4GH does? Learn how we find and overcome challenges to expanding responsible genomic data use for the benefit of human health.
Study Groups define needs. Participants survey the landscape of the genomics and health community and determine whether GA4GH can help.
Work Streams create products. Community members join together to develop technical standards, policy frameworks, and policy tools that overcome hurdles to international genomic data use.
GIF solves problems. Organisations in the forum pilot GA4GH products in real-world situations. Along the way, they troubleshoot products, suggest updates, and flag additional needs.
NIF finds challenges and opportunities in genomics at a global scale. National programmes meet to share best practices, avoid incompatabilities, and help translate genomics into benefits for human health.
Communities of Interest find challenges and opportunities in areas such as rare disease, cancer, and infectious disease. Participants pinpoint real-world problems that would benefit from broad data use.
See all our products — always free and open-source. Do you work on cloud genomics, data discovery, user access, data security or regulatory policy and ethics? Need to represent genomic, phenotypic, or clinical data? We’ve got a solution for you.
All GA4GH standards, frameworks, and tools follow the Product Development and Approval Process before being officially adopted.
Learn how other organisations have implemented GA4GH products to solve real-world problems.
Help us transform the future of genomic data use! See how GA4GH can benefit you — whether you’re using our products, writing our standards, subscribing to a newsletter, or more.
Help create new global standards and frameworks for responsible genomic data use.
Align your organisation with the GA4GH mission and vision.
Solve your real-world data problems with support from this valuable network of global institutions.
Work with like-minded groups committed to better data use in areas like rare disease, cancer, and infectious disease.
Share your thoughts on all GA4GH products currently open for public comment.
Solve real problems by aligning your organisation with the world’s genomics standards. We offer software dvelopers both customisable and out-of-the-box solutions to help you get started.
Learn more about upcoming GA4GH events. See reports and recordings from our past events.
Speak directly to the global genomics and health community while supporting GA4GH strategy.
Be the first to hear about the latest GA4GH products, upcoming meetings, new initiatives, and more.
Questions? We would love to hear from you.
Read news, stories, and insights from the forefront of genomic and clinical data use.
Attend an upcoming GA4GH event, or view meeting reports from past events.
See new projects, updates, and calls for support from the Work Streams.
Read academic papers coauthored by GA4GH contributors.
Listen to our podcast OmicsXchange, featuring discussions from leaders in the world of genomics, health, and data sharing.
Check out our videos, then subscribe to our YouTube channel for more content.
View the latest GA4GH updates, Genomics and Health News, Implementation Notes, GDPR Briefs, and more.
Discover all things GA4GH: explore our news, events, videos, podcasts, announcements, publications, and newsletters.
28 Sep 2021
The GA4GH 9th Plenary event was held virtually from 28 to 29 September 2021. The meeting brought together 511 participants across 57 countries, featuring 25 sessions on topics including real-world implementations of GA4GH standards, engaging with underrepresented communities, responsible data, COVID data sharing, and more. Read more below.
9th Plenary focused on 4 key themes.
Ewan Birney (EMBL-EBI)
Birney opened GA4GH 9th Plenary with an overview of the meeting, sharing it’s our most geographically diverse meeting to date with 57 countries represented. The meeting would follow four key themes: The GA4GH Story So Far; Collaborate. Innovate. Accelerate.; Responsible Data; and Expanding the Scope & Impact of GA4GH.
Keynote: A new appraisal of genomics at the World Health Organization
Harold Varmus (Weill Cornell Medicine)
In his keynote talk, Varmus discussed approaches to increasing equity in the use of genomics through two examples: 1) advising the WHO on expanding applications of genomics in global health; and 2) Building a research consortium in New York City to study the role of genetic ancestry in cancer risk, progression, outcomes, and more.
Overview & Updates from GA4GH
Peter Goodhand (GA4GH) & Susan Fairley (GA4GH)
Goodhand shared key updates to GA4GH, including the formation of a non-profit to ensure the stability of GA4GH, expansion of staff to accelerate GA4GH’s core work and mission, and launch of a new COVID-19 Interoperability Initiative, and directive grant to help with pandemic preparedness in the future. Next, Chief Standards Officer Susan Fairley provided an update on the technical side of GA4GH and the variety of activities that are occurring to support standards development.
EDI Advisory Group: Building Intentional Communities
Laura Paglione (Spherical Cow Group)
Paglione discussed the work of the GA4GH EDI Advisory Group to date, and how the team has taken an intentional community approach. Work has culminated into three overarching categories—Onboarding, Participation Levels, and Equity by Design—and encouraged the community to get involved and contribute to EDI efforts to effect change.
Monica Munoz-Torres (University of Colorado), Alex Wagner (Nationwide Children’s Hospital), Peter Robinson (Jackson Laboratory)
Munoz-Torres provided an overview of the GA4GH Phenopackets standard, which is now on its v2.0 release. The new version brings significant changes to enable better representation of cancer, common disease, and COVID-19. Munoz-Torres then described efforts to integrate the Variation Represerntation Specification (VRS) and VRSATILE—standards developed by the GA4GH Genomic Knowledge Standards Work Stream—into Phenopackets v2.0,
Jordi Rambla (CRG)
Beacon is a discovery tool aiming to be a “lingua franca.” Rambla shared an overview of Beacon’s development, and how the current version—Beacon v2—can provide more information on sequences and better support clinical genomics research and its application to healthcare.
Sarion Bowers (Wellcome Sanger Institute)
Bowers presented three ways to currently access data—open access, registered access, and controlled access. For sensitive data that is under controlled access, the Passports standard uses digital identities to help Data Access Committees (DACs) review data access requests. In the future, the team aims to combine Passports with other GA4GH standards and to allow for further automation to alleviate burden on DACs.
Max Barkley (DNAstack)
Next, Barkley discussed efforts to integrate Passports standard with the Data Repository Service (DRS) API, which is used for accessing files in the cloud or on remote servers. The team discussed challenges, progress, and next steps to allow for the integration of the two standards.
Pamela Bretscher (Broad Institute)
Bretscher shared an example of DUO in action, in the Broad Institute’s DUOS system. A matching algorithm can instantly determine whether a data access request is within the parameters of its original consents, allowing Data Access Committees to more easily make a final decision on whether to grant access. In the future, DUOS hopes to make use of the Passports Standard and to continually update processes based on the GA4GH DACReS group.
Miro Cupak (DNAstack)
A newly-approved standard, the Data Connect API provides a flexible mechanism for data discovery and search. Cupak shared the key features of the Data API, which allows data custodians to easily describe their data as it sits without being overly prescriptive or going through an intensive transformation process, lowering the barrier to sharing and searching data.
James Bonfield (Wellcome Sanger Institute)
Bonfield presented an update on CRAM, a file format for genomic data compression. Updates were made to v3.1, which reports significant boosts in terms of the speed/size tradeoff of the data file. Challenges and next steps were presented to further improve the standard.
Albert Smith (University of Michigan)
Smith provided an overview of the history of VCF, a tab-delimited format for storing variant calls and individual genotypes. Addressing VCF scalability is next on the radar, with benchmarking being planned to evaluate possible solutions and proposals.
Max Barkley (DNAstack), Ian Fore (NIH), Brian O’Connor (Broad Institute)
Now two years in the works, FASP has shown how GA4GH APIs can be used in concert to enable real-world scientific use cases. The team has brought people together through regular meetings and hackathons to zero in on specific use cases. Barkley provided an update on two key demos—the Vertical and Horizontal demos—and progress made to date.
Jeremy Adams (GA4GH)
Adams presented a new GA4GH initiative, the GA4GH Starter Kit, which serves as a simple, comprehensive suite of open source applications to help new organizations implement or adopt GA4GH standards. Two GA4GH standards—DRS and WES—have starter kits, with more standards coming soon.
Kathryn North (Australian Genomics)
North opened track 2 by sharing more on the Genomics in Health Implementation Forum, a community aimed at putting GA4GH standards into practice and enabling collective learning in an effort to solve the ability to share data across borders. North presented a brief history of GHIF, benefits of participating in GHIF, and how to join.
Zornitza Stark (Australian Genomics)
Stark presented work done through the Acute Care Genomics project, which aims to pilot a national approach for rapid genomic testing for infants admitted into the intensive care unit. The project is using interoperable formats to share the data for research use, establishing electronic workflows, and streamlining clinical data capture. The project aims to accelerate the implementation of genomics within the Australian healthcare system and foster a virtuous cycle between clinical and research genomics.
Mar Gonzalez-Porta (Genome Institute of Singapore)
Gonzalez-Porta discussed Singapore’s Precision Medicine Strategy. Phase 1 is complete, which aimed to sequence 10,000 health Singaporeans and enable clinical adoption through return of results. The team is transitioning now to Phase 2, which aims to build a database of 100K genomes from the local population and link it to corresponding phenotypic data. GA4GH standards such as Beacon and CRAM have aided in their efforts thus far.
Soichi Ogishima (Tohoku University)
Ogishima provided an overview of the GA4GH standards implemented in Japan’s biobank network, including the Data Use Ontology (DUO) and the Machine Readable Consent Guidance (MRCG) for research purpose matching and Phenopackets v2 for clinical data capture and exchange. The teams aims to expand the biobank network and cohort network to enable a responsible data sharing ecosystem between clinical genomics and research.
Kenjiro Kosaki (Keio University)
Kosaki discussed use of Phenopackets in IRUD, where the accumulation of phenotype-genotype data within Phenopackets has proven valuable. Through the IRUD-Exchange data sharing platform, new disorders—such as a new overgrowth syndrome—were able to be identified, along with re-analysis of unresolved cases.
Michelle Johnson-Jennings (University of Washington), Katie Collins (University of Saskatchewan)
Johnson-Jennings and Collins discussed the work of CIEDAR, one of the eight pillars of Canada’s Coronavirus Variants Rapid Response Network (CoVaRR Net). The inclusion of CIEDAR was key to integrating indigenous peoples in the network itself—to build true partnerships after a history of distrust and data mismanagement and to enable effective community health policies during the COVID-19 pandemic.
Karen Miga (UCSC Genomics Institute)
The reference genome was largely derived from the genomic information of a single individual and has historically consisted of data from individuals of European descent. The Human Pangenome Consortium aims to improve the representation of sequence diversity in the human population to build a new reference structure, in order to improve variant detection and clinical outcomes for all people. Miga shared current work to date from the project and the current goal of launching global partnerships to expand representation.
Katherine D. Blizinsky (All of Us Research Program)
Blizinsky discussed the goal of the All of Us research program: to build a longitudinal cohort of deep, rich data on people living in the United States. A key part of this project is diversity and meaningful engagement—balancing participant interests and autonomy with maximizing the use of donated data. Blizinsky shared an update on the reach and engagement of the program, but also emphasized key factors that must continue to be fostered or built, including trust and mechanisms for engagement across barriers.
Hardip Patel (Australian National University)
Patel discussed the inequitable health outcomes that indigenous communities in Australia face today, and emphasized the need to increase diversity in genomics studies to improve clinical outcomes. Underpinning these efforts are the need to respect data sovereignty of indigenous people, including collective benefit and authority to control the data. Building upon these principles, Patel shared current efforts, including community engagement and new studies underway.
Jean-Pierre Hubaux (EPFL)
Hubaux discussed homomorphic encryption as a viable and secure way to carry out federated analysis for health data and precision medicine and streamline the process for conducting large-scale collaborative GWAS studies. Hubaux then shared the MedCO project, a developing system deployed in Swiss hospitals that uses homomorphic encryption as a building block.
Jonathan Dursi (CanDIG)
Dursi shared CanDIG’s responsible data sharing principles for federation design and when building technical solutions, emphasizing that building trust among collaborators is key. Dursi then discussed implementation of these principles and current progress to date across CanDIG’s distributed infrastructure.
Rachele Hendricks-Sturrup (Duke-Margolis Center for Health Policy)
Hendricks-Sturrup discussed the implication of real-world data cross the patient-consumer spectrum, a concept in which healthcare is transitioning into an around-the-clock activity involving the generation, use, and integration of all types of data. This leads to the intersection of regulated and unregulated health data, which must be considered from ethical, legal, and social perspectives so that in the face of health innovation, no one gets harmed.
Anna Lewis (Harvard University)
Lewis discussed the emerging ethical and legal aspects of supporting the return of clinically actionable results back to patients. While return of results is hard, resource-intensive, and in tension with the notion of meeting larger sample sizes, Lewis emphasized the duty to return results and the cultural shift taking place of viewing patients as partners and collaborators in the research endeavor. Lewis discussed the GA4GH approved Return of Results policy as a helpful resource and reference point for managing the return of results.
Tommi Nyronen (CSC-IT Center for Science)
Nyronen presented a background of the EU’s “1 + Million Genomes” project, which aims to bring together 22 EU countries, the UK, and Norway with the goal to have at least 1 million sequenced genomes accessible in the EU by 2022. Next, Nyronen shared an infrastructure proof of concept with a rare diseases use case utilizing GA4GH standard, before discussing next steps on the roadmap.
Alisa Manning (Broad Institute)
Manning discussed the potential for NIH cloud interoperability approaches to improve outcomes of pediatric diseases. The Pediatric Cardiac Genetics Consortium aims to bring together researchers to study data for the first time in the cloud, using health controls from two NHLBI cohorts.
Mamana Mbiyavanga (University of Capetown)
The H3Africa Consortium aims to facilitate African-based research with the goal of improving the health of African populations by building capacity in human and research infrastructure. Mbiyavanga shared the GA4GH standards used to date within the Consortium that are key to building infrastructure on the continent.
Lauren Fromont (Centre for Genomic Regulation)
Fromont discussed the evolution of the Beacon API, from its initial “yes/no” response for data discovery to making the standard more relevant for clinical use. Fromont then shared an example of implementing Beacon v2 in hospitals in Catalunya. The Beacon v2 marks an important step in enabling a robust and global Beacon network.
Alan Christoffels (SANBI)
Presentation Video | Q&A Video
There are many barriers to data sharing in the public health space, from lack of adequate technical solutions to legal and ethical implications. In this keynote talk, Christoffels discussed the role of the Public Health Alliance for Genomic Epidemiology (PHA4GE) in addressing these barriers. Christoffels shared the work of three groups within PHA4GE—the Data Structures Working Group, the Ethics and Data Sharing Working Group, and the Infrastructure Working Group—in facilitating and supporting responsible data sharing during an outbreak.
Ben Neale (Broad Institute)
Neale discussed the aim of the COVID-19 Host Genetic Initiative, which aims to map the human genetic architecture of COVID-19 to understand why a large proportion of individuals are asymptomatic while others experience progressive, even life-threatening consequences. Neale shared findings from several studies on what is known so far, and emphasized the importance of an open community to allow for robust, real-time replication of findings and exchange of results.
Melissa Haendel (University of Colorado Anschutz Medical School)
Data sharing is critical to addressing unanswered questions in COVID-19, from predicting acute COVID-19 severity early in its course to diagnosing and treating long COVID. With the largest public HIPAA-limited dataset in US history, Haendel discussed the role of the National COVID Cohort Collaborative (N3C) to help tackle these topics.
Catalina Lopez-Correa (Genome Canada)
Lopez-Correa discussed the role of CanCOGeN, a Genome Canada-led consortium, in using genomics to better understand and control the COVID-19 pandemic. Lopez-Correa dove into the work of the HostSeq and ViralSeq implementation committees, which aim to sequence and analyze host genomes and viral genomes, respectively. To unlock the power of this data, Lopez-Correa emphasized the importance of data sharing and interoperability to improve clinical outcomes for patients.
Bron Kisler, Clinical Research Data Sharing Alliance
The Clinical Research Data Sharing Alliance (CRDSA) seeks to maximize the value of patient-donated data by ensuring broad secondary use and understanding of high-quality clinical research data. Kisler shared that key to accomplishing this goal is balancing the conflicting priorities of open science and patient privacy. To address this tension, CRDSA is developing collaborative solutions to address these challenges and share best practices in order to bridge the worlds of genomic data sharing and clinical research data sharing.
Marc Williams (Geisinger Health System)
Williams shared the work of the MyCode Community Health Initiative, examining how lack of standards impacts testing, interpretation, and reporting—ultimately impacting the ability to scale health systems. Williams discussed challenges and potential solutions, sharing that the learning health system model is a viable framework for implementing precision health.
Rebecca Truty (Invitae)
Invitae aims to bring comprehensive genetic information into mainstream medicine to improve healthcare. Truty described Invitae’s two core tenets—that patients own their health data and health data is more valuable when shared—and the ways Invitae is supporting these goals. Truty discussed current challenges and potential solutions, emphasizing the importance of standards throughout the process.