Learn how GA4GH helps expand responsible genomic data use to benefit human health.
Learn how GA4GH helps expand responsible genomic data use to benefit human health.
Our Strategic Road Map defines strategies, standards, and policy frameworks to support responsible global use of genomic and related health data.
Discover how a meeting of 50 leaders in genomics and medicine led to an alliance uniting more than 5,000 individuals and organisations to benefit human health.
GA4GH Inc. is a not-for-profit organisation that supports the global GA4GH community.
To guide our collaborative, globe-spanning alliance, GA4GH relies on a Standards Steering Committee and an Executive Committee.
The Funders Forum brings together organisations that offer both financial support and strategic guidance.
The EDI Advisory Group responds to issues raised in the GA4GH community, finding equitable, inclusive ways to build products that benefit diverse groups.
Distributed across four Host Institutions, our staff team supports the mission and operations of GA4GH.
Curious who we are? Meet the people and organisations across six continents who make up GA4GH.
More than 500 organisations connected to genomics — in healthcare, research, patient advocacy, industry, and beyond — have signed onto the mission and vision of GA4GH as Organisational Members.
These core Organisational Members are genomic data initiatives that have committed resources to guide GA4GH work and pilot our products.
This subset of Organisational Members whose networks or infrastructure align with GA4GH priorities has made a long-term commitment to engaging with our community.
Local and national organisations assign experts to spend at least 30% of their time building GA4GH products.
Anyone working in genomics and related fields is invited to participate in our inclusive community by creating and using new products.
Wondering what GA4GH does? Learn how we find and overcome challenges to expanding responsible genomic data use for the benefit of human health.
Study Groups define needs. Participants survey the landscape of the genomics and health community and determine whether GA4GH can help.
Work Streams create products. Community members join together to develop technical standards, policy frameworks, and policy tools that overcome hurdles to international genomic data use.
GIF solves problems. Organisations in the forum pilot GA4GH products in real-world situations. Along the way, they troubleshoot products, suggest updates, and flag additional needs.
NIF finds challenges and opportunities in genomics at a global scale. National programmes meet to share best practices, avoid incompatabilities, and help translate genomics into benefits for human health.
Communities of Interest find challenges and opportunities in areas such as rare disease, cancer, and infectious disease. Participants pinpoint real-world problems that would benefit from broad data use.
Find out what’s happening with up to the minute meeting schedules for the GA4GH community.
See all our products — always free and open-source. Do you work on cloud genomics, data discovery, user access, data security or regulatory policy and ethics? Need to represent genomic, phenotypic, or clinical data? We’ve got a solution for you.
All GA4GH standards, frameworks, and tools follow the Product Development and Approval Process before being officially adopted.
Learn how other organisations have implemented GA4GH products to solve real-world problems.
Help us transform the future of genomic data use! See how GA4GH can benefit you — whether you’re using our products, writing our standards, subscribing to a newsletter, or more.
Help create new global standards and frameworks for responsible genomic data use.
Align your organisation with the GA4GH mission and vision.
Want to advance both your career and responsible genomic data sharing at the same time? See our open leadership opportunities.
Join our international team and help us advance genomic data use for the benefit of human health.
Share your thoughts on all GA4GH products currently open for public comment.
Solve real problems by aligning your organisation with the world’s genomics standards. We offer software dvelopers both customisable and out-of-the-box solutions to help you get started.
Learn more about upcoming GA4GH events. See reports and recordings from our past events.
Speak directly to the global genomics and health community while supporting GA4GH strategy.
Be the first to hear about the latest GA4GH products, upcoming meetings, new initiatives, and more.
Questions? We would love to hear from you.
Read news, stories, and insights from the forefront of genomic and clinical data use.
Attend an upcoming GA4GH event, or view meeting reports from past events.
See new projects, updates, and calls for support from the Work Streams.
Read academic papers coauthored by GA4GH contributors.
Listen to our podcast OmicsXchange, featuring discussions from leaders in the world of genomics, health, and data sharing.
Check out our videos, then subscribe to our YouTube channel for more content.
View the latest GA4GH updates, Genomics and Health News, Implementation Notes, GDPR Briefs, and more.
Discover all things GA4GH: explore our news, events, videos, podcasts, announcements, publications, and newsletters.
20 Apr 2022
The GA4GH April Connect Meeting occurred from 20 to 22 April 2022 to provide opportunities for collaboration across the GA4GH Work Streams, Driver Projects, and the broader community and to support contributors in advancing work on the GA4GH Roadmap. Read more below.
GA4GH Chief Standards Officer Susan Fairley highlighted developments the GA4GH community has achieved since the first Work Stream Firehose session in January 2018, such as progress reported in a special issue of the journal Cell Genomics in November 2021. In order to move toward a future of seamless federated genomic analysis, the next phase of work involves engagement, integration and filling gaps, and building community. The session continued with updates from the GA4GH Equity, Diversity, and Inclusion (EDI) Advisory Group, Federated Analysis Systems Project (FASP), and Work Streams.
GA4GH Steering Committee Chair Ewan Birney chaired the session, which was one in a series of ten open town hall brainstorms to gather input from the community on the focus and directionof GA4GH. Through a gap analysis in 2020 chaired by Heidi Rehm and Andrew Morris, the GA4GH community previously identified three strategic priorities: interoperability and technical alignment, implementation support, and engagement with healthcare and clinical standards. The 2022 strategic refresh sessions provide an opportunity for members to share feedback on GA4GH’s success with these priorities so far, and provide suggestions for improvement. This session focused heavily on promoting implementation around the globe.
Partnering with the EDI Advisory Group and Regulatory and Ethics Work Stream (REWS) Diversity Group, a representative from Aleria led an interactive workshop on how increasing inclusion can boost both diversity and performance within an organisation. The speaker emphasised that inclusion is invisible, with some groups experiencing exclusion much more commonly and acutely than other groups. A review of case studies and a sample measurement of exclusion experiences among attendees highlighted challenges and opportunities for incorporating equity, diversity, and inclusion into GA4GH standards creation.
Implementers of GA4GH Cloud deliverables such as the Workflow Execution Service (WES) and Tool Registry Service (TRS) APIs presented their experiences with real-world use cases. Representatives from Amazon Web Services (AWS), DNAStack, Broad/Cromwell, and Dockstore shared their target users, goals, challenges, and suggestions for GA4GH Cloud APIs. The group then discussed pain points and opportunities for improvement in greater detail. Implementers will next focus on submitting pull requests and contributing to the Cloud Work Stream Roadmap, especially for the next release of WES.
The session began with several brief presentations from current GA4GH testing initiatives, including Acidbio/BED, VCF validator, Refget, RNAget, and the recently approved Beacon v2.0. A demonstration of a testbed infrastructure provided a proof of concept for the unification of GA4GH testing initiatives. The group discussed the next steps for testbed infrastructure, which included sharing building blocks, interacting with journal editors, defining file format accompaniments, and harmonising efforts to reduce workload.
The session chairs walked through plans by two Driver Projects — Variant Interpretation for Cancer Consortium (VICC) and Clinical Genome Resource (ClinGen) — to adopt Genomic Knowledge Standards (GKS) within the framework of a maturity model. The structure and terms of the maturity model are aligned to those seen in the HL7 FHIR specifications, which include three maturity level categories: draft, trial use, or stable. Next, members of the GKS Work Stream aim to draft classes used in the Variation Representation Standard (VRS) (and associated VRSATILE tooling) and Variant Annotation (VA) standard with the maturity model in mind.
GA4GH is currently reviewing and revising the process used to propose, develop, and approve standards. Chief Standards Officer (CSO) Susan Fairly discussed the detail of the updated GA4GH product development and approval document. The modifications include who votes on product approvals, the timelines of approvals, how to make changes to approval decisions, and product documentation. The discussion notes and feedback were recorded using a miro board. The first draft of the new product approval is now open for comments here.
Members of the Genomic Knowledge Standards (GKS) Work Stream discussed the genomic knowledge search problem and proposed a mechanism for representing variation as a categorical concept, or definition. This approach would provide a framework for computable descriptions of variation. Moving forward, the team aims to develop a working community around categorical variation and to draft and vet several categorical variation models.
The Regulatory and Ethics Work Stream (REWS) met to share updates on current activities, hear from Driver Projects, and give a preview of new groups. The Data Access Committee Review Standards (DACReS) team requested feedback via survey, and the Genetic Discrimination group invited members to a working meeting to draft the detailed plan of a short commentary that would define the notion of genetic discrimination. Presentations from the GDPR Forum, Consent Toolkit team, and REWS Diversity group covered recent progress on projects. Several new groups are being launched: in 2022, Diversity in Datasets, Ethical Provenance Toolkit, Public Attitudes for Genomic Policy, Clinical Data Sharing and Consent; and in 2023, Newborn Whole Genome Sequencing. Finally, representatives from Australian Genomics, EUCANCan, the Human Cell Atlas (HCA), and EpiShare gave overviews of their efforts.
The session began with a brief overview of OpenID Connect, OAuth 2.0, JSON Web Tokens, JSON Web Signature, and Claims. The revisions introduced in Passports v1.2 will be minor and will not need approval from the GA4GH Steering Committee (SC). The revision allows Passports to be issued as signed tokens using RFC 8693 “OAuth 2.0 Token Exchange“ mechanism, which will allow mutually trusting systems to pass signed tokens among them. Passport v2.0 will include major edits requiring the SC’s approval. This includes the introduction of a new type of token, a work-order token, which will be limited to specific datasets, used by specified agents only, and small enough to fit into HTTP headers. This new token will also be issued using the OAuth 2.0 Token Exchange mechanism that will be introduced in v1.2.
The Genetic Discrimination (GD) group reached a consensus on a person-centric approach to focus on discrimination from the perspective of those who experience it. There was also a discussion on the need to define genetic discrimination broadly and the need to acknowledge issues of public concerns surrounding GD. The majority of the session focused on preparing an outline for an EDI bioethics publication.
The recently formed Computable Cohort Representation group kicked off a hackathon series to tackle a number of questions, including: how does one represent a set of patient attributes to assemble a cohort from different systems and datasets? How does one formulate the queries? Pain points, such as gaining access, to data which might block the primary goal were also identified. Preparatory work was described. Over the course of several weeks, the team will host five hackathons on key topical areas in order to show an example phenotype across different definitions, specify the phenotype in an executable format, and test across implementations in a federated manner. To learn more about hackathon plans or to get involved, you can read the team’s planning document.
The Data Connect team walked through several realistic datasets to illustrate the ease of sharing data using the Data Connect reference implementation. The ability of data providers to use Data Connect whether or not they have full semantic annotation or harmonized data was shown. This illustrated that Data Connect allows for inclusivity, and a pathway to more advanced levels of harmonization as and when providers are able to take that resource intensive step. Participants discussed issues related to identifiers, and prototype exploration of schema representation on top of Data Connect’s use of json-schema.
Contributors to the Variation Annotation (VA) subgroup of the Genomic Knowledge Standards (GKS) Work Stream presented the current state of and future plans for the VA specification. For the initial release, the group plans to focus on creating a new modelling framework with base profiles (such as pathogenicity or therapeutic response profiles) that basic users can adopt, and that power users can adapt into their own specialised models to share back with the community. A lively discussion addressed the connection between VA and file formats such as VCF. Before GA4GH 10th Plenary, the team plans to draft the framework and base profiles, incorporating VRS Added Tools for Interoperable Loquacious Exchange (VRSATILE) value object descriptors.
Attendees discussed updates to the completed suite of Cloud APIs, which have all been approved and are in maintenance mode. This year, the Data Repository Service (DRS) team aims to release v1.3 and refine integration with the Passports specification. The Workflow Execution Service (WES) API team aims to look into workflow and engine parameters. The Tool Registry Service (TRS) API team is looking at a potential breaking change, to account for more possible representations of a workflow resource. The Task Execution Service (TES) API group is looking at a number of updates to roll into the next release, including tag filtering.
Members of the Genomic Knowledge Standards (GKS) Work Stream and other attendees reviewed outcomes from their previous Connect sessions and developed a roadmap for 2022, with an aim to ratify and adopt a maturity and data versioning model to help current and potential adopters implement the evolving GKS standards. By GA4GH 10th Plenary, the team also hopes to draft a formal VRSATILE specification, which supports organisations who wish to adopt VRS.
Representatives from the Beacon v2 standard, Genomics England, the Autism Sharing Initiative, the NIH Cloud Platform Interoperability Effort (NCPI), and Human Heredity & Health in Africa (H3Africa) presented use cases to the Federated Analysis Systems Project (FASP) group. Participants asked questions, identified patterns, and generated ideas for future demonstrations that build on user interest in improved data access and federated analysis, including in the areas of end-to-end gene panels and federated imputation.
At this session, contributors discussed ways to achieve common approaches to share data and the models used to describe data, to support interoperability. The idea of a schema registry service (similar to the tool registry service developed by the GA4GH Cloud Work Stream, but for identifying and accessing schema instead of tools) was discussed. As a first step, they aim to establish a group to explore this idea, with a view to supporting the Data Connect and Beacon use cases.
The session began with contributors reviewing use cases for DRS + Passports and Discovery + Passports integrations, focusing on how to authenticate API access for various biomedical platforms. The group discussed that in order to solve this, a standardised authorisation is needed. The session was divided into three main categories: current problems that need to be solved, potential solutions, and remaining problems without solutions. The ideas and discussions were tracked using a Miro board.
More genomic datasets are being created in various international healthcare settings, but often in different and fragmented environments — making it difficult to share interoperably between research and healthcare. In this workshop, participants looked at a rare disease proof of concept from the Beyond One Million Genomes Project (B1MG) that used the ELIXIR network. Then they discussed challenges, opportunities, and successes of integrating healthcare and research through technology and operations within their own organisations. The session chairs plan to compile the information into a gap analysis.
During the course of the meeting, 408 individauls (355 virtual and 53 in-person from 35 countries gathered together to advance standards development work and cross-work stream collaborations and projects. To conclude the meeting, GA4GH Chief Standards Officer (CSO) Susan Fairley reported on the progress made during all 19 working sessions and next steps to further the goals of each group.