About us
Learn how GA4GH helps expand responsible genomic data use to benefit human health.
Learn how GA4GH helps expand responsible genomic data use to benefit human health.
Our Strategic Road Map defines strategies, standards, and policy frameworks to support responsible global use of genomic and related health data.
Discover how a meeting of 50 leaders in genomics and medicine led to an alliance uniting more than 5,000 individuals and organisations to benefit human health.
GA4GH Inc. is a not-for-profit organisation that supports the global GA4GH community.
The GA4GH Council, consisting of the Executive Committee, Strategic Leadership Committee, and Product Steering Committee, guides our collaborative, globe-spanning alliance.
The Funders Forum brings together organisations that offer both financial support and strategic guidance.
The EDI Advisory Group responds to issues raised in the GA4GH community, finding equitable, inclusive ways to build products that benefit diverse groups.
Distributed across a number of Host Institutions, our staff team supports the mission and operations of GA4GH.
Curious who we are? Meet the people and organisations across six continents who make up GA4GH.
More than 500 organisations connected to genomics — in healthcare, research, patient advocacy, industry, and beyond — have signed onto the mission and vision of GA4GH as Organisational Members.
These core Organisational Members are genomic data initiatives that have committed resources to guide GA4GH work and pilot our products.
This subset of Organisational Members whose networks or infrastructure align with GA4GH priorities has made a long-term commitment to engaging with our community.
Local and national organisations assign experts to spend at least 30% of their time building GA4GH products.
Anyone working in genomics and related fields is invited to participate in our inclusive community by creating and using new products.
Wondering what GA4GH does? Learn how we find and overcome challenges to expanding responsible genomic data use for the benefit of human health.
Study Groups define needs. Participants survey the landscape of the genomics and health community and determine whether GA4GH can help.
Work Streams create products. Community members join together to develop technical standards, policy frameworks, and policy tools that overcome hurdles to international genomic data use.
GIF solves problems. Organisations in the forum pilot GA4GH products in real-world situations. Along the way, they troubleshoot products, suggest updates, and flag additional needs.
GIF Projects are community-led initiatives that put GA4GH products into practice in real-world scenarios.
The GIF AMA programme produces events and resources to address implementation questions and challenges.
NIF finds challenges and opportunities in genomics at a global scale. National programmes meet to share best practices, avoid incompatabilities, and help translate genomics into benefits for human health.
Communities of Interest find challenges and opportunities in areas such as rare disease, cancer, and infectious disease. Participants pinpoint real-world problems that would benefit from broad data use.
The Technical Alignment Subcommittee (TASC) supports harmonisation, interoperability, and technical alignment across GA4GH products.
Find out what’s happening with up to the minute meeting schedules for the GA4GH community.
See all our products — always free and open-source. Do you work on cloud genomics, data discovery, user access, data security or regulatory policy and ethics? Need to represent genomic, phenotypic, or clinical data? We’ve got a solution for you.
All GA4GH standards, frameworks, and tools follow the Product Development and Approval Process before being officially adopted.
Learn how other organisations have implemented GA4GH products to solve real-world problems.
Help us transform the future of genomic data use! See how GA4GH can benefit you — whether you’re using our products, writing our standards, subscribing to a newsletter, or more.
Join our community! Explore opportunities to participate in or lead GA4GH activities.
Help create new global standards and frameworks for responsible genomic data use.
Align your organisation with the GA4GH mission and vision.
Want to advance both your career and responsible genomic data sharing at the same time? See our open leadership opportunities.
Join our international team and help us advance genomic data use for the benefit of human health.
Discover current opportunities to engage with GA4GH. Share feedback on our products, apply for volunteer leadership roles, and contribute your expertise to shape the future of genomic data sharing.
Solve real problems by aligning your organisation with the world’s genomics standards. We offer software dvelopers both customisable and out-of-the-box solutions to help you get started.
Learn more about upcoming GA4GH events. See reports and recordings from our past events.
Speak directly to the global genomics and health community while supporting GA4GH strategy.
Be the first to hear about the latest GA4GH products, upcoming meetings, new initiatives, and more.
Questions? We would love to hear from you.
Read news, stories, and insights from the forefront of genomic and clinical data use.
Attend an upcoming GA4GH event, or view meeting reports from past events.
See new projects, updates, and calls for support from the Work Streams.
Read academic papers coauthored by GA4GH contributors.
Listen to our podcast OmicsXchange, featuring discussions from leaders in the world of genomics, health, and data sharing.
Check out our videos, then subscribe to our YouTube channel for more content.
View the latest GA4GH updates, Genomics and Health News, Implementation Notes, GDPR Briefs, and more.
Discover all things GA4GH: explore our news, events, videos, podcasts, announcements, publications, and newsletters.
12 Jun 2025
The Global Alliance for Genomics and Health (GA4GH) announces two recent product approvals: Categorical Variation Representation Specification (Cat-VRS) and Variant Annotation Specification (VA-Spec). Both products were developed by members of the Genomic Knowledge Standards (GKS) Work Stream to foster greater precision in the representation of genomic knowledge.
By Jaclyn Estrin, GA4GH Science Writer
The Global Alliance for Genomics and Health (GA4GH) is pleased to announce two newly approved products: the Categorical Variation Representation Specification (Cat-VRS) and Variant Annotation Specification (VA-Spec).
Cat-VRS and VA-Spec are part of a suite of open-source tools developed by the Genomic Knowledge Standards (GKS) Work Stream to standardise how genomic variants are represented and annotated, ranging from specific variants to categories of variants. Implemented together, each product plays a role in improving representation of genomic knowledge to accelerate patient disease diagnosis and foster advancements in patient care.
Currently, the way genomic variants are represented varies across different knowledge bases and clinical sequencing labs. It is, therefore, challenging for researchers to draw connections explaining how genetic diseases manifest with the presence of specific genomic variants.
Because of these challenges, the effort needed to interpret a patient’s genome is extensive, requiring six to ten hours of human expert analysis to produce a final report. This work is an important step in the journey from patient sample to patient care, where samples are taken for genomic sequencing, scientists analyse and interpret the data to determine the genetic variants relevant to the symptoms, and then doctors receive results to diagnose and recommend care options.
There are many points throughout this process that could result in delays. In past decades, it was the genomic sequencing process that was time and resource intensive. Now, with advancements in genomic sequencing technologies, data is being generated at an astronomically greater scale that far outpaces genomic analysts’ bandwidth to interpret it all. This results in a clinical interpretation bottleneck.
Brendan Reardon (Dana-Farber Cancer Institute; Broad Institute of MIT and Harvard), Cat-VRS Product Co-Lead explained, “The number of data points per patient continues to go up over time.” This is especially true if a patient has undergone whole exome or whole genome sequencing. He added, “These are orders of magnitude more mutations that clinical geneticists need to sort through.”
The Categorical Variation Representation Specification (Cat-VRS) is a tool that takes most of the burden of computation and analysis off of the variant analysts. Reardon said, “The more we can do to make their job easier, the faster they will be able to make a clinically relevant decision for each patient.”
Cat-VRS builds off of the GKS Variation Representation Specification (VRS), which establishes a standardised computational language to integrate and share variant knowledge across institutions. Cat-VRS goes one step further by establishing a standard to enable search, analysis, and sharing across categorical properties of variant data.
In clinical genomics research, doctors often look for shared genetic patterns in patients with similar symptoms to guide treatment decisions. These patterns may appear as the same variant in a very specific genomic location, variants at any genomic location in the same gene, or even broader and more complex categories of variation.
“A general process in science is observing trends and then coming up with general rules to explain those trends,” said Daniel Puthawala (The Abigail Wexner Research Institute at Nationwide Children’s Hospital), Cat-VRS Product Co-Lead. He continued, “In genomics, those rules manifest as these categorical variants.”
Alex Wagner (The Abigail Wexner Research Institute at Nationwide Children’s Hospital), Co-Lead of the GKS Work Stream, further explained, “Three to four percent of patients with non-small-cell lung cancer have variants that cause the MET gene to splice out a specific portion of the gene (exon 14), which is used as a biomarker to inform treatment selection. No one genetic variant is responsible. Instead, the collection of all genetic variants that cause this splicing behavior are relevant — a categorical variant.”
Deciding if a variant from a patient belongs to one of these categorical variants from genomic knowledgebases has historically been a highly manual process. Because of the time intensive nature, researchers are having to triage which patients get this type of genomic testing.
Puthawala said, “This is a huge problem, because genomic analysis, particularly in paediatric medicine, is one of the best tools we have for diagnosis, so we need to be able to scale this. Cat-VRS is a data framework that enables computers to procedurally determine how categories of variants are related. This allows researchers to find knowledge about variants of interest faster, alleviating the bottleneck to speed up genomic interpretation.”
Scaling genomic analysis also requires efficient exchange of the knowledge attached to each variant. While VRS and Cat-VRS represent variants and categories of variants, each of these variants is also annotated — or attached — with knowledge. The Variant Annotation Specification (VA-Spec) provides a model to explicitly express knowledge associated with a variant.
This includes both high level, clinically relevant types of knowledge, as well as foundational biological knowledge. Both knowledge types have embedded evidence and provenance information about where the data originated, what methods were used during the annotation process, and how the data was interpreted.
Matthew Brush (The University of North Carolina at Chapel Hill School of Medicine) led the VA-Spec product development, with support from Larry Babb (Broad Institute of MIT and Harvard) and Alex Wagner.
Brush said, “It can be painfully slow to integrate and harmonise knowledge from different sources. The main motivation [for VA-Spec] was the absence of an established computable standard for exchanging variant knowledge and evidence in a way that supported the diverse needs of the research and clinical community, and the challenges that this poses for very important research and clinical enterprises that rely on this type of knowledge.”
VA-Spec helps facilitate clear and efficient exchange of variant knowledge across clinics, institutions, and research systems. Wagner said that VA-Spec encodes knowledge “in a way that allows for communication across different community frameworks…under one interoperable, unified framework that computers understand.”
Brush added, “Having a model that makes that process easier, more efficient, and less prone to misrepresentation and errors in translation will have huge benefits for the community.” The VA-Spec team is currently working to expand the specification, which is built on a generic foundational model, to support additional types of variant knowledge and evidence.
The development of technical standards such as these specifications requires collaboration with implementation partners. Babb said, “Close collaboration with developers and domain experts behind key genomic resources like gnomAD, MaveDB, ClinVar, and CIViC gives high confidence that our standards and tools are both practical and robust, and that shared semantics will enable more accurate, consistent, and reliable use across community pipelines and systems.”
In today’s genomic data ecosystem, where the rate of available scientific knowledge is growing at an exponential pace, researchers have greater access to health data, far beyond the small fraction of information that was previously available. Shared standards for genomic variant representation and annotation provide a foundation to accelerate discovery and application of distributed knowledge resources, enabling researchers to more quickly diagnose patients and deliver faster, more precise care.