Sequence Annotation (SA)

Aims to develop an extensible, common model for describing genomic features

Knowledge associated with specific coordinates on a biological sequence is referred to as a sequence annotation. Subtypes include genome annotations, gene annotations, and reference annotations (among others). Sequence annotations are used to represent concepts such as gene models, alternatively spliced transcripts, and coding regions. Untranslated regions, introns, pseudogenes, regulatory regions, and repetitive sequences are also frequently represented as features mapped to a coordinate system. In addition to serving as a guide for which sequences correspond to which genes and transcripts, sequence annotations are also used to represent the parts of a sequence (e.g. coding sequence, exons, splice junctions, etc.) and how those parts relate to each other.

The Sequence Annotation (SA) study group is evaluating interoperability gaps in:

  • the annotation of biological sequence;
  • the representation of sequence features (i.e. genes, transcripts, exons, etc.);
  • the storage, transformation, and transmission of sequence annotations between annotation providers and users.

The work carried out by the SA study group aims to inform the future development of an extensible, common model for describing specific genomic regions of interest.

 

Jump to...

Benefits

  • Aims to develop a data model for core genomic entities such as gene, transcript, and regulatory region
  • Will allow software developers to extend the model to explore specific needs

Target users

Developers

Community resources

Dive deeper into this product! SA will provide a model for describing genomic features such as transcripts, genes, regulatory elements, and other related regions of interest. Developers can extend the SA model to cover their specific needs with the confidence that the core entities will remain compatible with others using the SA model.


Don't see your name? Get in touch:

  • Evan Christensen
    University of Utah
  • Karen Eilbeck
    University of Utah
  • Robert Freimuth
    Mayo Clinic
  • Kais Ghedira
    Institut Pasteur de Tunis
  • Roderic Guigo
    Centre for Genomic Regulation
  • John Marshall
    Centre for Population Genomics
  • Jean Muller
    Laboratoire de génétique médicale (UMR_S 1112)
  • Francis Ouellette
    McGill University / Université McGill
  • Emilio Righi
    Centre for Genomic Regulation
  • Dmitriy Sonkin
    NIH National Cancer Institute (NCI)
  • David Steinberg
    University of California, Santa Cruz
  • Sean Upchurch
    California Institute of Technology
  • Andy Yates
    EMBL's European Bioinformatics Institute (EBI)
loading...

News, events, and more

Catch up with all news and articles associated with Sequence Annotation (SA).

6 Jan 2025
GKS Work Stream Open House (January 2025)
See more