Gene Set Variation Analysis


Gene Set Variation Analysis (GSVA) is a non-parametric, unsupervised method which estimates the relative enrichment of a gene set of interest across a sample population. Hence, it allows us to observe the variation in the activity of a set of genes, such us a pathway or a gene signature, corresponding to a particular biological condition, within an entire gene expression set. It produces a value, termed enrichment score (ES), per sample and gene set, which can be examined for associations with clinical features of interest.


BMC Bioinformatics. 2013 Jan 16;14:7. doi: 10.1186/1471-2105-14-7.
GSVA: gene set variation analysis for microarray and RNA-seq data.
Hanzelmann S, Castelo R, Guinney J

This implementation of GSVA, contains two preloaded, publicly available gene expression datasets, which the user can select through the 'Choose a dataset' drop down menu on the top left. The link, below the drop down menu opens the Gene Expression Omnibus (GEO) page, with detailed desciption of the data set in question.

In addition the application holds a collection of 103 signatures, related to immune cells, immunological responses and conditions. The user can choose a signature of interest and explore ES scores distribution, through the "Select Signature/variable to plot" drop down menu, on the right of the BoxPlot tab. The "select cohort column" drop down, allows the choice of the condition of interest. At the Plot tab, the user can visualize the correlation between signature ES scores and/or numerical variables.

Importantly, users can provide their own signature typing in or copying a set of gene symbols, separated by spaces, in the Input gene list box. In the current GSVA implementation, the application only works with gene symbols.


In conclussion:

  • The BoxPlot tab allows visual exploration of ES scores, per user selected condition and provides the user with statistical information, for between group comparisons.
  • The Plot tab allows the user to visualise the correlation between enrichment scores of signatures, as well as correlation with available numerical clinical variables.
  • The table tab allows the user to download the enrichment score table for further analysis.
  • The Signatures tab provides the list of signatures used here, along with additional information and links to the publication, from where each signature was acquired.