Gene Set Co-Expression Analysis (GSCA)

The GSCA package can be downloaded here, which contains the R codes for running GSCA and an example data set used in Choi and Kendziorski (2009). Codes and the data can also be loaded from this R image file. Code for a brief example of a DC (Differential Co-expression) analysis with only 3 permutations is given here:

load("GSCA.RData") # if Windows, double-click the "GSCA.RData" icon.
dc.M <- singleDC(data = LungCancer3$data$Michigan, group = c(86, 10),
GSdefList = LungCancer3$info$GSdef, nperm = 3)
str(dc.M)

Once gene sets are identified as significantly DC between two groups, it is naturally of interest to visually display two condition-specific networks to contrast. The plot below is a normal-specific network from the Michigan lung cancer study, followed by the R code used to draw such a plot. Each edge is colored so it corresponds to the gene-gene pairwise Pearson correlation, with correlation values ranging from -1 to 1 are shown in blue to red. Yellow nodes are shown to indicate hypothetical DE genes in this example.

library(gplots) # for bluered()
GS <- LungCancer3$info$GSdef
GSdesc <- LungCancer3$info$Name
setid <- "GO:0019216"
gid <- GS[[setid]]
ss <- c("SERPINA3", "SOD1", "SCAP", "NPC2", "ADIPOQ",
        "PRKAA1", "AGT", "PPARA", "BMP6", "BRCA1")
plotNW(genes = ss,
       cormatrix = cor(t(LungCancer3$data$Michigan[gid, 87:96])),
       node.de = c(rep(1, 5), rep(0, 5)), ncolor = "yellow",
       ecolor = bluered(201), ewidth = 5, gtype = "circo")

More detailed examples are available in the package vignette; further information is available in the manual.

Last Modified June 2009