Attract: Attract Package

(Mar JC, Wells CA, Quackenbush J. Defining an informativeness metric for clustering gene expression data. Bioinformatics. 2011 Apr 15;27(8):1094-100.J. PMID: 21330289.)

What is Attract?

The Attract algorithm identifies the core gene expression modules that are differentially activated between cell types or different sample groups, and elucidates the set of expression profiles which describe the range of transcriptional behavior within each module.

The algorithm assists in understanding expression changes that are specific to cell types or cell fate transitions. A mammalian organism is made up of over 200 types of specialized cells. Each cell type carries out a specific task integral to maintaining homeostasis of the organism. Cell types can vary by morphology, structure, lifespan, functional ability and much more. Different cell or tissue types acquire their diversity by driving differentially coordinated expression patterns through interacting gene networks. The attractor hypothesis proposed by Stuart Kauffman describes how cell fate transitions between cell types occur through coordinated changes in genome-wide gene expression.

The Attract algorithm starts by using downloaded bioconductor chip annotation to identify the set of core pathway modules driving the cell types-specific expression changes. For each of these modules, the algorithm then constructs synexpression groups, which consist of genes with correlated expression profiles. The algorithm is also extrapolates out to identify genes with similar expression profiles which may not be currently annotated in current pathway databases like KEGG.

When should Attract be used

The Attract algorithm can be used for any expression data set where replicate samples (at least three) have been collected for various cell types or experimental groups such as a panel of drug treatments or a time course experiment. The algorithm is not well-suited for data generated under an experimental design that has only one experimental or covariate group, e.g. samples from 100 identical cell lines with no perturbations.

Brief Description of the algorithm

The Attract algorithm can be roughly divided in to four steps:

    1. Find core attractor state pathway modules.
    2. Remove genes that do not show expression changes across the different groups using the R Package LIMMA
    3. Find Synexpression groups. (A synexpression group contains genes that share similar expression profiles across groups)
    -- Use the informativeness metric to compute clusters (synexpression groups) in each geneset.
    4. For every synexpression group in a geneset, find genes in the expression data which have correlated expression profiles.

Figure 1: The initialization dialog
Running the Attract Module

Choose the type of analysis you wish to run. Attract supports two-class analysis for comparison of two groups of samples, or multi-class analysis, for comparison of three or more groups.

Choose an alpha cutoff level. This number is a false discovery rate used to remove flat or uninformative genes.

From the drop-down menu, select the name of the microarray platform your data was collected from. This step is very important.

Click Continue to display the Cluster Selector.

If you have previously created clusters of samples using the Cluster Manager or another MeV module, they will be displayed here, as in Figure 2. Use the drop-down menus on the right-hand side of the Cluster Selector to assign your sample groups to analysis groups for Attract. You can assign more than one sample group to an analysis group. You can also use the “Button Selection” tab shown in Figure 3 to achieve the same. Group1 and Group2 symbolize B-ALL and T-ALL. You can save these grouping using the “Save settings” button. To load saved groupings, use the “Load settings” button. Reset button will clear all your choices. Click the Ok button to run Attract.


Figure 2: The Attract Dialog with cluster selector
Attract Results
The results of the Attract module are found in the Result Tree, on the left-hand side of the Multiple Array Viewer window. The Results Table contains a list ....

Figure 3: Attract Results