# Detect alt-splicing (ANOVA)

Alternative splicing results in a single gene coding for multiple protein isoforms, so this task can only be invoked from transcript level data. The algorithm is is based on ANOVA to detect genes with multiple transcripts showing expression changes differently in different biology groups, e.g. a gene has two transcripts: A and B, transcript A is showing up-regulation in the treated group comparing to the control group, while B is showing down regulation in treated group.

* [Alt-splicing ANOVA dialog](#alt-splicing-anova-dialog)
* [ANOVA advanced options](#anova-advanced-options)
* [Alt-splicing ANOVA report](#alt-splicing-anova-report)

## Alt-splicing ANOVA dialog

The alt-splicing dialog is very similar to ANOVA dialog, since the analysis is based on the ANOVA model specified. To setup an ANOVA model, first chose factors from the available sample attributes. The factors can be categorical or numeric attribute(s). Click on a **check box** to select and click **Add factors** button to add a factor to the model (Figure 1).

<div align="left"><figure><img src="https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-5246685e2840e7e9edff7417574367ed76354652%2Falt_splicing%20dialog.png?alt=media" alt=""><figcaption><p>Figure 1. Alt-splicing dialog: selecting factors and alt-splicing factors</p></figcaption></figure></div>

Only one alt-splicing factor needs to be selected from the ANOVA factors. The ANOVA model performed is based on the factors specified in the dialog, while the transcript ID and transcript ID interaction with alt-splicing factor effects are added into the model automatically.

* Transcript ID effect: not all transcripts of a gene are expressed at the same level, so transcript ID is added to the model to account for transcript-to transcript differences.
* Interaction of transcript ID with alt-splicing factor: that effect is used to estimate whether different transcripts have different expression among the levels of the same factor.

Suppose there is an experiment designed to detect transcripts showing differential expression in two tissue groups: liver vs muscle. The alt-splicing ANOVA dialog allows you to specify the ANOVA model that in this analysis is the *Tissue*. The alt-splicing factor is chosen from the ANOVA factor(s), so the alt-splicing factor is also *Tissue* (Figure 1).

The alt-splicing range will limit analysis to genes possessing the number of transcripts in the specified range. Lowering the maximum number of transcripts will increase the speed of analysis.

<div align="left"><figure><img src="https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-ce1d44dea9efeb6c87bbd55e212ffda82090faff%2Falt_splicing%20contrast.png?alt=media" alt=""><figcaption><p>Figure 2. Alt-splicing ANOVA comparisons setup dialog</p></figcaption></figure></div>

Click **Next** to setup the comparisons (Figure 2). The levels (i.e. subgroups) of the alt-splicing factor will be displayed in the left panel; click to select a level name and move it to one of the panels on the right. The fold change calculation on the comparison will use the group in the top panel as the numerator, and the group in the bottom panel as the denominator. Click on **Add comparison** button to add a comparison to the comparisons table.

## ANOVA advanced options

Click on the *Configure* to customize Advanced options (Figure 3).

<div align="left"><figure><img src="https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-9e71e1f5c840619aa11bc37983bb567ae18885b3%2Falt-splicing-adv-dialog.png?alt=media" alt=""><figcaption><p>Figure 3. Configuring advanced options when running alt-splicing ANOVA</p></figcaption></figure></div>

*Low-expression feature* and *Multiple test correction* sections are the same as the matching GSA advanced option, so see [GSA advanced options](https://help.multiomics.illumina.com/partek/partek-flow/user-manual/task-menu/gsa#gsa-advanced-options) discussion.

*Report option*

* *User only reliable estimation results:* There are situations when a model estimation procedure does not fail outright, but still encounters some difficulties. In this case, it can even generate p-value and fold change on the comparisons, but they are not reliable, i.e. they can be misleading. Therefore, the default is set to *Yes*.
* *Data has been log transformed with base*: showing the current scale of the input data on this task.

## Alt-splicing ANOVA report

For this analysis, only genes with more than one transcript will be included in the calculation. The report format is the same as [ANOVA report](https://help.multiomics.illumina.com/partek/partek-flow/user-manual/task-menu/anova-limma-trend-limma-voom#anova-report), each row represent a transcript, and besides statistics information on the specified comparisons, there is also alt-splicing information at the right end of the table. That information is represented by the p-value of interaction of transcript ID with alt-splicing factor. Note that the transcripts of the same gene should have the same p-value. Small p-value indicates significant alt-splicing event, hence the table is sorted based on that p-value by default (Figure 4).

<div align="left"><figure><img src="https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-3a2f03b76a4a9df39d8d42ef3f5af90c17f6a152%2Falt-splicing-report.png?alt=media" alt=""><figcaption><p>Figure 4. Alt-splicing report. Clicking on the column header sorts the table. Panel on the left filters the table</p></figcaption></figure></div>

In the example above (Figure 4), the alt-splicing p-value of gene SLC25A3 is very small which indicates that this gene shows preferential transcript expression across tissues. There are 3 splicing variants of the gene: NM\_213611, NM\_005888 and NM\_002635. Fold change clarifies that NM\_005888 has higher expression in the muscle relative to the liver (negative fold change, liver as the reference category), while NM\_002635 has higher expression in the liver.

To visualize the difference, click on the **Browse to location** icon (Figure 5). The 3rd exon is differentially expressed between NM\_005888 and NM\_002635. Muscle primarily expresses NM\_005888 while liver primarily uses NM\_002635.

<div align="left"><figure><img src="https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-8df7f109dad46cde1d82e54e685e530f90d5469c%2Falt-splicing-chrview.png?alt=media" alt=""><figcaption><p>Figure 5. Isoform proportion track in chromosome view visualizing alternative splicing. Differential usage of exon 3 is highlighted</p></figcaption></figure></div>

## Additional Assistance

If you need additional assistance, please visit [our support page](http://www.partek.com/support) to submit a help ticket or find phone numbers for regional support.
