isoformSwitchAnalysisPart2 {IsoformSwitchAnalyzeR} | R Documentation |
This high-level function adds the results of the extrenal sequence analysis supplied (if any), then proceeds to annotate intron ration. Then functional consequences of the isoform switches are identified and isoform switch analysis plots are created for the top n
isoform switches. Lastly a plot summarizing the functional consequences is created. This function is meant to be used after isoformSwitchAnalysisPart1 have been used.
isoformSwitchAnalysisPart2( switchAnalyzeRlist, alpha=0.05, dIFcutoff = 0.1, n=NA, codingCutoff, removeNoncodinORFs, pathToCPATresultFile=NULL, pathToPFAMresultFile=NULL, pathToSignalPresultFile=NULL, consequencesToAnalyze=c('intron_retention','coding_potential','ORF_seq_similarity','NMD_status','domains_identified','signal_peptide_identified'), pathToOutput=getwd(), fileType='pdf', asFractionTotal=FALSE, outputPlots=TRUE, quiet=FALSE )
switchAnalyzeRlist |
The |
alpha |
The cutoff which the (callibrated) fdr correct p-values must be smaller than for calling significant switches. Defualit is 0.05. |
dIFcutoff |
The cutoff which the changes in (absolute) isoform usage must be larger than before an isoform is considered switching. This cutoff can remove cases where isoforms with (very) low dIF values are deemed signicant and thereby included in the downstream analysis. This cutoff is analogours to having a cutoff on log2 fold change in a normal differential expression analysis of genes to ensure the genes have a certain effect size. Default is 0.1 (10%). |
n |
The number of top genes (after filtering and sorted according to |
codingCutoff |
Numeric indicating the cutoff used by CPAT for distinguishing between coding and non-coding transcripts. The cutoff is dependent on species analyzed. Our analysis suggest that the optimal cutoff for overlapping coding and noncoding isoforms are 0.725 for human and 0.721 for mouse - HOWEVER the suggested cutoffs from the CPAT article (see references) derived by comparing known genes to random non-coding regions of the genome is 0.364 for human and 0.44 for mouse. |
removeNoncodinORFs |
A logic indicating wether to remove ORF information from the isoforms which the CPAT analysis classifies as non-coding. This can be particular useful if the isoform (and ORF) was predicted de-novo but is not recommended if ORFs was imported from a GTF file. This will affect all downstream analysis and plots as both analysis of domains and signal peptides requires that ORFs are annotated (e.g. analyzeSwitchConsequences will not consider the domains (potentially) found by Pfam if the ORF have been removed). |
pathToCPATresultFile |
Path to the CPAT result file. If the webserver is used please dowload the tab-delimited file from the bottom of the result page and give that as input, else simply supply the result file. |
pathToPFAMresultFile |
Path to the PFAM result file. If the webserver is used you need to copy paste the result part of the mail you get into a empty plain text document (notepad, sublimetext TextEdit or similar (aka not word)) and save that to a file. This file is then used as input to the function. |
pathToSignalPresultFile |
Path to the summary SignalP result file. If using the web-server the results should be copy pasted into a empty plain text document (notepad, sublimetext TextEdit or similar (aka not word)) and save that to a file. This file is then used as input to the function. |
consequencesToAnalyze |
A vector of strings indicating what type of functional consequences to analyze. Do note that there is bound to be some differences beteen transcripts (else there would be identical). See details in analyzeSwitchConsequences for full list of usable strings and their meaning. Default is c('intron_retention','coding_potential','ORF_seq_similarity','NMD_status','domains_identified','signal_peptide_identified') (corresponding to analyze: intron retention, CPAT result, ORF AA sequence similarity, NMD status, PFAM domains annotated and signal peptides annotated by Pfam). |
pathToOutput |
A path to the folder in which the plots should be made. Default is working directory ( getwd() ). |
fileType |
A string indicating which file type is generated. Available options are \'pdf\' and \'png\'. Default is pdf. |
asFractionTotal |
A logic indicating whether the number of consequences should be calculated as numbers (if FALSE) or as a fraction of the total number of switches in the plot summarizing general consequences of all the isoform swithces. Default is FALSE. |
outputPlots |
A logic indicating whether all isoform switches as well as the summary of functional consequences should be ouputted in the directory specified by |
quiet |
A logic indicating whether to avoid printing progress messages (incl. progress bar). Default is FALSE |
This function performs the second part of a Isoform Analysis Workflow by:
Adding external sequence analysis (see analyzeCPAT, analyzePFAM and analyzeSignalP)
Predict functional consequences of switching (see analyzeSwitchConsequences)
Output Isoform Switch Consequence plots for all genes where there is a significant isoform switch (see switchPlot)
Output a visualization of general consequences of isoform switches.
This function ouputs
The supplied switchAnalyzeRlist
now annotated with all the analysis described above
One folder per comparison of condition containing the isoform switch analysis plot of all genes with significant isoforms switches
A plot summarizing the overall consequences off all the isoform switchces.
Kristoffer Vitting-Seerup
Vitting-Seerup et al. The Landscape of Isoform Switches in Human Cancers. Mol. Cancer Res. (2017).
analyzeCPAT
analyzePFAM
analyzeSignalP
analyzeAlternativeSplicing
extractSwitchSummary
analyzeSwitchConsequences
switchPlotTopSwitches
data("exampleSwitchListIntermediary") exampleSwitchListAnalyzed <- isoformSwitchAnalysisPart2( switchAnalyzeRlist = exampleSwitchListIntermediary, dIFcutoff = 0.4, # Set high for short runtime in example data pathToCPATresultFile = system.file("extdata/cpat_results.txt", package = "IsoformSwitchAnalyzeR"), pathToPFAMresultFile = system.file("extdata/pfam_results.txt", package = "IsoformSwitchAnalyzeR"), pathToSignalPresultFile = system.file("extdata/signalP_results.txt", package = "IsoformSwitchAnalyzeR"), codingCutoff = 0.364, removeNoncodinORFs = TRUE, # Because ORF was predicted de novo outputPlots = FALSE # keeps the function from outputting the plots from this example )