################# ## DATA ## in data/ subdirectory ################# 1] 2 data.frames containing all probe set pairs with Q75 >= 55 for both the Affymetrix and the custom-made CDF. Both contain a 'peCC' column which holds the pairs Pearson correlation coefficient calculated on the Tissue dataset (orig from AtgenExpress data). The 'alSum' column contains the Q75 value of the alignment scores of X's reporters to the transcript of Y. The 'meant2hit' column contains the gene locus the probe set was designed to target according to Affymetrix (extracted from ATH1-121501_annot.csv). AffysTissue.rda =============== data.frame with 42826 rows. > colnames(AffysTissue) [1] "pr" "target" "alSum" "meant2hit" "PSofTarget" [6] "peCC" CustomsTissue.rda ================= data.frame with 40094 rows. > colnames(CustomsTissue) [1] "pr" "target" "alSum" "peCC" 2] Same as above but gene pairs that align to each other with BLAST in at least one direction with an E-value smaller than 10^{-10} are omitted AffysTissue.noBl.rda ==================== data.frame with 11361 rows. CustomsTissue.noBl.rda ====================== data.frame with 10702 rows. 3] Same as 1] but a column containing the metacorrelation coefficients added. Only those probe set pairs are included for which is this value =! NA (which is the case when the alignment scores of X's reporters to Y's transcript are all identical -> no correlation coefficient can be calculated). AffysTissueMC.rda ================= data.frame with 42815 rows. > colnames(AffysTissueMC) [1] "pr" "target" "alSum" "meant2hit" "PSofTarget" [6] "peCC" "Mcor" CustomsTissueMC.rda =================== data.frame with 39424 rows. > colnames(CustomsTissueMC) [1] "pr" "target" "alSum" "peCC" "Mcor" 4] Cross-hybridisation examples. All three are S4 objects. To look at the data, must set class first: setClass("XhybExamples", representation(X = "character", Y = "character", IVX = "numeric", IVY = "numeric", IVXi = "matrix", ai = "numeric")) The slots of these objects contain: X : the name of probe set X Y : the name of probe set Y IVX : the expression intensities of X in the Tissue dataset IVY : the expression intensities of Y in the Tissue dataset IVXi : the expression intensities of X's reporters in the Tissue dataset ai : the alignment scores of X's reporters to Y's transcript sequence ex1.rda ======= X: "245875_at" Y: "AT3G28550" ex2.rda ======= X: "250857_at" Y: "AT1G75180" ex3.rda ======= X: "258508_at" Y: "AT3G06650" #################### ## SCRIPTS ## in R/ subdirectory #################### 1] An S4 method to plot S4 objects of 'XhybExamples' objects. This generates a window with 3 plots: - The left one plots the summarised expression patterns of X and Y. - The middle plot shows the background corrected normalised intensity values of X's reporters. The colours used correspond to the alignment strength (ai) of the reporter to Y's transcript sequence. - The plot on the right plots the Pearson correlation coefficient of the reporters' expression patterns to Y's expression pattern in function of their alignment score, ai. plotExample.R ============= 2] 'runSimulation' is a function that will run the simulation we ran for our study. By modifying the parameters, users can play with it themselves. runSimulation.R ===============