Skip to content

Commit 65e1bcd

Browse files
committed
Some explanation of RLE and NUSE
1 parent 713650a commit 65e1bcd

File tree

4 files changed

+187
-447
lines changed

4 files changed

+187
-447
lines changed

affymetrix.Rmd

+10-1
Original file line numberDiff line numberDiff line change
@@ -166,7 +166,7 @@ mva.pairs(pm(raw)[,5:8],plot.method="smoothScatter")
166166

167167

168168
******
169-
### Based on all the plots you have generated, what would you conclude about the overall quality of this experiment?
169+
Based on all the plots you have generated, what would you conclude about the overall quality of this experiment?
170170
******
171171

172172

@@ -180,6 +180,15 @@ an experiment in a robust way.
180180
The output is a matrix of residuals, or weights for each chip which can be used as an additional diagnostic; systematically high residuals across an entire array, or a large fraction of an array is indicative of an outlier array. The main use of this tool is in deciding whether or
181181
not to keep an array in the down-stream data analysis.
182182

183+
### Relative Log Expression (RLE)
184+
185+
The Relative Log Expression (RLE) values are computed by calculating for each probe-set the ratio between the expression of a probe-set and the median expression of this probe-set across all arrays of the experiment. It is assumed that most probe-sets are not changed across the arrays, so it is expected that these ratios are around 0 on a log scale. The boxplots presenting the distribution of these log-ratios should then be centered near 0 and have similar spread. Other behavior would be a sign of low quality.
186+
187+
188+
### Normalized Unscaled Standard Error (NUSE)
189+
190+
The Normalized Unscaled Standard Error (NUSE) is the individual probe error fitting the Probe-Level Model (the PLM models expression measures using a M-estimator robust regression). The NUSE values are standardized at the probe-set level across the arrays: median values for each probe-set are set to 1. The boxplots allow checking (1) if all distributions are centered near 1 (typically an array with a boxplot centered around 1.1 shows bad quality) and (2) if one array has globally higher spread of NUSE distribution than others, which may also be a sign of low quality.
191+
183192
```{r}
184193
library(affyPLM)
185194
plmset <- fitPLM(raw)

affymetrix.nb.html

+111-102
Large diffs are not rendered by default.

public-data.Rmd

+3-1
Original file line numberDiff line numberDiff line change
@@ -79,7 +79,7 @@ mydata[[1]]
7979
remotefile <- 'ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE1nnn/GSE1729/matrix/GSE1729_series_matrix.txt.gz'
8080
destfile <- "data/GSE1729_series_matrix.txt.gz"
8181
if(!file.exists(destfile)) download.file(remotefile, destfile)
82-
mydata <- getGEO(filename="data/GSE1729_series_matrix.txt.gz")
82+
mydata<- getGEO(filename="data/GSE1729_series_matrix.txt.gz")
8383
mydata
8484
```
8585

@@ -94,6 +94,8 @@ The structure of the `mydata` object should be quite familiar to you. The expres
9494
```{r}
9595
exprs(mydata)[1:5,1:5]
9696
summary(exprs(mydata)[,1:5])
97+
mva.pairs(exprs(mydata)[,1:4],log.it = TRUE)
98+
pData(mydata)
9799
```
98100

99101
We can visualise the expression values for a particular gene by knowing which row of the expression matrix it is found in.

public-data.nb.html

+63-343
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)