Skip to content

Latest commit

 

History

History
22 lines (18 loc) · 1.22 KB

README.md

File metadata and controls

22 lines (18 loc) · 1.22 KB

HRT841 Surrogate Variable Analysis

File Descriptions:

  • SVA_Entire_Dataset.Rmd: Surrogate Variable Analysis performed on the entire RNA-seq expression matrix (TPM units).
  • CSS844_SVA_Group_PlottingCleanedMatrices.ipynb: Plotting expression matrices (TPM) that have been 'cleaned' of Surrogate Variable effects (generated from - -mdr_try_sva_v13.Rmd) using PCA, MDS, and tSNE.
  • rPCA.R: Outlier removal using Robust Principle Component Analysis from the entire RNA-seq expression matrix before cleaning.
  • fetch_bioproject.sh: Fetch corresponding BioProject number from an NCBI SRA Accession using Entrez Direct E-Utilities on the command line.

Data & Plots:

Requirements

  1. SVA R package
  2. limma R package
  3. ggplot2 R package
  4. reshape R package
  5. rrcov R package
  6. DESeq2 R package
  7. Tidyverse R package