This repository contains an R script for performing RNA-Seq data analysis, including data preprocessing, differential expression analysis, visualization, and functional enrichment. The script utilizes popular R packages such as edgeR, limma, and org.Mm.eg.db for efficient and comprehensive analysis.
- Reads count data and sample information.
- Filters low-expressed genes and normalizes counts.
- Library size bar plots.
- Multidimensional scaling (MDS) plots.
- Heatmaps of highly variable genes.
- Boxplots of log CPM values.
- Linear modeling using
voomfromlimma. - Generalized linear models (GLM) using
edgeR. - Identification of significant differentially expressed genes (DEGs).
- MD plots for DEGs.
- Heatmaps of top DEGs.
- Venn diagrams for comparison of conditions.
- Gene Ontology (GO) analysis.
- KEGG pathway analysis.
- Top enriched pathways and biological processes.
The following R packages are required to run the script:
To install these packages, run:
install.packages(c("RColorBrewer", "gplots", "tidyverse"))
BiocManager::install(c("edgeR", "limma", "org.Mm.eg.db"))