Data-projects-with-R-and-GitHub

Introduction to the topic

Salvador et al. (2023) published a research article on age-dependent immune and lymphatic responses following spinal cord injury, with the aim of better understanding the lifelong disabling effects that can result from such severe injuries (Publication).

The study investigates how the immune and lymphatic systems respond to spinal cord injuries in young and old mice. The researchers used single-cell RNA sequencing (scRNA-seq), a technique that studies the activity of individual cells by measuring which genes are turned on in each cell.

Data

The data is publicly available on the Gene Expression Omnibus (GEO) under the accession number GSE205038. The pre-analyzed data set mmc4.xlsx is already provided on GitHub (Data file). To give you an idea of what the data looks like:

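| gene  | logFC    | p_adjust | ID         | Description                         | GeneRatio | BgRatio   | pvalue | p_adjust_2 |
|-------|----------|----------|------------|-------------------------------------|-----------|-----------|--------|------------|
| Lars2 | 1.705496 | 0        | GO:0045766 | positive regulation of angiogenesis | 11/109    | 188/23328 | 0      | 3.2e-06    |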
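As a rough orientation, the preview row can be rebuilt as an R data frame to see which columns are numeric and which are text. This is only a sketch: the single row is copied from the preview, `ratio_to_numeric` is a hypothetical helper name, and the comment about loading the file with `readxl::read_excel()` assumes the readxl package is installed and that the path and sheet match your copy of mmc4.xlsx.

```r
# One row copied from the preview; the real file contains many more rows.
# Assumption: with readxl installed, the full table could be loaded with
#   readxl::read_excel("mmc4.xlsx")
# (file path and sheet layout are not confirmed here).
preview <- data.frame(
  gene = "Lars2",
  logFC = 1.705496,
  p_adjust = 0,
  ID = "GO:0045766",
  Description = "positive regulation of angiogenesis",
  GeneRatio = "11/109",
  BgRatio = "188/23328",
  pvalue = 0,
  p_adjust_2 = 3.2e-06,
  stringsAsFactors = FALSE
)

# Hypothetical helper: turn "k/n" strings into numeric fractions.
ratio_to_numeric <- function(x) {
  parts <- strsplit(x, "/", fixed = TRUE)
  vapply(parts, function(p) as.numeric(p[1]) / as.numeric(p[2]), numeric(1))
}

preview$gene_ratio_num <- ratio_to_numeric(preview$GeneRatio)
preview$bg_ratio_num <- ratio_to_numeric(preview$BgRatio)

# Show column types and values.
str(preview)
```

Note that GeneRatio and BgRatio arrive as text such as "11/109", so they need a conversion like this before they can be used on a numeric axis.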

Data manipulation

Now let’s take a look at the differentially expressed genes (DEGs).
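One possible way to start is to label each gene as up- or downregulated based on `logFC` and `p_adjust`. The sketch below uses base R on toy data: only the Lars2 row comes from the preview, while GeneA to GeneC and their values are made up. The cutoffs (|logFC| > 1, adjusted p-value < 0.05) are common conventions and an assumption here, not values taken from the publication.

```r
# Toy data: only the Lars2 row is real, the other rows are made up.
degs <- data.frame(
  gene = c("Lars2", "GeneA", "GeneB", "GeneC"),
  logFC = c(1.705496, -2.1, 0.3, 1.2),
  p_adjust = c(0, 0.001, 0.2, 0.6),
  stringsAsFactors = FALSE
)

lfc_cutoff <- 1      # assumed fold-change threshold
padj_cutoff <- 0.05  # assumed significance threshold

# Label every gene; anything failing either cutoff stays "not significant".
degs$regulation <- "not significant"
degs$regulation[degs$p_adjust < padj_cutoff & degs$logFC > lfc_cutoff] <- "up"
degs$regulation[degs$p_adjust < padj_cutoff & degs$logFC < -lfc_cutoff] <- "down"

# Keep the significant genes, strongest absolute change first.
significant <- degs[degs$regulation != "not significant", ]
significant <- significant[order(-abs(significant$logFC)), ]

print(significant)
print(table(degs$regulation))
```

Keeping the label as its own column, rather than filtering straight away, means the same data frame can later be used to color a plot.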

Now we will focus on the GO terms to get a rough idea of which biological processes, rather than individual genes, are upregulated.
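A minimal sketch of this step, assuming the GO columns look like the preview (`ID`, `Description`, `GeneRatio`, `p_adjust_2`): drop duplicate rows, then rank the terms by adjusted p-value. Only the GO:0045766 row is taken from the preview. The two placeholder rows are made up and deliberately do not use real GO identifiers.

```r
# Toy GO table: the GO:0045766 row is real, the placeholder rows are made up.
go <- data.frame(
  ID = c("GO:0045766", "GO:placeholder1", "GO:placeholder2", "GO:0045766"),
  Description = c(
    "positive regulation of angiogenesis",
    "made-up term A",
    "made-up term B",
    "positive regulation of angiogenesis"
  ),
  GeneRatio = c("11/109", "5/109", "20/109", "11/109"),
  p_adjust_2 = c(3.2e-06, 0.04, 1e-08, 3.2e-06),
  stringsAsFactors = FALSE
)

# Remove exact duplicate rows so every term appears once.
go_terms <- unique(go)

# Smallest adjusted p-value first.
go_terms <- go_terms[order(go_terms$p_adjust_2), ]

# Keep the strongest terms; the number 2 is arbitrary for this toy example.
top_terms <- head(go_terms, 2)
print(top_terms[, c("ID", "Description", "p_adjust_2")])
```

On the real data you would probably keep more than two terms. How many are useful depends on how crowded the resulting plot becomes.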

Data visualization

The data wrangling and the volcano plot will likely take up most of your time, so treat the dot plot as an optional extra for when you still have time to spare.
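To show the general shape of a volcano plot, here is a minimal base R sketch on toy data (only the Lars2 row comes from the preview; the other genes and values are made up). The cutoffs and colors are assumptions. One detail worth noting: the preview contains an adjusted p-value of exactly 0, and -log10(0) is infinite, so the sketch replaces zeros with the smallest non-zero value before transforming. That replacement is one simple workaround, not necessarily the best one for your analysis.

```r
# Toy data: only the Lars2 row is real, the other rows are made up.
volcano <- data.frame(
  gene = c("Lars2", "GeneA", "GeneB", "GeneC"),
  logFC = c(1.705496, -2.1, 0.3, 1.2),
  p_adjust = c(0, 0.001, 0.2, 0.6),
  stringsAsFactors = FALSE
)

# -log10(0) is Inf and cannot be drawn, so cap zeros at the smallest
# non-zero adjusted p-value found in the data.
min_nonzero <- min(volcano$p_adjust[volcano$p_adjust > 0])
volcano$p_plot <- ifelse(volcano$p_adjust == 0, min_nonzero, volcano$p_adjust)
volcano$neg_log10_p <- -log10(volcano$p_plot)

# Assumed cutoffs: |logFC| > 1 and adjusted p-value < 0.05.
is_sig <- volcano$p_adjust < 0.05 & abs(volcano$logFC) > 1
point_col <- ifelse(!is_sig, "grey60",
                    ifelse(volcano$logFC > 0, "firebrick", "steelblue"))

plot(volcano$logFC, volcano$neg_log10_p,
     col = point_col, pch = 19,
     xlab = "logFC", ylab = "-log10(adjusted p-value)",
     main = "Volcano plot (toy data)")

# Dashed guide lines at the assumed cutoffs.
abline(v = c(-1, 1), h = -log10(0.05), lty = 2)

# Label only the significant genes.
text(volcano$logFC[is_sig], volcano$neg_log10_p[is_sig],
     labels = volcano$gene[is_sig], pos = 3)
```

The same structure carries over to ggplot2 if you prefer it: one column for the x axis, one for the transformed p-value, and one for the color group.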

I’m looking forward to your solutions! Good luck!