Single-cell RNA-seq data analysis workshop
| Audience | Computational skills required | Duration |
|---|---|---|
| Biologists | Introduction to R | 3-session online workshop (~7.5 hours of trainer-led time) |
Description
This repository has teaching materials for a hands-on Introduction to single-cell RNA-seq analysis workshop. The workshop teaches participants how to design a single-cell RNA-seq experiment and how to efficiently manage and analyze the data, starting from count matrices. It focuses on using the Seurat package in R/RStudio. Participants need a working knowledge of R or must have completed the Introduction to R workshop.
Note for Trainers: The schedule linked below assumes that learners will spend 3-4 hours between classes reading through selected lessons and completing the exercises in them. The online component of the workshop focuses on additional exercises and discussion/Q&A.
These materials were developed for a trainer-led workshop, but are also amenable to self-guided learning.
Learning Objectives
- Describe best practices for designing a single-cell RNA-seq experiment
- Describe steps in a single-cell RNA-seq analysis workflow
- Use Seurat and associated tools to perform analysis of single-cell expression data, including data filtering, QC, integration, clustering, and marker identification
- Understand practical considerations for performing scRNA-seq, rather than in-depth exploration of algorithm theory
Lessons
Installation Requirements
Applications
Download the most recent versions of R and RStudio for your laptop:
Packages for R
Note 1: Install the packages in the order listed below.
Note 2: All the package names listed below are case sensitive!
Note 3: If you have a Mac with an M1 chip, download and install this tool before installing your packages: https://mac.r-project.org/tools/gfortran-12.2-universal.pkg
Note 4: At any point (especially if you’ve used R/Bioconductor in the past), R may ask in the console whether you want to update old packages, with the prompt Update all/some/none? [a/s/n]:. If you see this, type “a” at the prompt and hit Enter to update them. Updating packages can take quite a bit of time, so please account for that before you start these installations.
Note 5: If you see a message in your console along the lines of “binary version available but the source version is later”, followed by a question, “Do you want to install from sources the package which needs compilation? y/n”, type n for no, and hit enter.
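If you are unsure whether Note 3 applies to your machine, the following minimal sketch reports the operating system and CPU architecture as seen by the running R session. It assumes that a native Apple silicon build of R reports its architecture as "aarch64"; an Intel build of R running on an M1 Mac would report "x86_64" instead, so treat the result as a hint rather than a definitive answer. The variable names are illustrative only.

```r
# Sketch: report whether this R session is a native Apple silicon (arm64) build.
sys_name <- Sys.info()[["sysname"]]  # "Darwin" on macOS
r_arch   <- R.version$arch           # assumed to be "aarch64" for arm64 builds of R

is_apple_silicon <- identical(sys_name, "Darwin") &&
  grepl("aarch64|arm64", r_arch)

cat("Operating system:", sys_name, "\n")
cat("R build architecture:", r_arch, "\n")
cat("Native Apple silicon build of R:", is_apple_silicon, "\n")
```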
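(1) Install the 4 packages listed below from Bioconductor using the BiocManager::install() function. BiocManager itself is a CRAN package; if it is not yet installed, run install.packages("BiocManager") first.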
AnnotationHub
ensembldb
multtest
glmGamPoi
Please install them one-by-one as follows:
BiocManager::install("AnnotationHub")
BiocManager::install("ensembldb")
BiocManager::install("multtest")
BiocManager::install("glmGamPoi")
(2) Install the 8 packages listed below from CRAN using the install.packages() function.
tidyverse
Matrix
RCurl
scales
cowplot
BiocManager
Seurat
metap
Please install them one-by-one as follows:
install.packages("tidyverse")
install.packages("Matrix")
install.packages("RCurl")
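install.packages("scales")
install.packages("cowplot")
install.packages("BiocManager")
install.packages("Seurat")
install.packages("metap")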
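If an installation session is interrupted, it can help to see which packages from the two lists above are still missing before continuing. The sketch below only reports; it does not install anything. The helper name not_installed is hypothetical and not part of the workshop materials.

```r
# Package lists copied from steps (1) and (2) above.
bioc_pkgs <- c("AnnotationHub", "ensembldb", "multtest", "glmGamPoi")
cran_pkgs <- c("tidyverse", "Matrix", "RCurl", "scales",
               "cowplot", "BiocManager", "Seurat", "metap")

# Hypothetical helper: return the names in `pkgs` that are not found
# in any library on the current library path.
not_installed <- function(pkgs) {
  setdiff(pkgs, rownames(installed.packages()))
}

cat("Bioconductor packages still to install:", not_installed(bioc_pkgs), "\n")
cat("CRAN packages still to install:", not_installed(cran_pkgs), "\n")
```

An empty list after the colon means every package in that group is already installed.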
(3) Finally, please check that all the packages were installed successfully by loading them one at a time using the library() function.
library(Seurat)
library(tidyverse)
library(Matrix)
library(RCurl)
library(scales)
library(cowplot)
library(metap)
library(AnnotationHub)
library(ensembldb)
library(multtest)
library(glmGamPoi)
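As a complement to the library() calls above, the following sketch tries to load each package and summarizes any failures in one place. It uses requireNamespace(), which loads a package without attaching it, so it is a check only and does not replace the library() calls. The helper name can_load is hypothetical.

```r
# Packages from step (3) above.
workshop_pkgs <- c("Seurat", "tidyverse", "Matrix", "RCurl", "scales", "cowplot",
                   "metap", "AnnotationHub", "ensembldb", "multtest", "glmGamPoi")

# Hypothetical helper: TRUE for each package whose namespace can be loaded,
# FALSE if the package is missing or fails to load.
can_load <- function(pkgs) {
  vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
}

status <- can_load(workshop_pkgs)
failed <- names(status)[!status]

if (length(failed) == 0) {
  cat("All workshop packages loaded successfully.\n")
} else {
  cat("Could not load:", paste(failed, collapse = ", "), "\n")
}
```

Any package named in the "Could not load" message is worth reinstalling and then loading individually with library() to see the full error.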
(4) Once all packages have been loaded, run sessionInfo().
sessionInfo()
Citation
To cite material from this course in your publications, please use:
Mary Piper, Meeta Mistry, Jihe Liu, William Gammerdinger, & Radhika Khetani. (2022, January 6). hbctraining/scRNA-seq_online: scRNA-seq Lessons from HCBC (first release). Zenodo. https://doi.org/10.5281/zenodo.5826256.
A lot of time and effort went into the preparation of these materials. Citations help us understand the needs of the community, gain recognition for our work, and attract further funding to support our teaching activities. Thank you for citing this material if it helped you in your data analysis.
These materials have been developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC) RRID:SCR_025373. These are open access materials distributed under the terms of the Creative Commons Attribution license (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.