Skip to the content.

Understanding chromatin biology using high throughput sequencing workshop

Learning Objectives

Installations

All:

Mac users:

Windows users:

Notes

Instructions for Harvard researchers with access to HMS-RC’s O2 cluster

To run through the code in the lessons below, you will need to be logged into O2 and working on a compute node (i.e. your command prompt should have the word compute in it).

  1. Log in using ssh ecommonsID@o2.hms.harvard.edu and enter your password.
  2. Once you are on the login node, use srun --pty -p interactive -t 0-2:30 --mem 1G /bin/bash to get on a compute node or as specified in the lesson.
  3. Proceed only once your command prompt has the word compute in it.
  4. If you log out between lessons (using the exit command twice), please follow points 1. and 2. above to log back in and get on a compute node when you restart with the self learning.

Lessons

Part 1

  1. Shell basics review
  2. Working in an HPC environment - Review
  3. Best Practices in Research Data Management (RDM)
  4. Dataset overview and Project organization)

Part II

  1. A review of high-throughput sequencing methods for understanding chromatin biology
  2. Experimental design considerations for HTS of chromatin
  3. Quality Control of Sequence Data: Running FASTQC and evaluating results
  4. Alignment using Bowtie2

Part III

  1. Filtering BAM files
  2. Peak calling
  3. Handling peak files using bedtools

Part IV

  1. File formats for peak visualization
  2. Qualitative assessment of peak enrichment using deepTools
  3. Troubleshooting your ChIP-seq analysis

Part V

  1. Automating the ChIP-seq workflow

Answer Keys


Building on this workshop


Resources


These materials have been developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC). These are open access materials distributed under the terms of the Creative Commons Attribution license (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.