Introduction to Peak Analysis
Learning Objectives
- Describe peak data and different file formats generated from peak calling algorithms
- Assess various metrics used to assess the quality of peak calls
- Compare peak calls across samples within a dataset
- Create visualizations to evaluate peak annotations
- Evaluate differentially enriched regions between two sample groups
Installations
On your desktop
On your HPCC (if not using Harvard’s O2 cluster)
Required
Nextflow
version 24.11.0-edge
Alternative to Nextflow
samtools
version 1.15.1bedtools
version 2.30.0Picard
version 2.27.5phantompeakqualtools
version 1.2.2deepTools
version 3.5.6bedGraphToBigWig
version 302.1
NOTE: If you are not working on the O2 cluster and are using different versions of these software programs, these packages may still work with the provided commands. However, this workshop was designed on these versions specifically, so you may need to tweak some of the commands if you use different versions of this software.
Lessons
- Workflow overview: From sequenced reads to peaks
- Existing workflows for ChIP-seq analysis
- Understanding peaks and peak file formats
- Assessing peak quality metrics
- Assessing sample similarity and identifying potential outliers
- Concordance across replicates using peak overlaps
- Peak annotation and visualization using ChIPseeker
- Differential enrichment analysis using DiffBind
- Peak visualization using IGV
- Annotation and functional analysis of DE regions
- Motif analysis/discovery
NOTE: If you aren’t working on Harvard’s O2 cluster the directory structure for the HPCC that you are using is likely different and you will need to modify paths to work within your HPCC’s directory structure.
Answer key
These materials have been developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC). These are open access materials distributed under the terms of the Creative Commons Attribution license (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.