Variant Analysis Workshop
Audience | Computational skills required | Duration |
---|---|---|
Biologists | Introduction to the Command-line | 3 - 2.5 hour sessions |
Description
This Introduction to Variant Analysis workshop is aimed at providing best practices for calling variants for paired normal/tumor datasets. Importantly, while this workshop series does focus on calling variants in the context of paired tumor/normal samples, much of this workshop’s pipeline and discussion is adaptable to other types of variant calling applications. This workshop series should provide participants with an ability to take raw sequence reads and process them into a VCF file with annotated variants. Furthermore, the workshop series ends with a tutorial for visualizing called variants within the Integrative Genomics Viewer (IGV).
These materials were developed for a trainer-led workshop, but are also amenable to self-guided learning.
Pre-Requisite
This workshop series is designed for users with a background in the fundamentals of working in the shell environment on an high-performance computing cluster (HPCC). If you don’t have this background or are unsure if you are ready for this workshop series, you should first complete our Introduction to the Command Line workshop series.
Learning Objectives
- Evaluate QC metrics for variant calling
- Call variants using GATK
- Filter variants to retain only high-quality variant calls
- Annotate variants using SnpEff and dbSNP
- Prioritize variants by their impact
- Visualize variants in IGV
- Implement cBioPortal to analyze variants
Lessons
Installation Requirements
All:
- FileZilla Client (make sure you get ‘FileZilla Client’)
- Integrative Genomics Viewer (IGV)
Mac users: No additional installation requirements.
Windows users: GitBash
These materials have been developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC). These are open access materials distributed under the terms of the Creative Commons Attribution license (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.