Skip to the content.

Integrating RNA-seq and ChIP-seq

As discussed previously, cells use a wide range of mechanisms to regulate gene expression. The increase or decrease in production of specific gene products is dependent on sophisticated programs/machinery, which can change in different cellular conditions.

Evaluating the amount of up- and down-regulation of gene, was something that is explored with RNA-seq (and was performed in Baizabal, 2018). In this ChIP-seq session, we discussed the machinery regulating expression, specifically in the context of transcription factors and chromatin regulators. It is very common to run both of these types of experiments and integrate data from each to derive a better understanding at the molecular level. However, there are not alot of sophisticated methods developed to compae the two types of data in a robust manner.

Direct comparisons

Typically in the literature, one will find studies in which the RNA-seq and ChIP-seq were performed independently and then differentially expressed genes were compared with target genes from the ChIP-seq using a simple overlap measure. These overlapping statistics are complemented with a figure focussing on a panel of select genes. For these genes binding evidence is displayed (using bigWig tracks and/or density profile plots) and a plot of expression changes taken from the RNA-seq data next to it.

This was taken from Gao et. al, Nature 2014, where they investigated the AUTS2 protein, risk factor for ASD and other neurological disorders and its link with PRC1, a key epigenetic regulator.

BETA (Binding and Expression Target Analysis)

An another alternative is a tool called BETA from Shirley Liu’s lab at HMS. BETA is a software package that integrates ChIP-seq of transcription factors or chromatin regulators with differential gene expression data to infer direct target genes. This tool stands out from the other nearest gene analyses as it has potential to find target genes for proteins that bind enhancer regions.

BETA has three functions:

  1. to predict whether the factor has activating or repressive function
  2. to infer the factor’s target genes
  3. to identify the motif of the factor and its collaborators which might modulate the factor’s activating or repressive function.

It has three commands depending on what input data you have and what output you desire.

Image source: Nat Protoc. 2013 Dec; 8(12): 2502–2515.

NOTE: BETA is available as a module on O2.

How does it work?