-
Notifications
You must be signed in to change notification settings - Fork 1
Home
This document describes the semi-supervised peak calling method and the test dataset.
Please install all necessary tools: SPAN Peak Caller and JBR Genome Browser.
The semi-supervised algorithm consists of the following steps:
- Fit unsupervised SPAN 2.0 statistical data model and peaks
- Upload treatment alignment visualization files and peaks files to JBR.
Most likely, peak-calling results are OK, and there is no need for a semi-supervised procedure. - Create manual markup
- Upload the SPAN model directly into the JBR session
- Perform a supervised learning procedure
Required files:
- Monocyte_H3K4me1_hg38_ENCFF396RZF.bam Histone H3K4me1 ChIP-Seq of Human Monocytes
- Monocyte_Control_hg38_ENCFF328STD.bam ChIP-Seq Control of Human Monocytes
- hg38.chrom.sizes UCSC genome file for hg38.
- Monocyte_H3K4me1_hg38_ENCFF396RZF.bw treatment alignment visualization file
All BAM files were aligned to the hg38 human genome reference.
Additional files:
- Monocyte_H3K4me1_hg38_ENCFF396RZF_q0.05_peaks.narrowPeak peaks produced by MACS2
-
Monocyte_H3K4me1_hg38_ENCFF396RZF_broad0.1_peaks.broadPeak
peaks produces by MACS2
--broad
-
Monocyte_H3K4me1_hg38_ENCFF396RZF-W200-G600-islands-summary-FDR0.01.bed
peaks produced by SICER
Peaks obtained by MACS2 and SICER are given for comparison purposes.
Launch SPAN with the following command line.
java -Xmx8G -jar span.jar analyze -t Monocyte_H3K4me1_hg38_ENCFF396RZF.bam -c Monocyte_Control_hg38_ENCFF328STD.bam --chrom.sizes hg38.chrom.sizes --threads 4 --model Monocyte_H3K4me1_hg38_ENCFF396RZF.span --peaks Monocyte_H3K4me1_hg38_ENCFF396RZF.peak
The model file Monocyte_H3K4me1_hg38_ENCFF396RZF.span
may be later used for semi-supervised peak calling
procedure.
-
Launch JBR Genome Browser. If your current session isn't hg38, create a new hg38 session using
File | New Session...
-
Upload treatment and control files: Monocyte_H3K4me1_hg38_ENCFF396RZF.bw and Monocyte_Control_hg38_ENCFF328STD.bw.
Open files as remote URLsFile | Load URL(s)...
menu action or as local file usingFile | Load BED file...
orFile | Load BigWig / Wig / BigBed file...
.
See JBR Genome Browser documentation for more details. -
Alternatively, you can open preconfigured session file example.yaml as remote file
File | Load URLS(s)
or from local fileFile | Open Session
.
Session contain ChIP-seq signal visualization, semi-supervised markup data and all the peak files mentioned behind.
-
Select an appropriate zoom level to make separate peaks distinguishable from background.
Recommended zoom levels:-
5-10
kbp for TFs and narrow histone modifications -
50-100
kbp for broad histone modifications
-
-
Four supervised annotation types are supported:
- peaks : there is at least one peak in the labeled area
- noPeaks : there are no peaks in the labeled area
- peakStart : exactly one peak starts in the labeled area
-
peakEnd : exactly one peak ends in the labeled area
peakStart and peakEnd are not paired and may refer to different peaks.
-
Enable JBR Genome Browser annotations mode using JBR menu action
SPAN | Peaks Annotation Mode
.
Repeat the following procedure:- Press and hold SHIFT+ALT, use mouse drag-n-drop to select genome region, use ESC to clear selection
- Set the annotation type by clicking four label buttons or pressing the N/P/S/E key on the keyboard
-
You can import and export markup as BED files using
Import
andExport
buttons on the annotator panel.
For details on creating annotation, please refer to JBR documentation.
Once you have SPAN model fitted, load it into JBR using menu action File | Load SPAN model...
-
You can perform learning procedure for all SPAN models loaded in JBR at once by using
SPAN | Tune SPAN models...
action from the main menu.
Alternatively, you can work only with a selected SPAN model track(s), chooseTune SPAN model
from context menu. -
Export peaks to BED file To export peaks for all loaded SPAN model tracks, select
SPAN | Export SPAN Peaks...
action from the main menu and choose the target directory.
Alternatively, export peaks for selected track(s) only: chooseExport SPAN model peaks...
from the context menu.
After you have performed the semi-supervised tuning procedure you can see resulting peaks file statistics including
number of peaks, average length, etc. by selecting track(s) and using About Tracks
action from the context menu.
Should you have any further questions, please do not hesitate to contact us anytime.