Code for CLIP data analysis for the paper Control of immediate early gene expression by CPEB4-repressor complex-mediated mRNA degradation by Poetz, Lebedeva et al., Genome Biology, 2022
First, look into config.yml. It should work with default values. However if you have, for example, the hg19 genome you can change the path to it to shorten the runtime and to avoid downloading it again. You can also specify custom paths to the plots and results folders. Default paths are relative to the directory where manuscript_figures.R is.
Next, get the R container and install relevant packages (if you do not have an R installation already).
Presuming that you have no root access and are on linux:
- Install miniconda3: follow the instructions for your system.
You will need to download and run a bash installation script.
bash Miniconda3-latest-Linux-x86_64.sh
- Install singularity in a conda environment and activate it
eval "$($HOME/miniconda3/bin/conda shell.bash hook)"
conda create --name singularity singularity=3.7.1
conda activate singularity
Next steps are done within singularity conda environment:
- Pull the latest R bioconductor container
singularity pull bioc.sif docker://bioconductor/bioconductor_docker:devel
- Install the necessary packages
singularity exec bioc.sif Rscript install_packages.R
- Run the script to produce figures
(It requires memory and time to run.)
singularity exec bioc.sif Rscript manuscript_figures.R
Contains the data encessary to run main R scripts, including bam and bed files of CPEB4 CLIP that result from the mapping pipeline and omniCLIP.
Auxiliary R scripts (they should be called from within manuscript_figures.R).
R code to to create figures: manuscript_figures.R.
The script should produce a "plots" folder with pdf images of raw figure panels as well as "results" folder with the R objects and "source_data" subfolder with the csv files that were used for plots.
Contains the pipelines used for mapping 1) p32 and 2) ir CPEB4 CLIP data as well as conda environment file with software version specifications.
Contains the scripts used to run omniCLIP on CPEB4 data as well as multiple published HeLa CLIP data. The omniCLIP version used for this, as well as scripts to generate the database are in this omniCLIP fork. The general pipeline to map any CLIP data is in this repository.
Detailed wet lab protocol for infrared PAR-CLIP can be found here.