Collection of scripts / notebooks to reliably select and curate datasets
data
: Some definition data used to query other datasets. These files are mostly static because they don't change often, but they are versioned just in case.human-kinases
: Aggregates different sources that list the complete human kinome and provides a curated CSV file with Uniprot IDs and their origin.kinases-in-chembl
: Maps human kinases UniProt IDs (fromhuman-kinases
) to ChEMBL target IDs.chembl-25
: Retrieve activities (IC50, Ki, Kd) of small compounds against human kinases in ChEMBL v25 (requires local SQLite copy).chembl-26
: Retrieve activities (IC50, Ki, Kd) of small compounds against human kinases in ChEMBL v26 (requires local SQLite copy).chembl-27
: Retrieve activities (IC50, Ki, Kd) of small compounds against human kinases in ChEMBL v27 (requires local SQLite copy).
Create a conda
environment using the included environment file:
conda env create -f devtools/environment.yml
conda activate openkinome-datascripts