Skip to content

Commit

Permalink
May2024 - new files for the IQB workshop (#32)
Browse files Browse the repository at this point in the history
* eliminated one variable in the final cell

* added an exercise for the workshop and homework

* starting a notebook on small molecules

* first draft of the search for trypsin ligands

* second image for the EC_class_ligands_search notebook

* first image for the EC_class_ligands_search notebook

* latest update with Jessica's suggested changes

* a ligand file for the EC_Class notebook

* images for the new notebooks

* first attempt at ligand modification

* optimize in 3D and print 3D sdf files

* ligand 13U and 3 modified ligands in 3D

* Exploring AllChem MMFF

* changed the ligand to be used in the rdkit notebook

* fully functional notebook with glucose and ATP test cases

* restarted notebooks and cleared all outputs

* restarted and cleared all output

* Adds notebook to TOC, adds missing image

---------

Co-authored-by: Jessica Nash <[email protected]>
  • Loading branch information
paulcraig and janash committed Jun 10, 2024
1 parent 4e98a01 commit 4f7f1d4
Show file tree
Hide file tree
Showing 15 changed files with 2,876 additions and 0 deletions.
1 change: 1 addition & 0 deletions biochemist-python/_toc.yml
Original file line number Diff line number Diff line change
Expand Up @@ -13,3 +13,4 @@ chapters:
- file: chapters/biopython_mmcif
- file: chapters/rcsb_api
- file: chapters/MolVis_with_iCN3D
- file: chapters/EC_class_ligands_search
357 changes: 357 additions & 0 deletions biochemist-python/chapters/EC_class_ligands_search.ipynb

Large diffs are not rendered by default.

1,663 changes: 1,663 additions & 0 deletions biochemist-python/chapters/Modifying_Ligands_with_rdkit.ipynb

Large diffs are not rendered by default.

70 changes: 70 additions & 0 deletions biochemist-python/chapters/SmallMolVis.ipynb
Original file line number Diff line number Diff line change
@@ -0,0 +1,70 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "739f4a0d-a082-417a-a983-225798383b06",
"metadata": {},
"source": [
"# Viewing Small Molecules in iCN3D\n",
"\n",
"While most people use iCN3D to view proteins and other macromolecular structures, it can also be used to explore small molecules of interest. The simplest way to do this is to find the small molecule on [PubChem](https://pubchem.ncbi.nlm.nih.gov/), capture its unique identifier and modify the iCN3D code we have used so far, replacing `mmdbid` or `pdbid` with `cid`.\n",
"\n",
"Here is the [PubChem Search Page for ATP](https://pubchem.ncbi.nlm.nih.gov/) page which contains 231 compounds. \n",
"![ATP Search Results](images/PubChem_ATP_231.png \"ATP Search Results\")\n",
"\n",
"Explore the page to find the line that says, **Compound CID: 644208**. The number **644208** is the critical piece of information. If PubChem contains a 3D version of this molecule, you can view it in iCN3D.\n"
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "ef33b2c7-1613-430f-8b75-b954186115c6",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"'/Users/pac8612/Desktop/python-scripting-biochemistry/biochemist-python/chapters'"
]
},
"execution_count": 1,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"import os\n",
"os.getcwd()"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "0f72f8f2-d478-4c46-a3cc-0a1508d310f1",
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.8.8"
}
},
"nbformat": 4,
"nbformat_minor": 5
}
138 changes: 138 additions & 0 deletions biochemist-python/chapters/data/ligand13U-3D.sdf
Original file line number Diff line number Diff line change
@@ -0,0 +1,138 @@

RDKit 3D

65 67 0 0 0 0 0 0 0 0999 V2000
6.9596 1.4301 1.2036 C 0 0 0 0 0 0 0 0 0 0 0 0
6.4999 -0.0268 1.1959 C 0 0 0 0 0 0 0 0 0 0 0 0
6.1183 -0.5709 -0.1846 C 0 0 0 0 0 0 0 0 0 0 0 0
4.6568 -0.3674 -0.5905 C 0 0 0 0 0 0 0 0 0 0 0 0
4.3589 0.9126 -1.3992 C 0 0 0 0 0 0 0 0 0 0 0 0
4.7820 2.2540 -0.7672 C 0 0 0 0 0 0 0 0 0 0 0 0
4.5762 2.3749 0.7433 C 0 0 0 0 0 0 0 0 0 0 0 0
5.8777 2.4575 1.5392 C 0 0 0 0 0 0 0 0 0 0 0 0
2.9599 0.8866 -1.8622 N 0 0 0 0 0 0 0 0 0 0 0 0
1.9383 1.1995 -0.8522 C 0 0 0 0 0 0 0 0 0 0 0 0
0.5528 1.0066 -1.4871 C 0 0 0 0 0 0 0 0 0 0 0 0
0.0857 1.8523 -2.2550 O 0 0 0 0 0 0 0 0 0 0 0 0
-0.1239 -0.1567 -1.1819 N 0 0 0 0 0 0 0 0 0 0 0 0
0.4718 -1.3715 -0.6557 C 0 0 0 0 0 0 0 0 0 0 0 0
-0.3511 -2.4843 -1.2853 C 0 0 0 0 0 0 0 0 0 0 0 0
-1.7402 -1.8688 -1.3802 C 0 0 0 0 0 0 0 0 0 0 0 0
-1.4596 -0.4001 -1.7207 C 0 0 1 0 0 0 0 0 0 0 0 0
-2.5551 0.4934 -1.1063 C 0 0 0 0 0 0 0 0 0 0 0 0
-2.3764 1.2550 -0.1614 O 0 0 0 0 0 0 0 0 0 0 0 0
-3.7940 0.3245 -1.7045 N 0 0 0 0 0 0 0 0 0 0 0 0
-4.9755 0.9762 -1.1877 C 0 0 0 0 0 0 0 0 0 0 0 0
-5.5746 0.1875 -0.0503 C 0 0 0 0 0 0 0 0 0 0 0 0
-5.3690 0.6025 1.2748 C 0 0 0 0 0 0 0 0 0 0 0 0
-5.8718 -0.1457 2.3397 C 0 0 0 0 0 0 0 0 0 0 0 0
-6.5689 -1.3300 2.0861 C 0 0 0 0 0 0 0 0 0 0 0 0
-6.7847 -1.7537 0.7723 C 0 0 0 0 0 0 0 0 0 0 0 0
-6.2888 -0.9970 -0.2925 C 0 0 0 0 0 0 0 0 0 0 0 0
-7.0621 -2.1418 3.2266 C 0 0 0 0 0 0 0 0 0 0 0 0
-6.9770 -3.4349 3.2443 N 0 0 0 0 0 0 0 0 0 0 0 0
-7.6178 -1.5325 4.3102 N 0 0 0 0 0 0 0 0 0 0 0 0
7.7419 1.5356 1.9677 H 0 0 0 0 0 0 0 0 0 0 0 0
7.4485 1.6834 0.2574 H 0 0 0 0 0 0 0 0 0 0 0 0
5.7042 -0.2001 1.9277 H 0 0 0 0 0 0 0 0 0 0 0 0
7.3532 -0.6264 1.5399 H 0 0 0 0 0 0 0 0 0 0 0 0
6.2757 -1.6574 -0.1357 H 0 0 0 0 0 0 0 0 0 0 0 0
6.8033 -0.2075 -0.9601 H 0 0 0 0 0 0 0 0 0 0 0 0
4.0107 -0.4568 0.2903 H 0 0 0 0 0 0 0 0 0 0 0 0
4.3868 -1.2210 -1.2297 H 0 0 0 0 0 0 0 0 0 0 0 0
4.9463 0.8266 -2.3238 H 0 0 0 0 0 0 0 0 0 0 0 0
4.2270 3.0639 -1.2597 H 0 0 0 0 0 0 0 0 0 0 0 0
5.8237 2.4578 -1.0340 H 0 0 0 0 0 0 0 0 0 0 0 0
3.9570 1.5651 1.1397 H 0 0 0 0 0 0 0 0 0 0 0 0
4.0080 3.2923 0.9469 H 0 0 0 0 0 0 0 0 0 0 0 0
6.3078 3.4583 1.4052 H 0 0 0 0 0 0 0 0 0 0 0 0
5.6323 2.3774 2.6061 H 0 0 0 0 0 0 0 0 0 0 0 0
2.8420 1.5539 -2.6294 H 0 0 0 0 0 0 0 0 0 0 0 0
1.9974 2.2415 -0.5218 H 0 0 0 0 0 0 0 0 0 0 0 0
2.0337 0.5671 0.0310 H 0 0 0 0 0 0 0 0 0 0 0 0
1.5313 -1.4596 -0.9140 H 0 0 0 0 0 0 0 0 0 0 0 0
0.3583 -1.3580 0.4355 H 0 0 0 0 0 0 0 0 0 0 0 0
0.0233 -2.7036 -2.2942 H 0 0 0 0 0 0 0 0 0 0 0 0
-0.3312 -3.4117 -0.7042 H 0 0 0 0 0 0 0 0 0 0 0 0
-2.3583 -2.3591 -2.1396 H 0 0 0 0 0 0 0 0 0 0 0 0
-2.2528 -1.9676 -0.4127 H 0 0 0 0 0 0 0 0 0 0 0 0
-1.4273 -0.2442 -2.8039 H 0 0 0 0 0 0 0 0 0 0 0 0
-3.9176 -0.4046 -2.3909 H 0 0 0 0 0 0 0 0 0 0 0 0
-4.7246 1.9951 -0.8705 H 0 0 0 0 0 0 0 0 0 0 0 0
-5.6975 1.0564 -2.0075 H 0 0 0 0 0 0 0 0 0 0 0 0
-4.7947 1.5059 1.4823 H 0 0 0 0 0 0 0 0 0 0 0 0
-5.6789 0.1740 3.3599 H 0 0 0 0 0 0 0 0 0 0 0 0
-7.3480 -2.6602 0.5694 H 0 0 0 0 0 0 0 0 0 0 0 0
-6.4601 -1.3370 -1.3114 H 0 0 0 0 0 0 0 0 0 0 0 0
-6.5006 -3.7787 2.4057 H 0 0 0 0 0 0 0 0 0 0 0 0
-8.0695 -2.2082 4.9205 H 0 0 0 0 0 0 0 0 0 0 0 0
-8.1987 -0.7238 4.1176 H 0 0 0 0 0 0 0 0 0 0 0 0
1 2 1 0
2 3 1 0
3 4 1 0
4 5 1 0
5 6 1 0
6 7 1 0
7 8 1 0
5 9 1 0
9 10 1 0
10 11 1 0
11 12 2 0
11 13 1 0
13 14 1 0
14 15 1 0
15 16 1 0
16 17 1 0
17 18 1 0
18 19 2 0
18 20 1 0
20 21 1 0
21 22 1 0
22 23 2 0
23 24 1 0
24 25 2 0
25 26 1 0
26 27 2 0
25 28 1 0
28 29 2 3
28 30 1 0
8 1 1 0
17 13 1 0
27 22 1 0
1 31 1 0
1 32 1 0
2 33 1 0
2 34 1 0
3 35 1 0
3 36 1 0
4 37 1 0
4 38 1 0
5 39 1 0
6 40 1 0
6 41 1 0
7 42 1 0
7 43 1 0
8 44 1 0
8 45 1 0
9 46 1 0
10 47 1 0
10 48 1 0
14 49 1 0
14 50 1 0
15 51 1 0
15 52 1 0
16 53 1 0
16 54 1 0
17 55 1 6
20 56 1 0
21 57 1 0
21 58 1 0
23 59 1 0
24 60 1 0
26 61 1 0
27 62 1 0
29 63 1 0
30 64 1 0
30 65 1 0
M END

156 changes: 156 additions & 0 deletions biochemist-python/chapters/data/ligand13Uipr-3D.sdf
Original file line number Diff line number Diff line change
@@ -0,0 +1,156 @@

RDKit 3D

74 76 0 0 0 0 0 0 0 0999 V2000
7.9438 -1.7628 1.6641 C 0 0 0 0 0 0 0 0 0 0 0 0
7.3325 -2.7439 0.6656 C 0 0 0 0 0 0 0 0 0 0 0 0
5.8011 -2.6913 0.5581 C 0 0 0 0 0 0 0 0 0 0 0 0
5.2581 -1.7295 -0.5007 C 0 0 0 0 0 0 0 0 0 0 0 0
4.8616 -0.3278 0.0099 C 0 0 0 0 0 0 0 0 0 0 0 0
5.9450 0.3826 0.8431 C 0 0 0 0 0 0 0 0 0 0 0 0
7.3496 0.3418 0.2390 C 0 0 0 0 0 0 0 0 0 0 0 0
8.3675 -0.4053 1.0977 C 0 0 0 0 0 0 0 0 0 0 0 0
4.4792 0.4496 -1.1805 N 0 0 0 0 0 0 0 0 0 0 0 0
3.5540 1.5610 -0.9066 C 0 0 0 0 0 0 0 0 0 0 0 0
2.1463 1.0966 -0.4792 C 0 0 0 0 0 0 0 0 0 0 0 0
1.7323 -0.0570 -0.6156 O 0 0 0 0 0 0 0 0 0 0 0 0
1.3355 2.0690 0.0745 N 0 0 0 0 0 0 0 0 0 0 0 0
1.6741 3.4526 0.3566 C 0 0 0 0 0 0 0 0 0 0 0 0
0.3634 4.0919 0.7959 C 0 0 0 0 0 0 0 0 0 0 0 0
-0.4240 2.9198 1.3806 C 0 0 0 0 0 0 0 0 0 0 0 0
-0.0305 1.7560 0.4701 C 0 0 1 0 0 0 0 0 0 0 0 0
-0.9700 1.6927 -0.7555 C 0 0 0 0 0 0 0 0 0 0 0 0
-0.7540 2.2972 -1.8036 O 0 0 0 0 0 0 0 0 0 0 0 0
-2.1294 0.9788 -0.5294 N 0 0 0 0 0 0 0 0 0 0 0 0
-3.0680 0.7418 -1.6007 C 0 0 0 0 0 0 0 0 0 0 0 0
-4.4525 0.5247 -1.0492 C 0 0 0 0 0 0 0 0 0 0 0 0
-5.0154 -0.7610 -0.9895 C 0 0 0 0 0 0 0 0 0 0 0 0
-6.3138 -0.9792 -0.4817 C 0 0 0 0 0 0 0 0 0 0 0 0
-6.8703 -2.3939 -0.3975 C 0 0 0 0 0 0 0 0 0 0 0 0
-6.0923 -3.2303 0.6251 C 0 0 0 0 0 0 0 0 0 0 0 0
-6.8984 -3.0870 -1.7629 C 0 0 0 0 0 0 0 0 0 0 0 0
-7.0287 0.1417 -0.0209 C 0 0 0 0 0 0 0 0 0 0 0 0
-6.4658 1.4236 -0.0503 C 0 0 0 0 0 0 0 0 0 0 0 0
-5.1862 1.6136 -0.5641 C 0 0 0 0 0 0 0 0 0 0 0 0
-8.4151 0.0540 0.4989 C 0 0 0 0 0 0 0 0 0 0 0 0
-9.4003 -0.3270 -0.2474 N 0 0 0 0 0 0 0 0 0 0 0 0
-8.6723 0.3980 1.7894 N 0 0 0 0 0 0 0 0 0 0 0 0
7.2745 -1.6298 2.5204 H 0 0 0 0 0 0 0 0 0 0 0 0
8.8504 -2.2249 2.0782 H 0 0 0 0 0 0 0 0 0 0 0 0
7.5801 -3.7491 1.0306 H 0 0 0 0 0 0 0 0 0 0 0 0
7.8101 -2.6565 -0.3164 H 0 0 0 0 0 0 0 0 0 0 0 0
5.3385 -2.5097 1.5354 H 0 0 0 0 0 0 0 0 0 0 0 0
5.4799 -3.6983 0.2581 H 0 0 0 0 0 0 0 0 0 0 0 0
4.3488 -2.1858 -0.9188 H 0 0 0 0 0 0 0 0 0 0 0 0
5.9576 -1.6702 -1.3426 H 0 0 0 0 0 0 0 0 0 0 0 0
3.9889 -0.4635 0.6597 H 0 0 0 0 0 0 0 0 0 0 0 0
5.6675 1.4341 0.9894 H 0 0 0 0 0 0 0 0 0 0 0 0
5.9365 -0.0299 1.8571 H 0 0 0 0 0 0 0 0 0 0 0 0
7.3501 -0.0790 -0.7719 H 0 0 0 0 0 0 0 0 0 0 0 0
7.7092 1.3734 0.1163 H 0 0 0 0 0 0 0 0 0 0 0 0
9.2807 -0.5364 0.5050 H 0 0 0 0 0 0 0 0 0 0 0 0
8.6518 0.2347 1.9422 H 0 0 0 0 0 0 0 0 0 0 0 0
5.3111 0.8034 -1.6477 H 0 0 0 0 0 0 0 0 0 0 0 0
3.4252 2.1300 -1.8354 H 0 0 0 0 0 0 0 0 0 0 0 0
3.9783 2.2340 -0.1573 H 0 0 0 0 0 0 0 0 0 0 0 0
2.4064 3.4592 1.1710 H 0 0 0 0 0 0 0 0 0 0 0 0
2.1093 3.9359 -0.5213 H 0 0 0 0 0 0 0 0 0 0 0 0
-0.1649 4.4999 -0.0730 H 0 0 0 0 0 0 0 0 0 0 0 0
0.5119 4.9041 1.5122 H 0 0 0 0 0 0 0 0 0 0 0 0
-0.0788 2.7260 2.4040 H 0 0 0 0 0 0 0 0 0 0 0 0
-1.4993 3.1187 1.4108 H 0 0 0 0 0 0 0 0 0 0 0 0
-0.0600 0.7966 0.9988 H 0 0 0 0 0 0 0 0 0 0 0 0
-2.1606 0.3435 0.2580 H 0 0 0 0 0 0 0 0 0 0 0 0
-3.0731 1.5809 -2.3055 H 0 0 0 0 0 0 0 0 0 0 0 0
-2.7204 -0.1384 -2.1534 H 0 0 0 0 0 0 0 0 0 0 0 0
-4.4301 -1.6042 -1.3516 H 0 0 0 0 0 0 0 0 0 0 0 0
-7.9053 -2.3696 -0.0448 H 0 0 0 0 0 0 0 0 0 0 0 0
-6.5570 -4.2161 0.7472 H 0 0 0 0 0 0 0 0 0 0 0 0
-5.0485 -3.3843 0.3273 H 0 0 0 0 0 0 0 0 0 0 0 0
-6.0929 -2.7403 1.6066 H 0 0 0 0 0 0 0 0 0 0 0 0
-7.4420 -2.4821 -2.4967 H 0 0 0 0 0 0 0 0 0 0 0 0
-5.8911 -3.2681 -2.1536 H 0 0 0 0 0 0 0 0 0 0 0 0
-7.4016 -4.0575 -1.6905 H 0 0 0 0 0 0 0 0 0 0 0 0
-7.0335 2.2770 0.3107 H 0 0 0 0 0 0 0 0 0 0 0 0
-4.7627 2.6167 -0.5956 H 0 0 0 0 0 0 0 0 0 0 0 0
-9.0692 -0.5246 -1.1986 H 0 0 0 0 0 0 0 0 0 0 0 0
-7.9433 0.1931 2.4597 H 0 0 0 0 0 0 0 0 0 0 0 0
-9.5898 0.0660 2.0767 H 0 0 0 0 0 0 0 0 0 0 0 0
1 2 1 0
2 3 1 0
3 4 1 0
4 5 1 0
5 6 1 0
6 7 1 0
7 8 1 0
5 9 1 0
9 10 1 0
10 11 1 0
11 12 2 0
11 13 1 0
13 14 1 0
14 15 1 0
15 16 1 0
16 17 1 0
17 18 1 0
18 19 2 0
18 20 1 0
20 21 1 0
21 22 1 0
22 23 2 0
23 24 1 0
24 25 1 0
25 26 1 0
25 27 1 0
24 28 2 0
28 29 1 0
29 30 2 0
28 31 1 0
31 32 2 3
31 33 1 0
8 1 1 0
17 13 1 0
30 22 1 0
1 34 1 0
1 35 1 0
2 36 1 0
2 37 1 0
3 38 1 0
3 39 1 0
4 40 1 0
4 41 1 0
5 42 1 0
6 43 1 0
6 44 1 0
7 45 1 0
7 46 1 0
8 47 1 0
8 48 1 0
9 49 1 0
10 50 1 0
10 51 1 0
14 52 1 0
14 53 1 0
15 54 1 0
15 55 1 0
16 56 1 0
16 57 1 0
17 58 1 1
20 59 1 0
21 60 1 0
21 61 1 0
23 62 1 0
25 63 1 0
26 64 1 0
26 65 1 0
26 66 1 0
27 67 1 0
27 68 1 0
27 69 1 0
29 70 1 0
30 71 1 0
32 72 1 0
33 73 1 0
33 74 1 0
M END

Loading

0 comments on commit 4f7f1d4

Please sign in to comment.