Improvements to fix missing data sources #12

joschka-gross · 2024-10-11T13:30:42Z

Tries to fix #11 (comment)

…ediction

review-notebook-app · 2024-10-11T13:30:47Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

mbackenkoehler · 2024-10-11T13:32:24Z

kinodata/data/dataset.py

@@ -87,7 +87,7 @@ def process_raw_data(
        removeHs=remove_hydrogen,
    )
    if activity_type_subset is not None:
-        df = df.query("activities.standard_type in @activity_type_subset")
+        df = df[df["activities.standard_type"].isin(activity_type_subset)]


mbackenkoehler

@joschka-gross change that now or later?

mbackenkoehler · 2024-10-11T13:34:08Z

kinodata/data/dataset.py

@@ -102,7 +102,9 @@ def process_raw_data(
    )
    best_structure = (
        df.sort_values(by="docking.predicted_rmsd", ascending=True)
-        .groupby(group_key)[group_key + ["docking.predicted_rmsd", "molecule"]]
+        .groupby(group_key)[
+            group_key + ["docking.predicted_rmsd", "molecule", "activities.activity_id"]


should probably be the best structure overall (for each activity_id)

Joschka Groß and others added 4 commits August 13, 2024 14:15

change subset check pandas dataset activity type

2493718

Merge branch 'main' of github.com:volkamerlab/kinodata-3D-affinity-pr…

1f5d4f7

…ediction

add chembl and klifs id by default

a9673fc

add method for patching dataset

6b71c78

mbackenkoehler reviewed Oct 11, 2024

View reviewed changes

joschka-gross mentioned this pull request Oct 11, 2024

About bioactivity anotations #11

Open

mbackenkoehler requested changes Oct 11, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improvements to fix missing data sources #12

Improvements to fix missing data sources #12

joschka-gross commented Oct 11, 2024

review-notebook-app bot commented Oct 11, 2024

mbackenkoehler Oct 11, 2024

mbackenkoehler left a comment

mbackenkoehler Oct 11, 2024

Improvements to fix missing data sources #12

Are you sure you want to change the base?

Improvements to fix missing data sources #12

Conversation

joschka-gross commented Oct 11, 2024

review-notebook-app bot commented Oct 11, 2024

mbackenkoehler Oct 11, 2024

Choose a reason for hiding this comment

mbackenkoehler left a comment

Choose a reason for hiding this comment

mbackenkoehler Oct 11, 2024

Choose a reason for hiding this comment