Maint: parc.find_regions as module method. clarify find_regions #517

AhmetNSimsek · 2023-11-29T10:23:33Z

This PR addresses the following issues/points (see):

parcellation.find_regions() was a static method of Parcellation, but it searches through all parcellations in the registry. For a user holding a parcellation instance parc, there is no clear difference between parc.find_regions() and parc.find(), even though they should use find to search within that parcellation (find is a method of Region).
There is also atlas.find_regions(), an instance method. It behaves differently from parcellation.find_regions(), and this needs to be clarified.
Finally, instead of caching in a dictionary keyed by tuples of kwargs, why not use a cached property?
- lru_cache might be necessary for long-running systems like siibra-api

(Please see the discussions on the code changes for further details)
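
To make the first point concrete, here is a minimal sketch with toy classes. It is not siibra's actual implementation: `REGISTRY`, the class body, and the region names are hypothetical. It only illustrates how a static method that searches the whole registry is also reachable from an instance, where it can be mistaken for an instance-scoped search.

```python
# Toy model of the naming problem; these are NOT siibra's real classes.
REGISTRY = []  # hypothetical stand-in for the parcellation registry


class Parcellation:
    def __init__(self, name, region_names):
        self.name = name
        self.region_names = list(region_names)
        REGISTRY.append(self)

    def find(self, spec):
        # Instance-scoped: searches only this parcellation's regions.
        return [r for r in self.region_names if spec in r]

    @staticmethod
    def find_regions(spec):
        # Registry-wide: ignores which instance (if any) it was called on.
        return [r for p in REGISTRY for r in p.region_names if spec in r]


parc_a = Parcellation("A", ["V1 left", "V1 right"])
parc_b = Parcellation("B", ["V1", "V2"])

print(parc_a.find("V1"))          # regions of parc_a only
print(parc_a.find_regions("V1"))  # regions of parc_a and parc_b
```

Both calls look like searches "on" `parc_a`, yet only the first one is restricted to it. Moving the registry-wide search to a module-level function removes that ambiguity.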

codecov-commenter · 2023-11-29T14:42:15Z

Codecov Report

Attention: Patch coverage is 94.11765% with 1 line in your changes missing coverage. Please review.

Project coverage is 45.97%. Comparing base (dc63457) to head (650ffb9).
Report is 10 commits behind head on main.

Files with missing lines	Patch %	Lines
siibra/features/anchor.py	50.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #517      +/-   ##
==========================================
- Coverage   46.00%   45.97%   -0.04%     
==========================================
  Files          75       75              
  Lines        7232     7224       -8     
==========================================
- Hits         3327     3321       -6     
+ Misses       3905     3903       -2


xgui3783

Mostly fine.

I have questions about whether find_regions should still be in parcellation.py, and about the aggressive caching strategy we are using everywhere.

siibra/core/parcellation.py

xgui3783 · 2023-11-30T15:46:45Z

siibra/core/parcellation.py

@@ -364,3 +326,45 @@ def __lt__(self, other):
            )
            return self.name < other.name
        return self.version.__lt__(other.version)
+
+
+def find_regions(


~~is it feasible that user would try to find_regions without a parcellation?~~

~~(correct me if I am wrong, but this is what this function is doing?)~~

edit: I think I see why this is needed (it's needed to decode region spec in anatomic anchor)

Since it's not used anywhere else, I almost wonder if the module-level function should go to the anatomical anchor (where it is used exclusively),

or to the top level under region (where it makes more semantic sense).

It is not only used internally; it is also exposed as siibra.find_regions(). We can potentially use region.find() in anchor. I think the original implementation also had species as a keyword argument, which made more sense to use there.

It needs access to parcellation.registry so I think it is better in parcellation module.

The fact that it needs Parcellation.registry is not relevant to users.

If I am a user and expect to find a method called find_regions (which, in fact, does not depend on any particular parcellation), I would expect to find it in the region.py module.

I do like the idea of a class/static method on the Region class that acts as a proxy to this function. (Indeed, rather than a standalone function, I think it makes more sense as a static/class method of the Region class.)

Hmm then it would still be accessible from region and parcellation instances. We can carry it to the region module but if we are going through all parcellations, why shouldn't it be under the parcellation module? And this also creates less confusion given that Region class has find method.

Since there is also a similar function in atlas, which goes through all parcellations in an atlas, I think the design is reasonable. But we might want to discuss this design with @dickscheid, who initially implemented it.

> We can carry it to the region module but if we are going through all parcellations, why shouldn't it be under the parcellation module?

But the user doesn't know that. (nor should they)

The promise of find_regions, at least to me, is: you provide a string, and I will find you all regions that fit this string.

Whether, internally, we go through all parcellations and iterate over the regions of each, or we somehow keep a region instance table, is not relevant to the user.

Which is why I say: either keep it private (prefix with _) or it should go in the region module.

I see what you mean now. I thought you were advocating for it to be a class method or static method of Region, not a function in the region module. I'll also get Timo's opinion and make a change accordingly.

I agree that this is better placed here at the module level to avoid the confusion with Region.find()

siibra/VERSION

siibra/core/parcellation.py

AhmetNSimsek · 2023-12-13T09:48:10Z

siibra/core/parcellation.py

@@ -364,3 +326,44 @@ def __lt__(self, other):
            )
            return self.name < other.name
        return self.version.__lt__(other.version)
+
+
+@lru_cache(maxsize=128)


TD: investigate how lru_cache hashes the inputs to compare with dictionary caching

https://docs.python.org/3/library/functools.html#functools.lru_cache
"Since a dictionary is used to cache results, the positional and keyword arguments to the function must be hashable."

Since the kwargs of find_regions are either a string or a boolean, they are hashable. So essentially, the key of the old implementation (key = (regionspec, filter_children, find_topmost)) is still what identifies a cached result, only now it is hashed by lru_cache.
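
As a rough illustration of the point above, the following sketch decorates a placeholder function with `lru_cache`. The function body and the default values are assumptions made for the example, not the code in this PR; only the parameter names come from the old cache key.

```python
from functools import lru_cache


@lru_cache(maxsize=2)  # small maxsize only to make eviction visible
def find_regions(regionspec, filter_children=False, find_topmost=False):
    # Placeholder body (assumption); a real search would walk the registry.
    return (regionspec, filter_children, find_topmost)


find_regions("v1", filter_children=True)
find_regions("v1", filter_children=True)  # same argument pattern: cache hit
info = find_regions.cache_info()
print(info.hits, info.misses)

# Unhashable arguments are rejected when the cache key is built.
unhashable_error = None
try:
    find_regions(["v1", "v2"])
except TypeError as err:
    unhashable_error = err
    print("unhashable:", err)

# maxsize bounds the cache; least recently used entries are evicted.
for spec in ("a", "b", "c"):
    find_regions(spec)
print(find_regions.cache_info().currsize)
```

One difference from a hand-built key tuple is worth keeping in mind: the functools documentation notes that distinct argument patterns (for example, a different keyword order) may be treated as distinct calls with separate cache entries.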

dickscheid · 2024-02-21T11:46:12Z

Just to explain the original logic - there were different entry points for different scopes:

* Searching any region known to siibra was designed as a class method `Parcellation.find_regions()`, which searches through all parcellation instances at runtime (maybe it was implemented as static, which I agree was not proper).
* Searching only the regions linked to a species was implemented as an instance method of Atlas, which holds the parcellations of that species.
* Searching regions in one particular parcellation only was implemented as an instance method of the parcellation, actually implemented as `find` in Region, since Region is the parent class that holds the subtree and allows for a recursive implementation.

I think the logic is clear, but I understand that the automatic exposure of the class method Parcellation.find_regions for Parcellation instances leads to confusion.
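
The three scopes described above can be sketched roughly as follows. This is a simplified toy model under stated assumptions, not siibra's code: the constructors, the substring matching, the explicit `registry` argument, and all region names are hypothetical.

```python
class Region:
    def __init__(self, name, children=()):
        self.name = name
        self.children = list(children)

    def find(self, spec):
        # Parcellation scope: recursive search over this node's subtree.
        matches = [self] if spec.lower() in self.name.lower() else []
        for child in self.children:
            matches.extend(child.find(spec))
        return matches


class Parcellation(Region):
    # Root of a region tree; inherits the recursive find() from Region.
    pass


class Atlas:
    def __init__(self, species, parcellations):
        self.species = species
        self.parcellations = list(parcellations)

    def find_regions(self, spec):
        # Species scope: only the parcellations held by this atlas.
        return [r for p in self.parcellations for r in p.find(spec)]


def find_regions(spec, registry):
    # Global scope: every parcellation known at runtime.
    return [r for p in registry for r in p.find(spec)]


human = Parcellation("human-parc", [Region("area-1 left"), Region("area-1 right")])
rat = Parcellation("rat-parc", [Region("area-1")])
atlas = Atlas("human", [human])
registry = [human, rat]

print([r.name for r in human.find("area-1")])
print([r.name for r in atlas.find_regions("area-1")])
print([r.name for r in find_regions("area-1", registry)])
```

Written as a module-level function, the global search is no longer reachable through a `Parcellation` instance, which is the source of confusion mentioned above.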

siibra/core/parcellation.py

AhmetNSimsek · 2024-02-21T11:55:44Z

> Just to explain the original logic - there were different entry points for different scopes:
>
> * Searching any region known to siibra was designed as a class method `Parcellation.find_regions()`, which searches through all parcellation instances at runtime (maybe it was implemented as static, which I agree was not proper).
> * Searching only the regions linked to a species was implemented as an instance method of Atlas, which holds the parcellations of that species.
> * Searching regions in one particular parcellation only was implemented as an instance method of the parcellation, actually implemented as `find` in Region, since Region is the parent class that holds the subtree and allows for a recursive implementation.
>
> I think the logic is clear, but I understand that the automatic exposure of the class method Parcellation.find_regions for Parcellation instances leads to confusion.

Thank you. Then, IMO, we should either make find_regions a module-level function in parcellation.py or, if it stays a static or class method, hide it and expose it only through siibra.find_regions. I'd prefer the first option, as I do not see any reason for such a function to be a class or instance method.

siibra/core/region.py

AhmetNSimsek added 2 commits November 29, 2023 11:21

Maint: parc.find_regions as module method. clarify find_regions

a1ce2d5

do not get top most with find_regions by default

827db31

AhmetNSimsek force-pushed the maint_find_regions branch from f0427d2 to 827db31 Compare November 29, 2023 14:33

AhmetNSimsek requested a review from xgui3783 November 30, 2023 15:38

xgui3783 approved these changes Nov 30, 2023

AhmetNSimsek and others added 2 commits December 1, 2023 12:29

Use lru_cache for Region.find and parcellation.find_regions

290067f

Merge branch 'main' into maint_find_regions

761a05d

AhmetNSimsek added the maintenance Not a bug or breaking issue. Code maintenance related. label Dec 7, 2023

AhmetNSimsek commented Dec 13, 2023

AhmetNSimsek assigned dickscheid and AhmetNSimsek Jan 29, 2024

dickscheid reviewed Feb 21, 2024

siibra/core/parcellation.py (outdated)

dickscheid reviewed Feb 21, 2024

siibra/core/parcellation.py (outdated)

dickscheid reviewed Feb 21, 2024

siibra/core/region.py (outdated)

xgui3783 reviewed Feb 21, 2024

siibra/core/region.py (outdated)

AhmetNSimsek added 2 commits November 14, 2024 16:04

Merge branch 'main' into maint_find_regions

a4e5e43

find_regions docstring, use lru_cache not directly on member function…

650ffb9

…, region.species does not need to check parcellation's existence

AhmetNSimsek requested review from xgui3783 and dickscheid November 15, 2024 09:58
