Replies: 1 comment
-
I think this is a good way of integrating it and also the differentiation between things that should only return numerical distances and things for which we are interested in the significance (p-values, confidence intervals) is quite clear with the Additionally, by keeping the prototypes for distances strictly as (x: pd.Series, y: pd.Series) -> float and the prototypes for tests as (x: pd.Series, y: pd.Series) -> DistanceResult I imagine it will not be very difficult to make use of the metrics from Insight, as they have consistent types throughout. A similar thing can be done for correlations, as some methods used for calculating coefficients are actually based on hypothesis testing (e.g. Kruskal-Wallis H Test) and we are simply using that |
Beta Was this translation helpful? Give feedback.
-
We have to integrate metrics from insight into fairlens and move metrics in fairlens to insight.
I'm thinking we could wrap statistical distance and correlation metrics using the following structure.
Then for metrics which return a p-value and hypothesis tests we could use the following structure.
Perhaps worth renaming
DistanceResult
toTestResult
or perhaps redefining something similar in fairlens.For methods like
stat_distance
we can loop through the methods indistance.py
and use their names as mode and remove the p-value option. Alternatively we could renamestat_distance
tostat_similarity
and make it so it always returns a p-value.Thoughts?
Beta Was this translation helpful? Give feedback.
All reactions