feat: Add Variance Reduction to Delta Method and couple it with CUPAC #230

LGonzalezGomez · 2025-03-02T19:01:37Z

CUPED can now be used together with the Delta Method.
This is intended to be used directly with the Power Analysis, so the user is expected to use CUPAC. This simplifies usage a bit; it is a slight approximation, but it should work for our implementation.
Otherwise we would have to change how the interface works so that both ratio_metrics and normal metrics can be passed, which might imply a ton of refactoring and change how users usually interact with the library.

"Delta Method Isolated" would be the proper way of doing the Delta Method with CUPAC, where we don't predict the ratio but both the numerator and the denominator, and then apply the Delta Method on those predictions so that it is accounted for when using CUPED. As can be seen, predicting the ratio is a good approximation and tends to overestimate variance.
Recall that we apply the Delta Method because otherwise we underestimate variance. I prefer having a very small type I error rather than changing how users interact with the library.
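
For reference, a minimal sketch of the Delta Method variance for a ratio metric, assuming we have one numerator total and one denominator total per cluster. The function name and inputs are illustrative only and are not the library's API:

```python
import numpy as np


def delta_method_ratio_variance(numerator, denominator):
    """Approximate Var(mean(numerator) / mean(denominator)) via the Delta Method.

    Hypothetical helper for illustration; inputs are per-cluster totals.
    """
    num = np.asarray(numerator, dtype=float)
    den = np.asarray(denominator, dtype=float)
    n = len(num)

    mean_num, mean_den = num.mean(), den.mean()
    var_num = num.var(ddof=1)
    var_den = den.var(ddof=1)
    # Sample covariance between numerator and denominator
    cov = np.cov(num, den, ddof=1)[0, 1]

    ratio = mean_num / mean_den
    # First-order Taylor expansion of the ratio around the two means
    return (var_num - 2 * ratio * cov + ratio**2 * var_den) / (n * mean_den**2)
```

When the denominator is constant and equal to 1, this reduces to the usual variance of a mean, `var(numerator) / n`.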

Please feel free to check it and we can discuss implementation and align on the best way going forward!

Removed the possibility of passing ratio covariates. Ideally this should always be done through the CUPAC adjustment, since only a very small deviation was observed and it was a very good approximation.

codecov-commenter · 2025-03-02T19:10:30Z

Codecov Report

Attention: Patch coverage is 96.82540% with 2 lines in your changes missing coverage. Please review.

Project coverage is 96.38%. Comparing base (9570cf6) to head (e65774e).

Files with missing lines	Patch %	Lines
cluster_experiments/cupac.py	90.00%	1 Missing ⚠️
cluster_experiments/experiment_analysis.py	98.11%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #230      +/-   ##
==========================================
+ Coverage   96.28%   96.38%   +0.10%     
==========================================
  Files          17       17              
  Lines        1668     1716      +48     
==========================================
+ Hits         1606     1654      +48     
  Misses         62       62

LGonzalezGomez · 2025-03-06T07:09:34Z

Codecov is not updating. The only line that is not covered is in CUPAC. I'm not sure a test for it is really needed (it seems a bit too much IMO), but we could add a test in power config to check that CUPAC with the Delta Method is indeed working.

david26694 · 2025-03-11T10:06:42Z

cluster_experiments/cupac.py

@@ -136,7 +140,12 @@ def _prep_data_cupac(
        df_predict = df.drop(columns=[self.target_col])
        # Split data into X and y
        pre_experiment_x = pre_experiment_df.drop(columns=[self.target_col])
-        pre_experiment_y = pre_experiment_df[self.target_col]
+        if self.scale_col:


Perhaps adding a property in the class called `is_delta_method` would make this more explicit.
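
A minimal sketch of what such a property could look like, assuming the check stays equivalent to the `if self.scale_col:` in the diff. The class below is a hypothetical stand-in, not the actual class in `cupac.py`:

```python
from typing import Optional


class CupacHandlerSketch:
    """Hypothetical stand-in used only to illustrate the property."""

    def __init__(self, target_col: str = "target", scale_col: Optional[str] = None):
        self.target_col = target_col
        self.scale_col = scale_col

    @property
    def is_delta_method(self) -> bool:
        # Same truthiness check as `if self.scale_col:` in the diff
        return bool(self.scale_col)
```

With this, the branch in `_prep_data_cupac` could read `if self.is_delta_method:` instead of checking `scale_col` directly.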

david26694 · 2025-03-11T10:09:29Z