Exposure fitting, curve calibration & reach and frequency allocator #1132

gufengzhou · 2024-11-12T15:48:25Z

Type of change

feat: Exposure fitting is being reactivated. When paid_media_vars differ from paid_media_spend, paid_media_vars will be used for fitting. The extrapolation from exposure to spend uses the metric CPE (cost per exposure) = spend / exposure. Accordingly, the Michaelis-Menten function for spend-exposure fitting is deprecated, because it doesn't improve the fit significantly and is very error-prone. The missing nonlinear relationship between spend and exposure will be addressed by curve calibration.
feat: curve calibration. The new robyn_calibrate feature works standalone, without the other main Robyn functions. It consumes a dataframe with two columns: spend and response. This allows users to provide any saturation curve as a source of truth to improve Robyn's saturation parameter identification and narrow down the sampling ranges for alpha and gamma. Typical sources of truth for saturation curves are Meta conversion lift and Halo cumulative reach.
feat: reach and frequency allocator. It allows users to find the optimum reach and frequency combination for media planning.
Refactor: aligned and simplified the nonlinear transformation process across functions like run_transformation, robyn_response, robyn_pareto and robyn_allocator
Refactor: inflexion point calculation changed from .dot_product(range(x), gamma) to sum(x) * gamma to allow more flexibility in curve estimation
various fixes
update of documentation

Unit test (tbc)

- in order to use real exposure metrics in modeling, fit_spend_exposure must be inverted into spend ~ exposure
- adapted vmax start value to avoid 0 denominator in the inverted equation
- adapted spend exposure plots
- the impact on saturation curve and budget allocation needs to be reassessed

- now warns about a weak relationship only when both rsq_nls and rsq_lm are below the threshold
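The warning rule in this commit can be illustrated with a small Python sketch. This is not the Robyn R code: the function name `warn_if_weak_fit` and the default threshold of 0.8 are assumptions made for the example; only the "both R-squared values below the threshold" condition comes from the commit message.

```python
import warnings


def warn_if_weak_fit(rsq_nls: float, rsq_lm: float, threshold: float = 0.8) -> bool:
    """Warn only if BOTH the nonlinear and the linear fit are weak.

    The threshold default is a placeholder chosen for this sketch.
    Returns True when a warning was issued.
    """
    weak = rsq_nls < threshold and rsq_lm < threshold
    if weak:
        warnings.warn(
            f"Weak spend-exposure relationship: rsq_nls={rsq_nls:.2f}, "
            f"rsq_lm={rsq_lm:.2f} (threshold={threshold})"
        )
    return weak


# One good fit is enough to stay silent.
only_nls_weak = warn_if_weak_fit(rsq_nls=0.55, rsq_lm=0.92)
both_weak = warn_if_weak_fit(rsq_nls=0.55, rsq_lm=0.60)
```

Requiring both fits to be weak avoids warning when either model already describes the relationship well.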

- exposure_handling function for better R&F integration and readability
- better col readability in dt_plotNLS
- simplify dt_transform ETL

This is the proof of concept of an R&F allocator that includes
- Simulated R&F data
- Implemented multiplicative model
- visualisation of surface
- R&F allocator with nlopt
- constraint validation

- Imp/GRP will be used for parent model fitting. This addition aims to decompose saturated imp into its R&F components with separate saturation
- R&F will obtain the same adstock param as imp and be transformed. No separate adstock param estimation
- Then adstocked R&F will be fitted with different hill transformations (different sets of alphas / gammas)
- The R&F hill params are estimated using a multiplicative equation with Nevergrad
- First simulated results show close to perfect R&F->Imp fitting.
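One possible reading of the multiplicative equation is that saturated impressions are modelled as the product of a Hill-transformed reach and a Hill-transformed frequency. The Python sketch below illustrates that reading only. The exact equation used in the PR is not shown here, the inflexion is passed directly instead of being derived from gamma, and a small grid search stands in for Nevergrad. All names are hypothetical.

```python
from itertools import product


def hill(x, alpha, inflexion):
    """Hill saturation: x^alpha / (x^alpha + inflexion^alpha)."""
    return x**alpha / (x**alpha + inflexion**alpha)


def rf_surface(reach, freq, params):
    """Assumed multiplicative form: hill(reach) * hill(frequency)."""
    alpha_r, inflexion_r, alpha_f, inflexion_f = params
    return hill(reach, alpha_r, inflexion_r) * hill(freq, alpha_f, inflexion_f)


def mse(params, data):
    """Mean squared error of the surface against (reach, freq, target) rows."""
    return sum((rf_surface(r, f, params) - y) ** 2 for r, f, y in data) / len(data)


# Simulated data generated from known parameters (illustrative values only).
true_params = (1.0, 500.0, 2.0, 3.0)
data = [
    (r, f, rf_surface(r, f, true_params))
    for r in (100.0, 400.0, 900.0)
    for f in (1.0, 2.0, 5.0)
]

# Grid search as a stand-in optimiser; the PR uses Nevergrad instead.
grid = product((0.5, 1.0, 2.0), (250.0, 500.0, 1000.0), (0.5, 1.0, 2.0), (1.5, 3.0, 6.0))
best = min(grid, key=lambda p: mse(p, data))
```

On noise-free simulated data the search recovers the generating parameters, which mirrors the "close to perfect" fit reported for the first simulations.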

- Deprecate function fit_spend_exposure, incl. Michaelis Menten. Nonlinear fitting between spend and exposure wasn't improving fitting significantly. Instead, future curve calibration feature will aim to improve curve identification.
- Use linear model only: cpm as ratio for spend to exposure translation.
- remove minpack.lm / nlsLM dependency
- robyn_input() works
- to-do: export InputCollect$ExposureCollect$plot_spend_exposure

- update exposure handling, esp. introduce metric "cost per exposure" as linear scaler between expo and spend.
- use cpe_window to scale the whole dataset in order to obtain the right spend scale for modeling period.
- simplify check_varnames
- update check_paidmedia
- update check_factorvars
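The cost-per-exposure scaling can be sketched in a few lines of Python. This is an illustration, not the Robyn implementation: computing CPE as a ratio of sums, the function names and the sample numbers are all assumptions. The idea taken from the commit is that the CPE of the modeling window is applied to the whole exposure series.

```python
def cost_per_exposure(spend, exposure):
    """CPE = total spend / total exposure (ratio of sums is an assumption)."""
    total_exposure = sum(exposure)
    if total_exposure <= 0:
        raise ValueError("exposure must sum to a positive value")
    return sum(spend) / total_exposure


def scale_exposure_to_spend(exposure, cpe):
    """Translate an exposure series to the spend scale with a linear scaler."""
    return [e * cpe for e in exposure]


# Illustrative weekly data; the modeling window covers weeks 2 and 3.
spend = [100.0, 200.0, 150.0, 250.0]
exposure = [1000.0, 2000.0, 2000.0, 2000.0]
window = slice(1, 3)

cpe_all = cost_per_exposure(spend, exposure)                     # 700 / 7000
cpe_window = cost_per_exposure(spend[window], exposure[window])  # 350 / 4000

# The window CPE scales the WHOLE series, so spend within the window is matched.
scaled_spend = scale_exposure_to_spend(exposure, cpe_window)
```

Using the window CPE rather than the full-history CPE keeps the scaled series on the spend level of the period that is actually modeled.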

- exposure_handling with scaled spend will now replace respective media exposure vars in dt_transform
- adapt model.R, incl. reset run_transformations params to have clearer overview of params needed.
- simplify transformation.R by removing unnecessary checks
- remove documentation for fit_spend_exposure
- include rlang::`:=` operator for easier dynamic variable assignment in the future
- check with document update successful. no errors / no warnings / no notes

- In model.R & pareto.R: remove decompSpendDist from both scripts to reduce memory leak. Use xDecompAgg subsets instead
- In transformation.R & response.R: unify transformation namings in run_transformation and robyn_response
- In response.R: remove exposure extrapolation because it's already done in robyn_input. Also add inflexion point to output.
- In plots.R: fix onepager saturation plot issues
- In pareto.R: rewrite run_dt_resp() as response_wrapper and align transformation logic & naming.
- In pareto.R: Replace foreach response loop with lapply for simplicity.
- In pareto.R: Simplify plot data generation process, esp. for saturation curve plot, actual vs predicted plot & immediate vs carryover plot.
- In pareto.R: Remove redundancy in xDecompVecCollect -> remove type rawMedia, rawSpend, predictedExposure, saturatedMedia & saturatedSpendReversed. Only keep adstockedMedia & decompMedia for response curve plotting.

- add titles and change y axis.

- use Halo cumulative reach and simulated spend
- curve fitting with Hill using Nevergrad
- early stop convergence with while loop
- CI range for alpha & gamma
- plotting
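The fitting loop with early stopping can be sketched as follows. This is a Python illustration under stated assumptions: seeded random search stands in for Nevergrad, the inflexion is taken as a linear interpolation between min and max spend (one reading of the range-based formula), and the search bounds, patience and tolerance are placeholder values. The CI range for alpha and gamma is not covered.

```python
import random


def hill(x, alpha, inflexion):
    """Hill saturation: x^alpha / (x^alpha + inflexion^alpha)."""
    return x**alpha / (x**alpha + inflexion**alpha)


def mse_loss(params, spend, response):
    """MSE between the Hill curve and the source-of-truth response."""
    alpha, gamma = params
    # Assumption: inflexion interpolates linearly between min and max spend.
    inflexion = min(spend) + gamma * (max(spend) - min(spend))
    return sum((hill(x, alpha, inflexion) - y) ** 2 for x, y in zip(spend, response)) / len(spend)


def fit_hill(spend, response, max_iter=5000, patience=500, tol=1e-10, seed=0):
    """Random search that stops early after `patience` iterations without gain."""
    rng = random.Random(seed)
    best_params, best_loss = None, float("inf")
    stale, n_iter = 0, 0
    while n_iter < max_iter and stale < patience:
        candidate = (rng.uniform(0.5, 3.0), rng.uniform(0.01, 1.0))  # (alpha, gamma)
        loss = mse_loss(candidate, spend, response)
        if best_loss - loss > tol:
            best_params, best_loss, stale = candidate, loss, 0
        else:
            stale += 1
        n_iter += 1
    return best_params, best_loss, n_iter


# Simulated curve with known parameters (alpha=2.0, gamma=0.4), response in [0, 1].
spend = [10.0 * i for i in range(1, 11)]
response = [hill(x, 2.0, 10.0 + 0.4 * 90.0) for x in spend]
(alpha_hat, gamma_hat), final_loss, iterations = fit_hill(spend, response)
```

A real source-of-truth curve would first need its response rescaled to the [0, 1] range of the Hill function; that step is left out of the sketch.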

- Create robyn_calibrate that consumes curve input and outputs hyperparameter ranges for use as model input.
- Rename previous internal robyn_calibrate function as lift_calibration
- add support functions .dot_product, .qti, .mse_loss, geom_density_ci, check_qti
- to-do: Dataframe input df_curve_sot needs to be extended for multiple campaigns. Plot needs to be exported. New checks needed too.

- include inflexion into resultHypParam for better curve calibration handling
- fix plot warning

- replace paid_media_spend with paid_media_selected in script
- clean up variable retrieval and sorting
- deprecate get_hill_params because inflexions are now included in resultHypParam

- fix transformation loop error
- add set_default_hyppar for easier testing

…mental/Robyn into reach_and_frequency

- add param force_curve in robyn_calibrate to allow c or s shape control
- change inflexion calculation to sum(x) * gamma to increase flexibility of inflexion point
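The effect of the inflexion change can be seen by comparing both formulas on the same input. The Python sketch below assumes that .dot_product(range(x), gamma) interpolates between min(x) and max(x) with weights (1 - gamma, gamma); that reading is not confirmed by this PR text. The new formula sum(x) * gamma is taken from the description above. Function names are hypothetical.

```python
def inflexion_range(x, gamma):
    """Previous form (assumed): interpolate between min(x) and max(x)."""
    return min(x) * (1 - gamma) + max(x) * gamma


def inflexion_sum(x, gamma):
    """New form per the PR description: sum(x) * gamma."""
    return sum(x) * gamma


x = [10.0, 20.0, 30.0, 40.0]
gamma = 0.5

old_inflexion = inflexion_range(x, gamma)  # bounded by [min(x), max(x)]
new_inflexion = inflexion_sum(x, gamma)    # may exceed max(x)
```

Under the range-based form the inflexion point always lies within the observed values, whereas the sum-based form can place it beyond max(x), which is one way the new calculation adds flexibility.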

simplify allocator checks

gufengzhou and others added 25 commits July 30, 2024 23:57

fix: spend exposure warning error

083e700

- now warns about a weak relationship only when both rsq_nls and rsq_lm are below the threshold

refactor: create exposure_handling function

f8b2dfc

- exposure_handling function for better R&F integration and readability
- better col readability in dt_plotNLS
- simplify dt_transform ETL

poc: reach and frequency allocator

9483eeb

This is the proof of concept of an R&F allocator that includes
- Simulated R&F data
- Implemented multiplicative model
- visualisation of surface
- R&F allocator with nlopt
- constraint validation

Merge branch 'main' into reach_and_frequency

2856fca

Merge branch 'main' into reach_and_frequency

95245d1

update: exposure plot

53c1c90

- add titles and change y axis.

poc: update curve fitting feature

200692b

- use Halo cumulative reach and simulated spend
- curve fitting with Hill using Nevergrad
- early stop convergence with while loop
- CI range for alpha & gamma
- plotting

refactor: export inflexion

4bf45da

- include inflexion into resultHypParam for better curve calibration handling
- fix plot warning

fix: adapt allocator to paid_media_selected

cb42f29

- replace paid_media_spend with paid_media_selected in script
- clean up variable retrieval and sorting
- deprecate get_hill_params because inflexions are now included in resultHypParam

recode: avoid additional rlang dependency

1fcd11f

feat: viz improvements to match Robyn's style

081061a

recode: apply styler::tidyverse_style()

46c9203

fix: include MSE y-label

b5028b1

fix: paid_media_selected[i] iteration

b4ee45a

fix & feat: fix input error & add set_default_hyppar

1708e70

- fix transformation loop error
- add set_default_hyppar for easier testing

Merge branch 'reach_and_frequency' of github.com-robyn:facebookexperi…

a16b227

…mental/Robyn into reach_and_frequency

feat & update: force_curve and inflexion

b613b10

- add param force_curve in robyn_calibrate to allow c or s shape control
- change inflexion calculation to sum(x) * gamma to increase flexibility of inflexion point

Merge branch 'main' into reach_and_frequency

90cdf39

facebook-github-bot added the CLA Signed label Nov 12, 2024

gufengzhou requested a review from igorskokan November 12, 2024 15:50

fix: allocator checks

01ae4ff

simplify allocator checks
