[Feature Request]: Implement Policy Learning based on APO Model

### Describe the feature you want to propose or implement

Implement a general policy learning method based on multiple possible treatments.

### Propose a possible solution or implementation

Based on [Policy Learning with Confidence](https://arxiv.org/abs/2502.10653).
Using the APOs and several possible weighted options (and e.g. tree search) one could implement the proposed algorithms as the solely rely on value estimates and standard errors.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature Request]: Implement Policy Learning based on APO Model #333

Describe the feature you want to propose or implement

Propose a possible solution or implementation

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature Request]: Implement Policy Learning based on APO Model #333

Description

Describe the feature you want to propose or implement

Propose a possible solution or implementation

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions