Skip to content

Library for Financial Portfolio Optimization in Python

Notifications You must be signed in to change notification settings

alvarosf07/PyFinPO

Repository files navigation


python   platforms  


PyFinPO - Python Financial Portfolio Optimization

Welcome to PyFinPO, my personal library for Financial Portfolio Optimization in Python.

PyFinPO is an abstraction of other Python libraries for Portfolio Optimization (PyPortOpt, skfolio, scikit-learn, cvxpy...), aiming to provide a structured, simple and extensible infrastructure to perform Financial Portfolio Optimization.

Context

When I first set out to develop a Portfolio Optimization Library in Python, my goal was simple: to gain a deep understanding of portfolio theory and its main methods. Building everything from scratch felt like the best way to learn the nuances of the different models, approaches, estimators... and their implementations.

However, as I started exploring the existing Portfolio Optimization libraries in Python, I discovered that there are already VERY GOOD resources available for free (see list below). I quickly came to the realization that it made no sense to implement my own portfolio tool from scratch having all those open-source libraries available –tested and improved through user feedback– that already included most of the Portfolio Optimization tools needed. Reinventing the wheel no longer seemed practical or efficient.

So I changed my mind. Instead of creating yet another standalone library, I decided that in order to understand the nuances of Portfolio Optimization, it would make more sense for me to build a centralized library which organizes the different functionalities available in other open-source libraries under a single, cohesive API. By integrating their strengths with an intuitive high-level interface, this tool could help me (and potentially other users) to seamlessly access the best functionalities each library has to offer without needing to master multiple APIs.

Additionally, I decided to prioritize modularity and extensibility, making extremely easy to add new proprietary models and integrate them in the buit-in API with minimal effort.

In a nutshell, I defined 3 main principles for the project: unified structure, intuitive use, and scalable modularity.

Objectives

PyFinPO aims to satisfy 3 main objectives:

  1. To unify the different Portfolio Optimization functionalities offered in other libraries under a common, structured API.
  2. To provide a high-level interface to intuitively perform Portfolio Optimization with Python.
  3. To offer a MODULAR and EXTENSIBLE tool which anyone can use to build proprietary models and feed them into the created API structure to leverage the already implemented functionalities in existing libraries.

Note

Please note that PyFinPO Library is still under construction, therefore many of the functionalities have not been implemented yet. The summary tables below detail all the necessary information about the functionalities, documentation links and source code status for each model.


Table of Contents

  1. Installation

  2. Library Structure

  3. PyFinPO Features

  4. Future Works

  5. Source Libraries


1) Installation

If you would like to install PyFinPO, you can try any of the following 3 options:

  1. Clone the project and use the source code:
git clone https://github.com/alvarosf07/pyfinpo
  1. Alternatively, you can just install the project directly from your terminal:
pip install -e git+https://github.com/alvarosf07/pyfinpo.git
  1. However, it is best practice to use a dependency manager within a virtual environment. You can import all the dependencies within PyFinPO to your local environment by cloning/downloading the project and then in the project directory just can just run:
python setup.py install

2) Library Structure

  • config - library configuration files
  • data - stores datasets of raw, processed or forecasted asset prices/returns
  • docs - documentation for the different models and functions/classes of the library
  • media - images, files other media content used in the library
  • pyfinpo - library source code. See Features section for full details.
  • scripts - python scripts with diverse contents and applications
  • tests- test implementations of all the different functionalities of the library
  • tutorials - tutorials and practical use cases of implementations with PyFinPO

3) PyFinPO Features

3.0) Overview of Portfolio Selection Problem

Portfolio optimization, as formalized by Harry Markowitz in his seminal 1952 paper "Portfolio Selection," revolutionized investment theory by introducing a systematic and quantitative framework. Markowitz demonstrated that by combining assets with differing expected returns and volatilities, investors can construct portfolios that either minimize risk for a given return or maximize return for a specific level of risk. These optimal portfolios form the efficient frontier, a cornerstone of what is now known as Modern Portfolio Theory (MPT).

At the heart of Markowitz’s framework is mean-variance optimization, which requires two key inputs: estimates of expected returns and a covariance matrix to capture asset interdependencies. While the model provides a theoretically optimal solution, its practical application is limited by the difficulty in accurately forecasting these inputs. Historical data is often used as a proxy, though this approach sacrifices the theoretical guarantees of precision.

Advancements in portfolio optimization have expanded upon these ideas, introducing diverse objective functions and measures of risk such as maximizing Sharpe ratios or minimizing Conditional Value at Risk (CVaR). Constraints such as budget limits, sector exposure, or regulatory requirements are incorporated to reflect real-world considerations.

Therefore, we can divide portfolio optimization process in 4 main steps, each with their own sub-steps. These depicted in the Figure below, and fully detailed in the following section.


3.1) Input Estimators

3.1.1) Expected Return Models

Historical Averages

Model Tag Model Name Documentation Implementation
mh_rm Mean Historical Return Model (aka Empirical Returns) PyPortOpt mh_rm.py
ewmh_rm Exponentially Weighted Mean Return Model PyPortOpt ewmh_rm.py
mceq_rm Market Cap Equilibrium Expected Return Model skfolio To be implemented
eqweq_rm Equal Weight Equilibrium Expected Return Model skfolio To be implemented

Economic & Factor Models

Model Tag Model Name Documentation Implementation
capm_rm Capital Asset Pricing Return Model (Equilibrium) PyPortOpt capm_rm.py
apt_rm Arbitrage Pricing Theory (APT) Return Model See docs To be implemented
fama_factor_rm Fama-French Three-Factor Return Model See docs To be implemented
multi_factor_rm Multi-Factor Return Models skfolio To be implemented

Statistical/Machine Learning Models

  • Shrinkage Estimators -> The estimator shrinks the sample mean toward a target vector.
  • Regression-Based Return Models -> Predict future returns using historical and fundamental data
  • Bayesian Models -> Model relationships between variables and incorporate new information/views to estimate probabilistic returns.
  • Time-Series Forecasting Return Models
  • Supervised ML Models -> Support Vector Machines, Gradient Boosting, Random Forests...
  • Neural Networks -> Capture non-linear relationships and patterns, suitable for complex datasets.
Model Tag Model Name Documentation Implementation
Shrinkage Estimators
shrinkage_rm Shrinkage Return Models Estimators skfolio - Expected Return Shrinkage To be implemented
shrinkage_rm.james_stein Shrinkage Return Models - James-Stein skfolio - Expected Return Shrinkage To be implemented
shrinkage_rm.bayes_stein Shrinkage Return Models - Bayes-Stein skfolio - Expected Return Shrinkage To be implemented
shrinkage_rm.bop Shrinkage Return Models - Bodnar Okhrin Parolya skfolio - Expected Return Shrinkage To be implemented
Bayesian Models
black_litterman_rm Black Litterman Model (return + risk model) PyPortOpt - Black-Litterman black_litterman_rm.py

Hybrid Models

Combine any of the previous models for more robust estimates.

Model Tag Model Name Documentation Code
TBD - To be implemented

3.1.2) Risk Models

Covariance Estimators

Model Tag Model Name Documentation Code
sample_cov Sample Covariance Risk Model RiskModels.md sample_cov.py
empirical_cov Empirical Covariance (Max Likelihood Covariance Estimator) Scikit-learn To be implemented
implied_cov Implied Covariance Risk Model Implied Covariance Matrix To be implemented
semi_cov Semi-covariance Risk Model Semi-Covariance Matrix semi_cov.py
ew_cov Exponentially-Weigthed Covariance Risk Model EW Covariance Matrix ew_cov.py
cov_denoising Covariance Denoising Risk Model Covariance Denoising To be implemented
cov_detoning Covariance Detoning Risk Model Covariance Detoning To be implemented

Covariance Shrinkage

Model Tag Model Name Documentation Code
cov_shrinkage Covariance Shrinkage Risk Models RiskModels.md cov_shrinkage.py
cov_shrinkage.shrunk_covariance Covariance Shrinkage - Manual Shrinkage RiskModels.md cov_shrinkage.py
cov_shrinkage.ledoit_wolf Covariance Shrinkage - Ledoit-Wolf RiskModels.md cov_shrinkage.py
cov_shrinkage.oracle_approximating Covariance Shrinkage - Oracle Approximating RiskModels.md cov_shrinkage.py

Sparse Inverse Covariance Estimators

Model Tag Model Name Documentation Code
graph_lasso_cov Sparse Inverse Graphical Lasso Covariance Estimator Scikit-learn To be implemented

Robust Covariance Estimators

Model Tag Model Name Documentation Code
mcd_cov Robust Minimum Covariance Determinant (MCD) Estimator Scikit-learn To be implemented
gerber_cov Robust Gerber Statistic for Covariance Estimation The Gerber Statistic To be implemented

Quick Example
from pyfinpo.input_estimates import risk_models

cov_matrix = risk_models.sample_cov(prices)
po_plotting.plot_covariance(sample_cov, plot_correlation=True)
plt.show()


3.2) Portfolio Optimization (PO)

Once the input estimates have been determined, we can proceed with the portfolio optimization process.

3.2.1) Portfolio Optimization Models

The first step of the portfolio optimization is to choose the optmization model/framework. The model of choice is going to determine the type of approach we want for the portfolio optimization, the different objective functions that we will be able to optimize for, and the solver used to approach the optimization problem, among other things. As we will see later on, each optimization model/framework has its particular objective functions and solvers/optimizers.

The following Portfolio Optimization Models have been/will be implemented:

Naive PO Models

Model Tag Model Name Documentation Code
eqw_po Equal Weight (1/N) Portfolio Optimization skfolio To be implemented
ivp_po Inverse Volatility Portfolio (IVP) Optimization skfolio To be implemented
random_po Random Portfolio Optimization (Dirichlet Distribution) skfolio To be implemented

Risk-Based PO Models

Model Tag Model Name Documentation Code
gmv_po* Global Minimum Variance PO PyPortOpt, GMVP To be implemented
maxdiv_po* Maximum Diversification Portfolio Optimization skfolio To be implemented
risk_parity_po Risk Parity Portfolio Optimization skfolio, Risk Parity PO To be implemented
risk_budgeting_po Risk Budgeting Portfolio Optimization skfolio, Risk Budgeting PO To be implemented
  • *Both gmv_po and maxdiv_po are not implemented as strict PO models, but as objective functions for optimization inside other PO models (such as Mean-Risk Models detailed below).

Mean-Risk PO Models

Mean-Risk Portfolio Optimization models aim to find asset combinations which optimize the relationship return vs risk. The most well-known Mean-Risk model is Mean-Variance Portfolio Optimization, which uses the variance of asset returns as measure of risk. However, many different measures of risk can be selected, giving rise to a wide range of Mean-Risk models.

PyFinPO provides direct implementation of the most relevant Mean-Risk models, as detailed in the summary table below (Mean-Variance, Mean-Semivariance, Mean-CVaR, Mean-CDaR). In order to select any other measure of risk which is not directly implemented in the PO models below, the parent class MeanRiskPO generalizes all mean-risk models and allows to choose among any of the available risk metrics (see list below).

Model Tag Model Name Documentation Code
mr_po Mean-Risk Portfolio Optimization (generalizes all models below) skfolio To be implemented
mv_po Mean-Variance Portfolio Optimization PyPortOpt mv_po.py
msv_po Mean-Semivariance Portfolio Optimization PyPortOpt msv_po.py
mcvar_po Mean-CVaR (Conditional Value at Risk) Portfolio Optimization PyPortOpt mcvar_po.py
mcdar_po Mean-CDaR (Conditional Drawdown at Risk) Portfolio Optimization PyPortOpt mcdar_po.py

List of Risk Measures Supported under MeanRiskPO class:

  • Mean Absolute Deviation
  • Variance (Second Central Moment)
  • Skewness (Normalized Third Central Moment)
  • Kurtosis (Normalized Fourth Central Moment minus 3)
  • Fourth Central Moment
  • First Lower Partial Moment
  • Semi-Variance (Second Lower Partial Moment)
  • Fourth Lower Partial Moment
  • Value at Risk
  • Maximum Drawdown
  • Average Drawdown
  • Drawdown at Risk
  • Entropic Risk Measure
  • CVaR (Conditional Value at Risk)
  • EVaR (Entropic Value at Risk)
  • CDaR (Conditional Drawdown at Risk)
  • EDaR (Entropic Drawdown at Risk)
  • Worst Realization
  • Ulcer Index
  • Gini Mean Difference

Robust Mean-Risk PO Models

Model Tag Model Name Documentation Code
cla_po Critical Line Algorithm (CLA) Portfolio Optimization PyPortOpt cla_po.py
dr_cvar_po Distributionally Robust CVaR Portfolio Optimization skfolio To be implemented

Clustering PO Models

Model Tag Model Name Documentation Code
hrp_po Hierarchical Risk Parity (HRP) Portfolio Optimization mlfinlab hrp_po.py
herc_po Hierarchical Equal Risk Contribution (HERC) Portfolio Optimization mlfinlab To be implemented
nco_po Nested Clustered Optimization (NCO) mlfinlab To be implemented

Ensemble PO Models

Model Tag Model Name Documentation Code
stack_po Stacking Portfolio Optimization - To be implemented

3.2.2) Portfolio Optimization Objectives


Figure 2 - Mean-Variance Efficient Frontier. Source: PyPortOpt


Objective Functions

Some of the Portfolio Optimization Models exposed above are self-descriptive, in the sense that by definition they only posses a single possible objective function to optimize. In these cases, the default objective function will be automatically chosen by the code implementation, and the user will not need to specify any objective.

However, there are other models (the most obvious kind being Mean-Risk models) which allow to select different objective functions to optimize for. The easiest way to picture this behavior is with Mean-Variance theory as an example (see Figure 2). Within the MVT framework we can define the Efficient Frontier, which represents a Pareto Optimal set of possible optimal portfolios.For a given Efficient Frontier, different points can be selected as optimal under different objective functions.

In this context, PyFinPO provides five main objective functions:

  • Minimum Risk - global_min_risk
    • Represents the point of the Optimal Set with lowest level of risk. Its calculation can be useful to have an idea of how low risk could be for a given problem/portfolio.
  • Minimize Risk - min_risk
    • Minimizes risk for a given level of return.
  • Maximize Return - max_return
    • Minimizes return for a given level of risk.
  • Maximize Ratio - max_ratio
    • Maximizes the ratio return-risk for the whole portfolio. This returns the tangency portfolio, as it represents the point on a returns-risk graph where the tangent to the efficient frontier intersects the y-axis at the risk-free rate. This is the default choice as it finds the optimal return per level of risk at portfolio level.
  • Maximize Utility - max_return
    • Maximizes the utility function provided manually by the user, which specifies its level of risk aversion.

Once again, the word "risk" can be replaced for any of the available risk metrics defined under MeanRiskPO class (see list above).

Note: not all of these objective functions may be available for all the different Mean-Risk models.


Adding Custom Objectives

In addition, sometimes we may want to add extra optimization objectives that are not defined in the previous 5 objective functions.

PyFinPO supports the addition of extra optimization objectives. Note that there are 2 types of objectives, convex and non-convex. While convex objectives fit nicely under a convex optimization solver, non-convex objectives may be treated with care as they may produce incompatibilities with the solvers implemented in this library.

  • Convex Objectives:

    • L2 regularisation (minimising this reduces nonzero weights)
    • Transaction cost model (can be added also as constraint)
    • Custom convex objectives (must be expressed with cvxpy atomic functions)
  • Non-Convex Objectives:

    • See example in the original docs of the PyPortOpt library here

For an example on how to implement custom objectives, see PyFinPO-UserGuide.


Single-Objective vs Multi-Objective Optimization

Note that when we provide an specific objective function or set of objectives for the portfolio optimization, it will return a single solution (i.e. single set of optimal weights that represent the optimal portfolio).

Alternatively, we can change our approach and implement a multi-objective optimization where we obtain an optimal set of solutions from which we can choose the preferred one for our objectives. A way to do this, is by plotting the optimal set of solutions (e.g. efficient frontier in case of Mean-Variance Theory) and then choosing the desired portfolio.


3.2.3) Portfolio Optimization Constraints

Constraints and objectives are treated similarly from an optimization point of view. Therefore, the methodology to add constraints to a portfolio optimization is almost equivalent to the process of adding objectives described above.

The main portfolio optimization constraints available in PyFinPO are:

  • Market Neutrality

  • Long/Short Portfolio

  • Weight Bounds (limit the position size of securities)

  • Sector & Security Constraints

  • Score Constraints

  • Nº of Asset Constraints (Convexity Constraints)

  • Tracking Error Constraints


For examples on how to implement custom objectives in PyFinPO, see PyFinPO-UserGuide.


3.2.4) Portfolio Optimization Period

One of the main limitations of PyFinPO and most other Portfolio Optimization libraries is that the optimization is static (single-period), meaning that based on the input parameters the output optimal portfolio is only valid for a static period of time. Of course, due to the dynamic nature of financial markets it would be preferable to have a dynamic optimization in order to reflect the latest information available in the optimized portfolio and take optimal rebalancing decisions.

Future development plans for PyFinPO include extending the optimization functionalities to address the portfolio optimization problem dynamically. Two main approaches can be found in literature that address PO dynamically, as a Multi-Period Portfolio Optimization (MPPO):

  • The first approach considers a discrete-time PO, where the expected utility of the investor terminal wealth is maximized over a multi-period investment horizon, and portfolio can be rebalanced only at discrete points in time (e.g. for a 1-year PSP, adjusting portfolio weights at the beginning of every month).
  • The second approach is a continuous-time optimization, where asset weights can be reallocated at any time within the investment horizon.

Zhang et. al. provide a detailed review on the formulation and advantages of dynamic PO techniques.


3.2.5) Portfolio Optimization Solver (Optimizer)

The solver implemented to address the optimization problem in the previous examples has been implemented under the hood of the MeanVariancePO class, which inherits its methods from the BaseConvexOptimizer class. In the context of Mean-Variance theory, as the optimization problem is typically convex (unless non-convex constraints or objectives are introduced), it can be solved via quadratic programming with the cvxpy python library for convex optimization.

While Mean-Variance optimization framework can be addressed with convex optimization, other portfolio optimization models which are completely different in character may use different optimization schemes. An overall summary is presented below for quick reference, including only the main portfolio optimization models and the optimization solver they use in this library.

Portfolio Optimization Model Module Main Class Optimization Solver Optimization Solver Details
Mean-Variance Portfolio Optimization mv_po.py MeanVariancePO BaseConvexOptimizer MVPO is addressed with convex optimization via cvxpy
Mean-SemiVariance Portfolio Optimization msv_po.py MeanSemivariancePO BaseConvexOptimizer MSVPO can be re-written as convex problem (full details here)
Mean-CVaR Portfolio Optimization mcvar_po.py MeanCVaRPO BaseConvexOptimizer MCVaRPO can be reduced to linear program (full details here)
Mean-CDaR Portfolio Optimization mcdar_po.py MeanCDaRPO BaseConvexOptimizer MCDaRPO can be reduced to linear program (full details here)
Critical Line Algorithm Portfolio Optimization cla_po.py CLAPO BaseOptimizer CLAPO uses CLA convex optimization solver, specifically designed for PO
Hierarchical Risk Parity Portfolio Optimization hrp_po.py HRPPO BaseOptimizer HRPPO implements hierarchical clustering optimization (more details here)

For a more detailed analysis on how to choose the right solver for any risk metric, visit Riskfolio-Lib - Choosing a Solver.


3.3) Portfolio Performance

To compute the portfolio performance (expected annual return, volatility and Sharpe Ratio) simply call the method .portfolio_performance() on the portfolio model object after having calculated the optimization objectives of choice:

from pyfinpo.portfolio_performance import portfolio_performance

mv_po = po_models.MeanVariancePO(mhr, sample_cov, weight_bounds=(0,1))
opw = mv_po.min_volatility()
mv_po.portfolio_performance(verbose=True)

3.4) Portfolio Optimization Post-Processing

3.4.1) Portfolio Tidy Weights

There is an option to express portfolio optimal weights in a more visual, tidy way with the .clean_weights() method (see PyFinPO-UserGuide).

mv_po = po_models.MeanVariancePO(mhr, sample_cov, weight_bounds=(0,1))

mv_po.min_volatility()
optimal_portfolio_clean_weights = mv_po.clean_weights()
optimal_portfolio_clean_weights
pd.Series(mv_po).plot.pie(figsize=(7,7), colors=colors, autopct='%1.1f%%', pctdistance=0.85)


3.4.2) Portfolio Discrete Allocation

Once we have the optimal portfolio weights, it is probably useful to express it in execution terms (how much quantity of each stock to buy at the current market price). The function discrete_allocation helps us with this purpose:

from pyfinpo.utils import DiscreteAllocation

latest_prices = prices.iloc[-1]  # prices as of the day you are allocating
da = DiscreteAllocation(optimal_portfolio_weights, latest_prices, total_portfolio_value=20000, short_ratio=0.3)
alloc, leftover = da.lp_portfolio()
print(f"Discrete allocation performed with ${leftover:.2f} leftover")
{key: float(value) for key, value in alloc.items()}

Output:

Discrete allocation performed with $13.46 leftover
{'AAPL': 3.0,
 'GOOG': 8.0,
 'META': 5.0,
 'MSFT': 3.0,
 'PFE': 218.0,
 'TSLA': 2.0,
 'UNH': 4.0,
 'XOM': 46.0}

4) Future Works

Some of the improvements that may be added in future versions of PyFinPO include the following:

  • Develop detailed documentation for models and API usage under docs folder.
  • Finish implementation of all the return models, risk models and portfolio optimization models whose status appears as "To be implemented" in the summary tables described above.
  • Include the implementation of multi-period optimization. See more details in Section 3.2.4.
  • Improve Portfolio Performance function to depict a more detailed analysis about the optimized portfolio, including additional measures of risk, performance ratios, backtesting functionalities, sensitivity analysis...
  • Implement tests for all library functionalities under the folder tests.
  • Add additional tutorials under the folder tutorials (currently only 2 are available) to cover more practical use cases of different optimizations and functionalities that can be performed with PyFinPO.

5) Source Libraries

PyFinPO is an abstraction built on top of the following Python libraries:


About

Library for Financial Portfolio Optimization in Python

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published