WaveBreaking is a Python package that provides detection, classification and tracking of Rossby Wave Breaking (RWB) in weather and climate data. The detection of RWB is based on analyzing the dynamical tropopause represented by a closed contour line encircling the pole as for example the 2 Potential Vorticity Units (PVU) contour line in Potential Vorticity (PV) fields. By applying three different breaking indices, regions of RWB are identified and different characteristics of the events such as area and intensity are calculated. The event tracking provides information about the temporal evolution of the RWB events. Finally, the implemented plotting methods allow for a first visualization. This tool was developed during my master studies at the University of Bern.
The detection of RWB is based on applying a RWB index to the dynamical tropopause. The WaveBreaking package provides three different RWB indices:
- Streamer Index: The streamer index is based on work by Wernli and Sprenger (2007) (and Sprenger et al. 2017). Streamers are elongated structures present on the contour line that represents the dynamical tropopause. They can be described by a pair of contour points that are close together considering their geographical distance but far apart considering their distance connecting the points on the contour. Further description can be found in my master thesis.
- Overturning Index: The overturning index is based on work by Barnes and Hartmann (2012). This index identifies overturning structures of the contour line. An overturning of the contour line is present if the contour intersects at least three times with the same longitude. Further description can be found in my master thesis.
- Cutoff Index: The Cutoff Index provides information about the decaying of a wave breaking event. From a Potential Vorticity perspective, a wave breaking event is formed by an elongation of the 2 PVU contour line. These so-called streamers can elongate further until they separate from the main stratospheric or tropospheric body. The separated structure is referred to as a cutoff (Wernli and Sprenger (2007).
The tool is designed to analyze gridded data provided as an xarray.DataArray. Output is provided either in a geopandas.GeoDataFrame or in an xarray.DataArray.
Parts of the data setup functions and of the tracking function are based on the ConTrack - Contour Tracking tool developed by Daniel Steinfeld.
Important information:
- Free software: MIT license
- Further documentation about the implemented methods can be found in my master thesis.
Referencing:
- Please cite WaveBreaking in your publication: Kaderli, S., 2023. WaveBreaking - Detection, Classification and Tracking of Rossby Wave Breaking. https://doi.org/10.5281/zenodo.14214463
- If you are using the Streamer Index, please cite Wernli and Sprenger (2007) (and Sprenger et al. 2017).
- If you are using the Overturning INdex, please cite Barnes and Hartmann (2012).
To install WaveBreaking, run this command in your terminal:
pip install wavebreaking
This is the preferred method to install WaveBreaking, as it will always install the most recent stable release. Your virtual environment is automatically checked for the necessary dependencies. After the installation, you can start calculating RWB events by following the tutorial below.
The sources for WaveBreaking can be downloaded in two different ways. You can either install WaveBreaking directly from the GitHub repository:
pip install git+https://github.com/skaderli/WaveBreaking
Or you can clone the GitHub repository first and then install WaveBreaking locally. For that, start with setting the working directory and cloning the repository.
git clone https://github.com/skaderli/WaveBreaking.git cd /path/to/local/WaveBreaking
Second, set up the conda environment and install the necessary dependencies (this may take some time):
conda create -y -n wb_env conda env update -f environment.yml -n wb_env
Now activate the environment and install the WaveBreaking package locally by using the developer mode “-e”:
conda activate wb_env pip install -e .
To check if the installation was successful, perform some tests:
python -m unittest tests.test_wavebreaking
This tutorial shows how to calculate RWB events step by step. After successfully installing WaveBreaking, the module needs to be imported. Make sure that the Python kernel with the correct virtual environment (where WaveBreaking is installed) is running.
import wavebreaking as wb
More information about the functions presented below can be found in the documentation.
Please note that the algorithm depends on the order of the spatial dimensions. Both the longitude and latitude dimensions should be in ascending order. Although the WaveBreaking tool identifies and adjusts descending coordinates, the dataset should be checked and adapted before starting the calculation to get the best performance.
Optionally, the variable intended for the RWB calculations can be smoothed. The smoothing routine applies by default a 5-point smoothing (not diagonally) with a double-weighted center and an adjustable number of smoothing passes. Since the smoothing is based on the scipy.ndimage.convolve function, array-like weights and the mode for handling boundary values can be passed as an argument. This routine returns a xarray.DataArray with the variable "smooth_<variable>".
# read data
import xarray as xr
demo_data = xr.open_dataset("tests/data/demo_data.nc")
# smooth variable with 5 passes
import numpy as np
smoothed = wb.calculate_smoothed_field(data=demo_data.PV,
passes=5,
weights=np.array([[0, 1, 0], [1, 2, 1], [0, 1, 0]]), # optional
mode="wrap") # optional
The wavebreaking module calculates the intensity for each identified event, if an intensity field is provided. In my master thesis, the intensity is represented by the momentum flux derived from the product of the (daily) zonal deviations of both wind components. The routine creates a xarray.DataArray with the variable "mflux". More information can be found in my master thesis.
# calculate momentum flux
mflux = wb.calculate_momentum_flux(u=demo_data.U,
v=demo_data.V)
All RWB indices are based on a contour line representing the dynamical tropopause. The "calculate_contours()" function calculates the dynamical tropopause on the desired contour levels (commonly the 2 PVU level for Potential Vorticity). The function supports several contour levels at a time which allows for processing data of both hemispheres at the same time (e.g., contour levels -2 and 2). The contour calculation is also included in the RWB index functions and doesn't need to be performed beforehand. However, you can also pass the contours directly to the index functions. This is especially useful if you want to perform the calculation of several indices at once.
If the input field is periodic, the parameter "periodic_add" can be used to extend the field in the longitudinal direction (default 120 degrees) to correctly extract the contour at the date border. With "original_coordinates = False", array indices are returned (used for the index calculations) instead of original coordinates. The routine returns a geopandas.GeoDataFrame with a geometry column and some properties for each contour.
# calculate contours
contours = wb.calculate_contours(data=smoothed,
contour_levels=[-2, 2],
periodic_add=120, # optional
original_coordinates=True) # optional
All three RWB indices perform the contour calculation before identifying the RWB events. If you pass the separately calculated contours, the contour calcultion is skipped. For the streamer index, the default parameters are taken from Wernli and Sprenger (2007) (and Sprenger et al. 2017) and for the overturning index from Barnes and Hartmann (2012). If the intensity is provided (momentum flux, see data pre-processing), it is calculated for each event. All index functions create a geopandas.GeoDataFrame with a geometry column and some properties for each event.
# calculate streamers
streamers = wb.calculate_streamers(data=smoothed,
contour_levels=[-2, 2],
contours=contours, #optional
geo_dis=800, # optional
cont_dis=1200, # optional
intensity=mflux, # optional
periodic_add=120) # optional
# calculate overturnings
overturnings = wb.calculate_overturnings(data=smoothed,
contour_levels=[-2, 2],
contours=contours, #optional
range_group=5, # optional
min_exp=5, # optional
intensity=mflux, # optional
periodic_add=120) # optional
# calculate cutoffs
cutoffs = wb.calculate_cutoffs(data=smoothed,
contour_levels=[-2, 2],
contours=contours, #optional
min_exp=5, # optional
intensity=mflux, # optional
periodic_add=120) # optional
The event classification is based on selecting the events of interest from the geopandas.GeoDataFrame provided by the index calculation functions.
Some suggested classifications:
# stratospheric and tropospheric (only for streamers and cutoffs)
stratospheric = events[events.mean_var >= contour_level]
tropospheric = events[events.mean_var < contour_level]
# anticyclonic and cyclonic by intensity for the Northern Hemisphere
anticyclonic = events[events.intensity >= 0]
cyclonic = events[events.intensity < 0]
# anticyclonic and cyclonic by intensity for the Southern Hemisphere
anticyclonic = events[events.intensity <= 0]
cyclonic = events[events.intensity > 0]
# anticyclonic and cyclonic by orientation (only for overturning events)
anticyclonic = events[events.orientation == "anticyclonic"]
cyclonic = events[events.orientation == "cyclonic"]
In addition, a subset of events with certain characteristics can be selected, e.g. the 10% largest events:
# 10 percent largest events
large = events[events.event_area >= events.event_area.quantile(0.9)]
To calculate and visualize the occurrence of RWB events, it comes in handy to transform the coordinates of the events into a xarray.DataArray. The "to_xarray" function flags every grid cell where an event is present with the value 1. Before the transformation, it is suggested to classify the events first and only use for example stratospheric events.
# classify events
stratospheric = streamers[streamers.mean_var.abs() >= 2]
# transform to xarray.DataArray
flag_array = wb.to_xarray(data=smoothed,
events=stratospheric)
WaveBreaking provides two options to do a first visual analysis of the output. Both options are based on the xarray.DataArray with the flagged grid cells from the "to_xarray" function.
To analyze a specific large scale situation, the RWB events on a single time steps can be plotted:
# import cartopy for projection
import cartopy.crs as ccrs
wb.plot_step(flag_data=flag_array,
step="1959-06-05T06", #index or date
data=smoothed, # optional
contour_level=[-2, 2], # optional
proj=ccrs.PlateCarree(), # optional
size=(12,8), # optional
periodic=True, # optional
labels=True,# optional
levels=None, # optional
cmap="Blues", # optional
color_events="gold", # optional
title="") # optional
The analyze Rossby wave breaking from a climatological perspective, the occurrence (for specific seasons) can be plotted:
wb.plot_clim(flag_data=flag_array,
seasons=None, # optional
proj=ccrs.PlateCarree(), # optional
size=(12,8), # optional
smooth_passes=0, # optional
periodic=True, # optional
labels=True, # optional
levels=None, # optional
cmap=None, # optional
title="") # optional
Last but not least, WaveBreaking provides a routine to track events over time. Beside the time range of the temporal tracking, two methods for defining the spatial coherence are available. Events receive the same label if they either spatially overlap (method "by_overlapping") or if the centre of mass lies in a certain radius (method "by_radius"). Again, it is suggested to classify the events first and only use for example stratospheric events. This routine adds a column "label" to the events geopandas.GeoDataFrame.
# classify events
anticyclonic = overturnings[overturnings.orientation == "anticyclonic"]
# track events
tracked = wb.track_events(events=anticyclonic,
time_range=6, #time range for temporal tracking in hours
method="by_overlap", #method for tracking ["by_overlap", "by_distance"], optional
buffer=0, # buffer in degrees for polygons overlapping, optional
overlap=0, # minimum overlap percentage, optinal
distance=1000) # distance in km for method "by_distance"
The result can be visualized by plotting the paths of the tracked events:
wb.plot_tracks(data=smoothed,
events=tracked,
proj=ccrs.PlateCarree(), # optional
size=(12,8), # optional
min_path=0, # optional
plot_events=True, # optional
labels=True, # optional
title="") # optional
- The installation guide is to some extend based on the ConTrack - Contour Tracking tool developed by Daniel Steinfeld.
- This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.