Performance improvements to AreaWeighted with sparse matrices

## 📰 Custom Issue


As a subtask of #4754 is to rewrite the algorithm for AreaWeighted to use sparse matrices as the standard (if this improves performance). Looking at the code for area weighted regridding, it looks like there are a few opportunities to improve performance.

1. Weight calculation happens in a nested for loop over x and y dimensions, e.g: https://github.com/SciTools/iris/blob/dab88e8bd7f76f9bb82ee07b12b1264e56c5bc5e/lib/iris/analysis/_area_weighted.py#L889 This calculation ought to be seperable into two different loops with their results then *combined*.
2. The weights are *applied* using numpy average functions where matrix multiplication may be more appropriate https://github.com/SciTools/iris/blob/dab88e8bd7f76f9bb82ee07b12b1264e56c5bc5e/lib/iris/analysis/_area_weighted.py#L510
3. It may turn out to be even more performant to store *two* weight matrices, one for each dimension. Each may then be applied independently.
4. **Note:** In order for the application of weights to be separated, normalisation due to masked data will have to be handled *after* the application of weights as a separate step. This is not possible when using numpy average methods where the normalisation step is built in.

This would require rewriting much of the code for AreaWeighted from the ground up. This may be slightly simplified by following templates set by the use of sparse matrices in other Iris regridders.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Performance improvements to AreaWeighted with sparse matrices #5365

📰 Custom Issue

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Performance improvements to AreaWeighted with sparse matrices #5365

Description

📰 Custom Issue

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions