
Feature request: Skill Scores #49

Open
ahuang11 opened this issue Nov 21, 2019 · 24 comments
Labels
enhancement New feature or request

Comments

@ahuang11
Member

ahuang11 commented Nov 21, 2019

https://confluence.ecmwf.int/display/FUG/12.B+Statistical+Concepts+-+Probabilistic+Data
I think it's just:
[image: screenshot of the formula, BSS = 1 − BS_forecast / BS_reference]

import xskillscore as xs

def brier_skill_score(obs, fct, ref, threshold):
    bscore_fct = xs.brier_score(obs > threshold, (fct > threshold).mean('member'))
    bscore_ref = xs.brier_score(obs > threshold, (ref > threshold).mean('member'))
    bskill = 1 - (bscore_fct / bscore_ref)
    return bskill

where ref can be persistence or climatology (where each year of the climatology is a member)
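For reference, the arithmetic behind this is simple; here is a minimal plain-NumPy sketch of the Brier score and the resulting skill score (illustrative only, not the xskillscore API):

```python
import numpy as np

def brier_score(event, prob):
    # Mean squared difference between forecast probability and binary outcome.
    return np.mean((prob - event) ** 2)

event = np.array([1, 0, 1, 1, 0])               # binary outcomes, e.g. obs > threshold
prob_fct = np.array([0.8, 0.2, 0.6, 0.9, 0.1])  # ensemble exceedance fractions
prob_ref = np.full(5, 0.5)                      # reference probabilities, e.g. climatology

bss = 1 - brier_score(event, prob_fct) / brier_score(event, prob_ref)
# bss ≈ 0.79: the forecast removes about 79% of the reference's Brier score
```

A BSS of 1 is a perfect forecast, 0 means no improvement over the reference, and negative values mean the forecast is worse than the reference.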

@raybellwaves
Member

You can see how @matteodefelice did it in his notebook (https://colab.research.google.com/drive/1wWHz_SMCHNuos5fxWRUJTcB6wqkTJQCR)

brier_score = xs.brier_score(obs > obs.quantile(2/3, dim='time'),
                             (fct > fct.quantile(2/3, dim='time')).mean('member'))
baseline = xs.brier_score(obs > obs.quantile(2/3, dim='time'), 2/3)
bss = 1 - (brier_score / baseline)

@ahuang11
Member Author

ahuang11 commented Nov 22, 2019

Yeah, I think both ways are valid, just with a different baseline; maybe his can be used as the default when ref=None

def brier_skill_score(obs, fct, threshold, ref=None):
    bscore_fct = xs.brier_score(obs > threshold, (fct > threshold).mean('member'))
    if ref is None:
        bscore_ref = xs.brier_score(obs > obs.quantile(2/3, dim='time'), 2/3)
    else:
        bscore_ref = xs.brier_score(obs > threshold, (ref > threshold).mean('member'))
    bskill = 1 - (bscore_fct / bscore_ref)
    return bskill

@raybellwaves raybellwaves changed the title metric request: Brier Skill Score Feature request: Brier Skill Score Dec 21, 2019
@raybellwaves raybellwaves added the enhancement New feature or request label Mar 27, 2020
@aaronspring
Collaborator

aaronspring commented Aug 16, 2020

I think we should somehow come up with a pattern/wrapper that translates every score into a skill score.

  • Brier Score and Brier Skill Score
  • RMSE and RMSE Skill Score
  • ...

The formula would be from https://www.cawcr.gov.au/projects/verification/

[image: screenshot of the generic skill score formula, SS = (score − score_ref) / (score_perfect − score_ref)]

Any ideas how to do this in a few lines of code?
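One possible pattern, sketched here with hypothetical names (skill_score and its perfect argument are illustrative, not existing xskillscore API), following the generic formula SS = (score − score_ref) / (score_perfect − score_ref):

```python
import numpy as np

def skill_score(metric, obs, fct, ref, perfect=0.0, **kwargs):
    """Generic skill score: (score - score_ref) / (score_perfect - score_ref)."""
    score = metric(obs, fct, **kwargs)
    score_ref = metric(obs, ref, **kwargs)
    return (score - score_ref) / (perfect - score_ref)

# Demo with a plain-NumPy MSE standing in for an xskillscore metric:
mse = lambda a, b: np.mean((a - b) ** 2)
obs = np.array([1.0, 2.0, 3.0])
fct = np.array([1.0, 2.0, 4.0])
ref = np.zeros(3)                      # a deliberately poor reference forecast
ss = skill_score(mse, obs, fct, ref)   # same as 1 - mse(obs, fct) / mse(obs, ref)
```

For negatively oriented metrics whose perfect score is 0 (MSE, RMSE, Brier score), this reduces to the familiar 1 − score/score_ref; metrics with a different perfect value would supply it via the perfect attribute discussed below.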

@aaronspring aaronspring changed the title Feature request: Brier Skill Score Feature request: Skill Scores Aug 16, 2020
@aaronspring aaronspring pinned this issue Aug 16, 2020
@ahuang11
Member Author

Is it applicable across all scores though? I am only familiar with the Brier skill score (this is the first time I've heard of an RMSE skill score).

@aaronspring
Collaborator

I think so. In climpred we have a metrics class with attributes min, max, and perfect. Maybe it would be nice to move this to xskillscore.

@bradyrx
Collaborator

bradyrx commented Sep 6, 2020

I think this sort of thing should live in climpred. a and b in xskillscore are just two time series, for instance: say a is the forecast and b the verification product. To get the BSS, you'd evaluate a against b, and then evaluate a persistence derivation of b against b. Since we manage reference forecasts in climpred, I'd think over there we would just call the xskillscore Brier score twice.

@bradyrx
Collaborator

bradyrx commented Sep 10, 2020

I guess with the addition of the sign test, this could still fit here. If you hand xskillscore two different forecasts (one being the dynamical forecast and the other the reference), you can compute skill scores. climpred would then wrap that to loop nicely through all the leads.

@aaronspring
Collaborator

This could be a template: https://github.com/bradyrx/climpred/blob/8a3fc954df2043f998987a1964059a9dc0d2e11c/climpred/metrics.py#L128-L140, where we would need the perfect attribute.

But maybe we shouldn't overdo things. xs is perfect for easy functions like xs.metric(a, b, dim), and skill scores can actually be calculated from those manually quite easily. I don't see a large computational benefit of implementing this compared to doing it sequentially, e.g. manual_scoring_function(xs.metric(a, b, dim), reference(*args, **kwargs)).

At the very least we shouldn't change the existing API when implementing this.

@raybellwaves
Member

See #284. Leaving this open as there is good discussion here.

@matteodefelice

matteodefelice commented Mar 18, 2021

By the way, it seems that the line:

baseline = xs.brier_score(obs_final >  obs_final.quantile(2/3, dim = 'time'), 2/3)

doesn't work any more, I get this error on my Colab:

AttributeError                            Traceback (most recent call last)

<ipython-input-68-ad9dcdfd6ce1> in <module>()
----> 1 baseline = xs.brier_score(obs_final >  obs_final.quantile(2/3, dim = 'time'), 2/3)

/usr/local/lib/python3.8/site-packages/xskillscore/core/probabilistic.py in brier_score(observations, forecasts, member_dim, fair, dim, weights, keep_attrs)
    348                 res = (e / M - o) ** 2 - e * (M - e) / (M ** 2 * (M - 1))
    349         else:
--> 350             if member_dim in forecasts.dims:
    351                 forecasts = forecasts.mean(member_dim)
    352             res = xr.apply_ufunc(

AttributeError: 'float' object has no attribute 'dims'

EDIT: yes, now it requires an xarray object and it doesn't work any more with numbers.
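A likely workaround (a sketch, not verified against the current xskillscore release) is to wrap the scalar probability in a 0-d xr.DataArray, which has the .dims attribute the traceback complains about and broadcasts against the observations:

```python
import xarray as xr

prob = 2 / 3
# A plain float has no .dims, which is what raises the AttributeError above;
# a 0-d DataArray does, and xarray broadcasting handles the rest, e.g.:
# baseline = xs.brier_score(obs_final > obs_final.quantile(2/3, dim='time'), prob_da)
prob_da = xr.DataArray(prob)
```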

@aaronspring
Collaborator

Thanks for reporting, @matteodefelice. Can you please open an issue for this? Conversion from a number to an xr.DataArray can easily be added then.

@matteodefelice

Does anyone have an example of how to calculate the skill score for the RPS or CRPS? I still haven't found a way to do that...

@aaronspring
Collaborator

We have crpss in climpred.

@aaronspring
Collaborator

For crpss, look into the proper scoring example.

@matteodefelice

Thanks @aaronspring, but the real difficulty for me is calculating the RPS of the climatology. I will find a way. Thanks to everyone for sharing.

@aaronspring
Collaborator

Generalized for perfect-model and hindcast setups:

https://github.com/pangeo-data/climpred/blob/14f7458c1f2e944990d7008001adab49480fe07d/climpred/reference.py#L106

I create a fake 1-member forecast from groupby(dayofyear).

@matteodefelice

That's a good idea. I did it, and the results seem reasonable. Given that I am working with seasonal averages (one point per year), this is what I have done:

cat_edges = obs.quantile(q = [1/3, 2/3], dim = 'year').rename({'quantile':'category_edge'})
rps_clim = xskillscore.rps(obs, obs.mean(dim = 'year').expand_dims({'member':1}), cat_edges, dim = 'year', member_dim='member')

The RPSS is comparable with the BSS, so I think this should work.
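For reference, the RPS underlying these calls is essentially a Brier score accumulated over cumulative category probabilities. A minimal plain-NumPy sketch for a single forecast with three tercile categories (illustrative only; conventions differ on whether to divide by the number of categories minus one, so check against the xskillscore output before comparing numbers):

```python
import numpy as np

def rps(obs_onehot, fct_probs):
    # Sum of squared differences between the cumulative forecast
    # and cumulative observed category distributions.
    return np.sum((np.cumsum(fct_probs) - np.cumsum(obs_onehot)) ** 2)

obs_onehot = np.array([0.0, 1.0, 0.0])  # observation fell in the middle tercile
fct_probs = np.array([0.2, 0.5, 0.3])   # forecast category probabilities
score = rps(obs_onehot, fct_probs)      # (0.2-0)^2 + (0.7-1)^2 + (1-1)^2 = 0.13
```

The RPSS then follows the same 1 − RPS/RPS_clim pattern as the BSS, with the climatological forecast supplying RPS_clim as in the snippet above.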

@aschl

aschl commented Nov 28, 2023

Just came across this package. Nice work.
What is the relationship between climpred and xskillscore? Some metrics are only available in climpred and are missing from xskillscore. A function to calculate the skill score would actually be really nice (even if it were just the basic function that @aaronspring mentioned here).

@aaronspring
Collaborator

Relationship: climpred imports (nearly?) all xskillscore metrics by wrapping xskillscore. climpred may also have some metrics defined only in climpred, such as skill scores for some metrics, e.g. normalised MSE as mse/std, or msess = 1 - nmse.
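A small sketch of that relationship, using normalisation by the variance of the observations as one common choice (the comment above writes mse/std; climpred's exact definition may differ, so treat this as illustrative):

```python
import numpy as np

obs = np.array([0.5, -1.0, 1.5, 0.0])
fct = np.array([0.4, -0.8, 1.2, 0.3])

mse = np.mean((fct - obs) ** 2)
nmse = mse / np.var(obs)   # normalised MSE; other conventions divide by std instead
msess = 1 - nmse           # MSE skill score relative to that normalisation
```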

@aschl

aschl commented Nov 29, 2023

Any objections to including these missing metrics in xskillscore? That seems to me a better place than having two packages with mixed functionality. All benchmarking-related scoring functions should be available in xskillscore IMO (and not climpred).
Re skill score (the topic of this issue): I was a bit surprised that the package (which is called xskillscore) does not include a function to calculate skill scores. It's a standard benchmarking metric that seems very suitable for this package!

@raybellwaves
Member

> Any objections in including these missing metrics in xskillscore? Seems to me the better place than having two packages with mixed functionalities. All benchmarking related scoring functions should be available in xskillscore IMO (and not climpred). Re skill score (the topic of this issue): I was a bit surprised that the package (which is called xskillscore) does not include a function to calculate skillscores. It's a standard benchmarking metric that seems to me very suitable for this package?!

No objections to additional metrics here that are missing and in climpred. Just make sure they give the same result

@aaronspring
Collaborator

I think we wanted to keep the API consistent for all metrics: metric(forecast, observation). For a skill score you'd need to pass a reference skill somehow. I'm for that and suggest having this API discussion first.
