You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
labs(x = "Date", y = "smoothed, day of week adjusted covid-like doctors visits") +
383
-
scale_color_viridis_c(option = "turbo", direction = -1) +
384
-
scale_fill_viridis_c(option = "turbo", direction = -1) +
394
+
scale_color_viridis_c(option = "viridis", direction = -1) +
395
+
scale_fill_viridis_c(option = "viridis", direction = -1) +
385
396
theme(legend.position = "none")
386
397
p2
387
398
```
@@ -391,17 +402,17 @@ p2
391
402
p1
392
403
```
393
404
394
-
The version faithful and un-faithful forecasts look moderately similar except for the 1 day horizons
395
-
(although neither approach produces amazingly accurate forecasts).
405
+
There are some weeks when the forecasts are somewhat similar, and others when they are wildly different, although neither approach produces amazingly accurate forecasts.
396
406
397
407
In the version faithful case for California, the March 2021 forecast (turquoise)
398
408
starts at a value just above 10, which is very well lined up with reported values leading up to that forecast.
399
409
The measured and forecasted trends are also concordant (both increasingly moderately fast).
400
410
401
411
Because the data for this time period was later adjusted down with a decreasing trend, the March 2021 forecast looks quite bad compared to finalized data.
402
-
403
412
The equivalent version un-faithful forecast starts at a value of 5, which is in line with the finalized data but would have been out of place compared to the version data.
404
413
414
+
The October 2021 forecast for the version faithful case floors out at zero, whereas the un-faithful is much closer to the finalized data.
Copy file name to clipboardExpand all lines: vignettes/custom_epiworkflows.Rmd
+3-3Lines changed: 3 additions & 3 deletions
Original file line number
Diff line number
Diff line change
@@ -347,7 +347,7 @@ There are many ways we could modify `four_week_ahead`. We might consider:
347
347
expect there to be a strong seasonal component to the outcome
348
348
- Scaling by a factor
349
349
350
-
We will demo a couple of these modifications below.
350
+
We will demonstrate a couple of these modifications below.
351
351
352
352
## Growth rate
353
353
@@ -456,9 +456,9 @@ result_plot
456
456
457
457
## Population scaling
458
458
459
-
Suppose we want to modify our predictions to apply to counts, rather than rates.
459
+
Suppose we want to modify our predictions to return a rate prediction, rather than the count prediction.
460
460
To do that, we can adjust _just_ the `frosting` to perform post-processing on our existing rates forecaster.
461
-
Since rates are calculated as counts per 100 000 people, we will convert back to counts by multiplying rates by the factor $\frac{regional \text{ } population}{100000}$.
461
+
Since rates are calculated as counts per 100 000 people, we will convert back to counts by multiplying rates by the factor $\frac{ \text{regional population} }{100,000}$.
Copy file name to clipboardExpand all lines: vignettes/epipredict.Rmd
+68-52Lines changed: 68 additions & 52 deletions
Original file line number
Diff line number
Diff line change
@@ -325,8 +325,18 @@ Instead, we'll use the fluview ILI dataset, which is weekly influenza like illne
325
325
We'll predict the 2023/24 season using all previous data, including 2020-2022, the two years where there was approximately no seasonal flu, forecasting from the start of the season, `2023-10-08`:
Estimating separate models for each geography is both 56 times slower[^7]than geo-pooling, and uses far less data for each estimate.
492
+
Estimating separate models for each geography uses far less data for each estimate than geo-pooling and is 56 times slower[^7].
482
493
If a dataset contains relatively few observations for each geography, fitting a geo-pooled model is likely to produce better, more stable results.
483
494
However, geo-pooling can only be used if values are comparable in meaning and scale across geographies or can be made comparable, for example by normalization.
484
495
@@ -488,7 +499,56 @@ workflow](custom_epiworkflows) with geography as a factor.
488
499
489
500
# Anatomy of a canned forecaster
490
501
491
-
This section describes the resulting object from `arx_forecaster()`, an `arx_fcast` object, along with a fairly minimal description of the actual mathematical model used for `arx_forecaster()`.
502
+
This section describes the resulting object from `arx_forecaster()`, a fairly minimal description of the mathematical model used, and a description of an `arx_fcast` object.
503
+
504
+
## Mathematical description
505
+
506
+
Let's look at the mathematical details of the model in more detail, using a minimal version of
0 commit comments