-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathPensionAge_anzjs.tex
434 lines (339 loc) · 42.1 KB
/
PensionAge_anzjs.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
% Options for packages loaded elsewhere
\PassOptionsToPackage{unicode}{hyperref}
\PassOptionsToPackage{hyphens}{url}
%
\documentclass[
doublespace]{anzsauth}
\usepackage{amsmath,amssymb}
\usepackage{lmodern}
\usepackage{ifxetex,ifluatex}
\ifnum 0\ifxetex 1\fi\ifluatex 1\fi=0 % if pdftex
\usepackage[T1]{fontenc}
\usepackage[utf8]{inputenc}
\usepackage{textcomp} % provide euro and other symbols
\else % if luatex or xetex
\usepackage{unicode-math}
\defaultfontfeatures{Scale=MatchLowercase}
\defaultfontfeatures[\rmfamily]{Ligatures=TeX,Scale=1}
\fi
% Use upquote if available, for straight quotes in verbatim environments
\IfFileExists{upquote.sty}{\usepackage{upquote}}{}
\IfFileExists{microtype.sty}{% use microtype if available
\usepackage[]{microtype}
\UseMicrotypeSet[protrusion]{basicmath} % disable protrusion for tt fonts
}{}
\makeatletter
\@ifundefined{KOMAClassName}{% if non-KOMA class
\IfFileExists{parskip.sty}{%
\usepackage{parskip}
}{% else
\setlength{\parindent}{0pt}
\setlength{\parskip}{6pt plus 2pt minus 1pt}}
}{% if KOMA class
\KOMAoptions{parskip=half}}
\makeatother
\usepackage{xcolor}
\IfFileExists{xurl.sty}{\usepackage{xurl}}{} % add URL line breaks if available
\IfFileExists{bookmark.sty}{\usepackage{bookmark}}{\usepackage{hyperref}}
\hypersetup{
pdftitle={Forecasting the old-age dependency ratio to determine a sustainable pension age},
pdfauthor={Rob J Hyndman\^{}\{1*\}, Yijun Zeng\^{}2, Han Lin Shang\^{}3},
hidelinks,
pdfcreator={LaTeX via pandoc}}
\urlstyle{same} % disable monospaced font for URLs
\usepackage{longtable,booktabs,array}
\usepackage{calc} % for calculating minipage widths
% Correct order of tables after \paragraph or \subparagraph
\usepackage{etoolbox}
\makeatletter
\patchcmd\longtable{\par}{\if@noskipsec\mbox{}\fi\par}{}{}
\makeatother
% Allow footnotes in longtable head/foot
\IfFileExists{footnotehyper.sty}{\usepackage{footnotehyper}}{\usepackage{footnote}}
\makesavenoteenv{longtable}
\usepackage{graphicx}
\makeatletter
\def\maxwidth{\ifdim\Gin@nat@width>\linewidth\linewidth\else\Gin@nat@width\fi}
\def\maxheight{\ifdim\Gin@nat@height>\textheight\textheight\else\Gin@nat@height\fi}
\makeatother
% Scale images if necessary, so that they will not overflow the page
% margins by default, and it is still possible to overwrite the defaults
% using explicit options in \includegraphics[width, height, ...]{}
\setkeys{Gin}{width=\maxwidth,height=\maxheight,keepaspectratio}
% Set default figure placement to htbp
\makeatletter
\def\fps@figure{htbp}
\makeatother
\setlength{\emergencystretch}{3em} % prevent overfull lines
\providecommand{\tightlist}{%
\setlength{\itemsep}{0pt}\setlength{\parskip}{0pt}}
\setcounter{secnumdepth}{5}
%% Any special functions or other packages can be loaded here.
\usepackage[T1]{fontenc}
\usepackage[utf8]{inputenc}
\usepackage{float, bm}
\usepackage{moreverb}
\usepackage{url}
\usepackage{grffile}
\usepackage{lineno}
\usepackage[UKenglish]{isodate}
\usepackage{hyperref}
\usepackage{mathtools}
\mathtoolsset{showonlyrefs}
\let\origfigure\figure
\let\endorigfigure\endfigure
\renewenvironment{figure}[1][2] {
\expandafter\origfigure\expandafter[tbp]
} {
\endorigfigure
}
\let\origtable\table
\let\endorigtable\endtable
\renewenvironment{table}[1][2] {
\expandafter\origtable\expandafter[tbp]
} {
\endorigtable
}
\runningheads{Forecasting the OADR to determine a sustainable pension age}{HYNDMAN, ZENG, SHANG}
\def\volumeyear{2020}
\let\oldmaketitle\maketitle
\def\maketitle{\cleanlookdateon\oldmaketitle}
\linenumbers
\usepackage{titling}
\pretitle{\begin{center}\Large\bfseries}
\posttitle{\end{center}\vspace{0.5cm}}
\preauthor{\begin{center}\large}
\postauthor{\end{center}\begin{center}
$^1$\emph{Department of Econometrics \& Business Statistics, Monash University, Clayton, Australia}\\
$^2$\emph{Shanghai, China}\\
$^3$\emph{Macquarie University, Sydney, Australia}
\footnotetext{$^*$Corresponding author. Email: \texttt{[email protected]}. This work is supported by the ARC Centre of Excellence for Mathematical and Statistical Frontiers.}\end{center}}
\usepackage{mathptmx}
\raggedbottom
\ifluatex
\usepackage{selnolig} % disable illegal ligatures
\fi
\usepackage[]{natbib}
\bibliographystyle{anzsj}
\title{Forecasting the old-age dependency ratio to determine a sustainable pension age}
\author{Rob J Hyndman\(^{1*}\), Yijun Zeng\(^2\), Han Lin Shang\(^3\)}
\date{}
\begin{document}
\maketitle
\hypertarget{summary}{%
\section*{Summary}\label{summary}}
\addcontentsline{toc}{section}{Summary}
We forecast the old-age dependency ratio for Australia under various pension age proposals, and estimate a pension age scheme that will provide a stable old-age dependency ratio at a specified level. Our approach involves a stochastic population forecasting method based on coherent functional data models for mortality, fertility and net migration, which we use to simulate the future age-structure of the population. Our results suggest that the Australian pension age should be increased to 68 by 2030, 69 by 2036, and 70 by 2050, in order to maintain the old-age dependency ratio at 23\%, just above the 2018 level. Our general approach can easily be extended to other target levels of the old-age dependency ratio and to other countries.
\textbf{Keywords:} coherent forecasts, demographic components, functional time series, pension age.
\hypertarget{introduction}{%
\section{Introduction}\label{introduction}}
Australia's ageing population is a result of three factors: 1) significant technological advances in medical care leading to longer life spans; 2) high fertility rates after World War II resulting in the `baby-boomer' generation; and 3) a large decline in fertility rates over the past thirty years \citep{Fehr2008}. Consequently, there are many older people leaving the workforce, and fewer workers replacing them. Many other countries face similar demographic challenges. One way of measuring this problem is via the old-age dependency ratio (OADR): the number of people post-retirement to the number of people of working age \citep{Walker1982}. The OADR has been increasing for many decades, but recently the rate of increase has jumped markedly, and we show that it will continue to increase at a rapid rate if there are no age-related policy changes.
This is consistent with the Australian Intergenerational Reports \citep{IGR02, IGR07, Swan2010} that explored the extent to which existing age-related policies would affect the burden of future working generations. To date, the Australian government has addressed the problem through a variety of approaches including modifications to compulsory superannuation policies, instigating work incentives for older people, and increasing the pension age (defined as the age at which eligible residents can access the age pension for the first time). In 2009, the federal government announced an increase in pension age from 65 to 67 by 2023 \citep{Nielson2010}. In May 2014, the Federal Treasurer proposed a further increase in the pension age to 70 by 2035, in line with the recommendations from the \citet{PC13}. We investigate the effect of these changes in the pension age on the forecasts of the OADR.
The OADR can be expressed as
\begin{equation}
\text{OADR} = \frac{\text{number of people aged over pension age}}{\text{number of people aged 15 to pension age}}*100\%
\end{equation}
We use the OADR as a measure of the financial burden on the Australian workforce because a rise in the OADR will reduce the ability of working-age population to finance pensions and health costs \citep{ARW07, IGR02, IGR07}.
We provide forecasts of future population age structures over the next 50 years, and use those forecasts to compute the OADR that would result under various pension age schemes.
We then invert the analysis by finding a pension age scheme that would lead to a specified OADR value. For our analysis, the specified OADR value has been set at 23\%, just above the level of OADR in 2018. Whether this OADR level is the most appropriate level for Australia remains to be examined and is not the focus of this paper. Instead, we provide a methodology for policymakers to find a target pension age that leads to a given OADR value.
Forecasts of population structure depend on forecasts of age- and sex-specific mortality rates, fertility rates and net migration. To this end, we combine the functional demographic models of \citet{HU07} and the product-ratio method of \citet{HBY13} within the general stochastic population forecasting framework of \citet{HB08}, to forecast a population age structure. These forecasts presuppose that the historical trends of past years will continue. Historical data of those rates have thus been used as input to simulate future paths of age- and sex-specific population structure.
Some may argue that using OADR as a measure of a financial burden for the next generation could be biased, with a large number of retirees able to rely on self-financed retirement income, such as superannuation. However, approximately two-thirds of Australians over pension age rely fully or partly on the age pension \citep{AIHW19}, and this figure has remained relatively constant over the last 20 years. Therefore, the proportion of people over working age is still a good representation of the financial burden on future working generations. Nevertheless, using OADR as a single measurement of financial burden may overlook some important economic factors, and \citet{HY12} have suggested the use of an adjusted OADR that takes account of some of these factors in analyzing pension finance. We do not consider these here. Our approach is intended to illustrate how population forecasting can be used to explore the pension age issue in a simple context, and it could be extended to take into account other variables if required.
The remainder of this paper proceeds as follows. Section \ref{sec:litreview} reviews key literature on population forecasting and the pension age in Australia. Section \ref{sec:results} describes the data and modelling framework we use in this study. We show how our framework can be used to estimate a pension age scheme that leads to a target OADR in Section \ref{sec:target}. Finally, Section \ref{sec:conclusions} gives concluding remarks as well as identifies the scope for future research.
\hypertarget{sec:litreview}{%
\section{Background}\label{sec:litreview}}
\hypertarget{the-age-pension-in-australia}{%
\subsection{The age pension in Australia}\label{the-age-pension-in-australia}}
The age pension in Australia provides income support and access to a range of concessions and additional allowances for older Australians who have passed a means test.
In 2018, 67\% of Australian residents aged 65 and over received income support payments \citep{AIHW19}. Even though the government has promoted self-financed retirement by introducing the Superannuation Guarantee in 1992, the Superannuation Guarantee accumulations are still modest, with many retirees having no superannuation or very modest balances \citep{data61}.
The ageing trend in Australia, along with retirees' dependence on the age pension, means the cost of the age pension is currently around at 2.7\% of Gross Domestic Product. With an ageing population, the pension age will need to increase or the fiscal burden will become unsustainable.
\begin{figure}
\centering
\includegraphics{PensionAge_anzjs_files/figure-latex/changespension-1.pdf}
\caption{\label{fig:changespension}Australian pension age policies.}
\end{figure}
The pension age for males was set at 65 for over a century. While the pension age for females was 60 years until 1994, it has steadily increased to 65 years in 2014, in line with that of males. From July 2017, pension eligibility ages for both males and females have been increasing by six months every two years and will reach 67 in July 2023. The \citet{PC13} proposed further lifting of the pension age to 70 on the grounds that increasing the pension age to 70 would save taxpayers \$150 billion in welfare costs and health spending, and older people are capable of continuing work. The former Treasurer Joe Hockey announced that the government would increase the pension age to 70 by 2035, and this was proposed in the Federal Budget Statement for 2014--2015, although it has never been legislated by the Australian Federal Government. Figure \ref{fig:changespension} illustrates the process of the adjustments on pension age scheme.
The government has not disclosed the method it used to determine the new pension age. \citet[p7]{CEDA07} proposed directly linking pension age with life expectancy in the way that `pension age should increase by approximately 50\% of any increase in life expectancy'. However, using a `rule of thumb' approach might not be ideal, as 50\% of the change in life expectancy may not be a good adjustment to pension age in the long term. We propose that the pension age should be linked to the old age dependency ratio, and based on forecasts of future mortality and fertility rates. Our approach has the advantage that it can be used to justify the change in pension age and directly links the pension age with the goal of a stable financial burden for the public.
\hypertarget{demographic-growth-balance-equations}{%
\subsection{Demographic growth-balance equations}\label{demographic-growth-balance-equations}}
The data required for our proposal are age- and sex-specific birth and death numbers of each calendar year, age- and sex-specific population members on 1~January of each year, and age- and sex-specific exposures to risk (i.e., the population of age \(x\) at 30~June) for each year. Let
\begin{align*}
B_t(x) &= \text{Births in calendar year $t$ to females of age $x$},\\
D_t(x) &= \text{Deaths in calendar year $t$ of persons of age $x$},\\
P_t(x) &= \text{Population of age $x$ at 1 January of year $t$},\\
E_t(x) &= \text{Population of age $x$ exposed to risk at 30 June of year $t$},
\end{align*}
where \(x=0,1,2,\dots,p-1,p^+\) denotes age with \(p^+\) the open-ended upper age group, set to 100\(^+\) in this study. The year is denoted by \(t=1,2,\dots,T\), and superscripts \(M\) and \(F\) denote male and female respectively.
We can obtain historical mortality rates \(m_t(x)\) (also called central death rates) and fertility rates \(f_t(x)\) as follows:
\begin{align*}
m_t(x) & = \frac{D_t(x)}{E_t(x)}=\text{age-sex-specific central death rates in calendar year $t$};\\
f_t(x) & = \frac{B_t(x)}{E^F_t(x)}=\text{age-specific fertility rates in calendar year $t$}.
\end{align*}
The net migration is the difference between immigration and emigration in a certain area during a specified time frame. We denote net migration by \(G\), and estimate it using demographic growth-balance equations, which summarise the relationship between population change and three factors: net migration, births and deaths. The equations are expressed as:
\begin{align*}
G_t(x,x+1) & =P_{t+1}(x+1)-P_t(x)+D_t(x,x+1)\qquad\text{for $x=0,1,2,\dots,p-2$},\\
G_t(p-1^+,p^+) & =P_{t+1}(p^+)-P_t(p^+)-P_t(p-1)+D_t(p-1^+,p^+),\\
G_t(B,0) & =P_{t+1}(0)-B_t+D_t(B,0),
\end{align*}
where \(G_t(x,x+1)\) refers to net migration in calendar year \(t\) of persons aged \(x\) at the beginning of year \(t\), \(G_t(p-1^+,p^+)\) refers to migration in calendar year \(t\) of persons aged \(p-1\) and older at the beginning of year \(t\), and \(G_t(B,0)\) refers to migration in calendar year \(t\) of persons born during year \(t\); and similarly for deaths, \(D_t(x,x+1)\), \(D_t(p-1^+,p^+)\), and \(D_t(B,0)\). The deaths are estimated using the standard life table approach of population projection \citep{Preston2000}. It should be noted that the estimated net migration includes errors in data recording.
\hypertarget{old-age-dependency-ratio}{%
\subsection{Old-Age Dependency Ratio}\label{old-age-dependency-ratio}}
Using this notation, the OADR in year \(t\) is given by
\begin{equation}\label{eq:oadr}
O_t = \frac{\displaystyle\sum_{x\ge \lfloor a_t\rfloor} P_t(x) - r_t P_t(\lfloor a_t\rfloor)}{\displaystyle\sum_{x=15}^{\lfloor a_t\rfloor-1} P_t(x) + r_t P_t(\lfloor a_t\rfloor)},
\end{equation}
where \(a_t\) is the pension age in year \(t\) and \(r_t = a_t - \lfloor a_t \rfloor\) is the non-integer part of \(a_t\). This assumes that the pension age can only be adjusted at the beginning of a year, and that the birthday of the population is uniformly distributed over the year.
\hypertarget{forecasting-demographic-components}{%
\subsection{Forecasting demographic components}\label{forecasting-demographic-components}}
Variants of the Lee--Carter method \citep{LC92} are frequently used for demographic forecasting \citep[see, e.g.,][]{SBH11}. The most recent forecast of the population in Australia by the \citet{PC13} was based on basic Lee--Carter methods plus some judgement-based adaptations to address the characteristics of demographic trends that the Lee--Carter method failed to capture. The Hyndman--Ullah method (2007) is a generalisation of the Lee--Carter method and is used as the basis for forecasting demographic components in this study. In addition, the product-ratio method \citep{HBY13} is used to eliminate the potential long-term divergence between female and male mortality rates and net-migration rates.
\hypertarget{leecarter-method}{%
\subsubsection*{Lee--Carter method}\label{leecarter-method}}
\addcontentsline{toc}{subsubsection}{Lee--Carter method}
The Lee--Carter (LC) method has been widely used to forecast mortality rates across various countries, including Australia. The model proposed by \citet{LC92} (but using our notation) is
\begin{equation}\label{eq:lc}
\log m_{t}(x)=\mu(x)+\phi(x)\beta_{t}+\varepsilon_{t,x},
\end{equation}
where \(m_{t}(x)\) is the central death rate for age \(x\) in year \(t\). This model involves fitting a matrix of log central death rates with sets of age-specific constants, \{\(\mu(x)\)\} and \{\(\phi(x)\)\}, and a time-varying index \(\beta_{t}\). The notation \(\phi(x)\) denotes a set of parameters that tell us the rate at which each age responds to changes in \(\beta_{t}\). \(\beta_{t}\) measures the general level of the log death rates; \(\mu(x)\) is the general pattern across the age of the log mortality rate and \(\varepsilon_{t,x}\) is an error term with zero mean and constant variance, reflecting the randomness that can't be captured by the model.
The model is over-parameterised. Therefore, the constraints
\begin{equation}
\sum_{t=1}^{T}\beta_t=0, \qquad\text{and}\qquad \int_x \phi(x)=1
\end{equation}
are imposed, and a singular value decomposition is used to find the least squares solution of \eqref{eq:lc} \citep{Good1969}. The estimates of \(\phi(x)\) and \(\beta_t\) are therefore the first principal component and scores of \(\bm{X}\) with \((i,t)\)th element \([\log(m_{t}(x))-\widehat\mu(x)]\), where \(\widehat\mu(x)\) is the sample mean of \(\log(m_{t}(x))\) over \(t\). The log transformation introduces some bias, so the LC method adjusts \(\beta_t\) such that for each year, the implied number of deaths will be equal to the actual number of deaths. The adjusted \(\beta_t\) is then extrapolated using a random walk with drift model.
\hypertarget{hyndmanullah-method}{%
\subsubsection*{\texorpdfstring{Hyndman--Ullah method \label{sec:HU}}{Hyndman--Ullah method }}\label{hyndmanullah-method}}
\addcontentsline{toc}{subsubsection}{Hyndman--Ullah method \label{sec:HU}}
\citet{HU07} proposed a nonparametric method for demographic modelling and forecasting that extended the LC model in four ways:
\begin{enumerate}
\def\labelenumi{\arabic{enumi}.}
\tightlist
\item
The demographic rates are smoothed before modeling.
\item
It uses functional principal components analysis, which is a continuous version of PCA.
\item
It uses more than one principal component, which addresses the main weakness of the LC method.
\item
The forecasting models for the principal component scores are more complex than the random walk with drift model used in the LC model.
\end{enumerate}
The Hyndman--Ullah (HU) model can be written as
\begin{align}
y_t(x) & = s_t(x) + \sigma_t(x)\varepsilon_{t,x}\label{eq:HU_1},\\
s_t(x) & = \mu(x) + \sum_{j=1}^{J}\beta_{t,j}\phi_j(x) + e_t(x)\label{eq:HU_2},
\end{align}
where \(y_t(x)\) may be log mortality, log fertility or net migration \citep{HB08} in year \(t\) and age \(x\). Here \(s_t(x)\) is a smoothed version of the observed data \(y_t(x)\), smoothed over age but not time, and estimated by constrained weighted penalised regression splines; and \(\{\varepsilon_{t,x}\}\) are independent and identically distributed standard normal error terms. Equation \eqref{eq:HU_2} has a similar structure to the LC method, except that it is applied to smoothed data and incorporates more than one principal component and score. The \(e_t(x)\) is the model error having mean zero and no serial correlation, while \(\mu(x)\) is estimated as the average of \(s_t(x)\) across years. Estimates of \(\phi_j(x)\) and \(\beta_{t,j}\) are the \(j\)th principal components and scores of \(s_t(x)-\mu(x)\) respectively. Following \citet{HB08}, we select \(J=6\), which we have found to be large enough for demographic applications.
By conditioning on the observed data \(\bm{I}=\left\{{y_1(x),\dots,y_n(x)}\right\}\) and the set of functional principal components \(\bm{B}=\left\{{\phi_1(x),\phi_2(x),\dots,\phi_j(x)}\right\}\), the \(h\)-step-ahead forecast of \(y_{T+h}(x)\) is given by
\begin{equation}
\widehat{y}_{T+h\mid T}(x)=\widehat{s}_{T+h\mid T}(x) = \text{E}[\widehat{s}_{T+h\mid T}(x)\mid \bm{I, B}] = \widehat{\mu}(x)+\sum_{j=1}^{J}\phi_j(x)\widehat{\beta}_{T+h\mid T,j},
\end{equation}
where \(\widehat{\beta}_{T+h\mid T,j}\) denotes the \(h\)-step-ahead forecast of \(\beta_{T+h\mid T,j}\). Because \(\left\{\beta_{t,j}\right\}\) is time-varying, it controls the dynamics of the process. Each component series \(\left\{\beta_{t,j}\right\}\) is modelled with an ARIMA\((p_j,d_j,q_j)\) process, where \(p_j,d_j,q_j\) are selected using the automated algorithm of \citet{forecastpackage}, and the parameters are estimated using maximum likelihood estimation.
\hypertarget{product-ratio-method}{%
\subsubsection*{Product-ratio method}\label{product-ratio-method}}
\addcontentsline{toc}{subsubsection}{Product-ratio method}
When we apply the \citet{HU07} model to data on males and females, we need to constrain the resulting forecasts so they do not diverge. In the product-ratio method \citep{HBY13}, the HU model is applied to the geometric mean of male and female log mortality data, and the ratio of male and female log mortality data, rather than to each series independently. That is, define
\begin{align}
p_t(x)&=\sqrt{s_{t}^{\text{M}}(x)s_{t}^{\text{F}}(x)}, \label{eq:coherent_1}\\
r_t(x)&=\sqrt{\frac{s_{t}^{\text{M}}(x)}{s_{t}^{\text{F}}(x)}}. \label{eq:coherent_2}
\end{align}
Then, the HU method is applied to these two new variables:
\begin{align*}
\log[p_t(x)]=\mu_p(x)+\sum_{j=1}^{J}\beta_{t,j}\phi_j(x)+e_t(x),\\
\log[r_t(x)]=\mu_r(x)+\sum_{l=1}^{L}\gamma_{t,l}\psi_l(x)+z_t(x),
\end{align*}
where time series \(\left\{\beta_{t,j}\right\}\) are fitted with ARIMA models, while \(\left\{\gamma_{t,l}\right\}\) are fitted by stationary ARFIMA models (i.e., the order of differencing is constrained to \(0< d < 1/2\)). It is the stationary constraint on \(\gamma_{t,l}\) that ensures the forecasts of male and female do not diverge. Then, forecasts of sex-specific data are obtained using
\begin{align*}
m_{T+h\mid T}^{\text{M}}(x) & = p_{T+h\mid T}(x)r_{T+h\mid T}(x),\\
m_{T+h\mid T}^{\text{F}}(x) & = p_{T+h\mid T}(x)/r_{T+h\mid T}(x).
\end{align*}
The method for forecasting net migration by sex is similar to the above method used on mortality rates except no log scale is applied to net migration, and a sum and difference is used rather than a product and ratio.
\hypertarget{forecasting-oadr}{%
\subsection{Forecasting OADR}\label{forecasting-oadr}}
Following the general procedure proposed in \citet{HB08}, we model each of the five components of population change: female mortality rates, male mortality rates, female net migration, male net migration and fertility rates, and simulate these five components into the future. We use the HU method applied to log-fertility, and the product-ratio method applied to log-mortality and net-migration. We used weighted penalised splines \citep{HU07} when estimating the log-fertility and log-mortality curves, where the weights account for Poisson variation in deaths and births. We use loess when estimating the net-migration curves.
From these models, sample paths are simulated for age-specific fertility rates, age- and sex-specific mortality rates and age- and sex-specific net migration. Then future population values are simulated using the demographic growth-balance equations. The future deaths and births in this equation are assumed to follow a Poisson distribution, with parameters as a function of future mortality and fertility rates. We bootstrap the residuals of the net-migration model to simulate future net-migration numbers (see \citet{HB08} for the details). Hence, we can simulate \(h\)-step-ahead age- and sex-specific population numbers based on simulated births, deaths and net-migrants.
This simulation was repeated for years \(t=T+1,\dots,T+h\) to obtain a sample path of the age-specific population. Many simulated population paths were obtained in this way. For each simulated population path, and each proposed set of pension age changes, we can simulate future OADR values using \eqref{eq:oadr}.
There are three types of randomness in the simulation: (1) randomness from the second equation in the Hyndman--Ullah specification (i.e., \(e_t(x)\) in \eqref{eq:HU_1}); (2) randomness from the ARIMA model fitted to principal component scores; and (3) randomness from the Poisson distribution for births and deaths, or the bootstrap for migrants.
\hypertarget{sec:results}{%
\section{Application to Australia}\label{sec:results}}
\hypertarget{data}{%
\subsection{Data}\label{data}}
Historical central death rates and start-year and mid-year populations of residents in Australia by sex and age in single years for age group 0--99 and 100+ have been sourced from the \citet{HMD} for 1921--2018. The start-year population for 2019 is also used to estimate net migration. Data for age-specific annual fertility rates by single years of age for 15--49 over the period 1950--2018 was obtained from the \citet{ABSBirths}. Even though annual mortality rates are available from 1921 to 2018, only data collected after 1950 is used for two reasons. First, the period between 1921--1950 has different mortality patterns from later years because of wars \citep{HB08}. Since the model for this study is valid under the assumption that extrapolative methods can be used, it seemed best to delete the less relevant data, which might influence estimations of parameters if we include them. Second, fertility rates are only available from 1950, so the mortality rates were also truncated to the same period.
\hypertarget{component-forecasts}{%
\subsection{Component forecasts}\label{component-forecasts}}
We illustrate the component model forecasts using fertility rates. Figure \ref{fig:componentsfertility} shows the estimated mean age pattern \(\widehat{\mu}(x)\), the first two basis functions, \(\widehat{\phi}_1(x)\) and \(\widehat{\phi}_2(x)\), and the first two principal component scores, \(\widehat{\beta}_{t,1}\) and \(\widehat{\beta}_{t,2}\)) for log fertility rates. The first term, \(\widehat{\phi}_1(x) \widehat{\beta}_{t,1}\), accounts for 63.7\% of the variation in the log fertility rates. The first principal component scores \(\widehat{\beta}_{t,1}\), shown in black for 1950--2018, display a sharp increase between 1950--1960, followed by a rapid decline in fertility rates from 1961. This increase of fertility is linked to the post-World War II baby boom, while the dramatic decrease from 1960 was due to the availability of contraception pills. A noticeable further reduction of fertility rates in the late 1960s may be due to a change in the abortion laws in some states.
\begin{figure}
\centering
\includegraphics{PensionAge_anzjs_files/figure-latex/componentsfertility-1.pdf}
\caption{\label{fig:componentsfertility}Fitted basis functions and coefficients for Australian age-specific fertility rates.}
\end{figure}
The first basis function \(\widehat{\phi}_1(x)\) measures the different effects of the time-trend across ages. It indicates the general pattern of fertility has little effect on females of age 15, most likely because such pregnancies are almost always accidental and usually prior to the mother taking contraception. The two `bumps' in the first basic function suggests that the historical trend has a distinct impact on females aged around their 20s and 40s. The second basis function, accounting for 31.5\% of the variation, largely concerns mothers aged over 30. The forecasts of future coefficients are shown in the lower panels with their 80\% pointwise prediction intervals shown as shaded regions. These forecasts are obtained using automatically selected ARIMA models. In practice, we use six basis functions, but the four not shown have a much smaller effect on the resulting fertility forecasts, only account for 4.6\% of the variation, and are much harder to interpret.
Similar models are used for the product and ratio of mortality rates, and sum and difference of net migration rates.
\hypertarget{population-forecasts}{%
\subsection{Population forecasts}\label{population-forecasts}}
Figure \ref{fig:totalpopulationbyyears} shows the forecast of the total population by sex over the prediction horizon, while Figure \ref{fig:totalpopulationin2038} shows the 20-year forecast of population age structure in 2038. Figure \ref{fig:totalpopulationbyyears} indicates that both the female and male population are expected to increase to almost 20 million by 2050.
\begin{figure}
\centering
\includegraphics{PensionAge_anzjs_files/figure-latex/totalpopulationbyyears-1.pdf}
\caption{\label{fig:totalpopulationbyyears}Fifty-year forecasts of the total population for each sex, along with 80\% prediction intervals.}
\end{figure}
\begin{figure}
\centering
\includegraphics{PensionAge_anzjs_files/figure-latex/totalpopulationin2038-1.pdf}
\caption{\label{fig:totalpopulationin2038}Forecast population pyramid for 2038, along with 80\% pointwise prediction intervals. The actual pyramid for 2019 is shown using dashed lines.}
\end{figure}
Figure \ref{fig:totalpopulationin2038} shows the mean and the 80\% pointwise prediction interval of simulated population paths for the year 2038 (a 20-year forecast), along with the 2019 base population. The uncertainty of the population below aged 20 is largely due to higher forecast variation of fertility rates compared to other components.
Historical OADR values are shown in black in Figure \ref{fig:OADRcurrent} for 1921--2019. The strong positive trend indicates an increasing tax burden on workers, consistent with the prediction of the intergenerational reports in 2002 \citep{IGR02} and 2007 \citep{IGR07}. The increased trend from 2010 is due to the retirements of `Baby Boomers', born after WW2. We also show forecasts for three pension schemes: (1) a fixed pension age of 65; (2) the current scheme with the pension age rising to age 67 in 2023; and (3) the proposed pension policy under which the pension ages continues to rise by 6 months every two years until it reaches 70 in 2035.
\begin{figure}
\centering
\includegraphics{PensionAge_anzjs_files/figure-latex/OADRcurrent-1.pdf}
\caption{\label{fig:OADRcurrent}Fifty-year forecasts of the OADR associated with a pension age equal to 65, the current approved pension age scheme, and the proposed scheme with the pension age rising to 70 in 2035. The actual OADRs for 1921--2019 are also shown.}
\end{figure}
Once the pension age reaches 67, the OADR under the current policy grows at nearly the same rate as the OADR of a pension age equal to 65. This is consistent with the idea that there is a persistent positive trend in OADR associated with a constant pension age, due to low fertility rates and better health care. Even the proposal to raise the pension age incrementally to 70 by 2035 leads to a similar increasing trend beyond 2035. This proposal also leads to a slow decline in the OADR over the next 15 years. Figure \ref{fig:OADRcurrent} demonstrates that the pension age scheme requires constant review to ensure a stable OADR over the long term.
\hypertarget{sec:target}{%
\section{Sustainable pension age schemes}\label{sec:target}}
Our aim is to determine a pension age scheme that would lead to the OADR being below a desired threshold, thus imposing a sustainability constraint.
Let \(a_t\) denote the pension age in year \(t\), and we denote a pension age scheme over the prediction horizon by \(\bm{P}=[a_{T+1},\dots,a_{T+H}]^\top\), with the corresponding OADR values given by \(O_{T+1},\dots,O_{T+H}\). Our goal is to find the minimum pension age in each year that gives an OADR lower than the desired OADR threshold. We impose the constraint \(0 \le a_t - a_{t-1} < 1\) to prevent two undesirable situations: (a) years where no-one is able to retire; and (b) years where retired people become ineligible for the pension. We also assume that the adjustment unit of the pension age is one month, for easier implementation.
\hypertarget{finding-the-target-pension-age-scheme}{%
\subsection{Finding the target pension age scheme}\label{finding-the-target-pension-age-scheme}}
Let \(O^*\) denote the desired OADR threshold. Because we do not observe the population in future years, we must estimate \(O_{T+h}\) from the simulated populations. Then we compute the pension age scheme \(\bm{P}\) using the following algorithm. Starting with \(h=1\):
\begin{enumerate}
\def\labelenumi{\arabic{enumi}.}
\tightlist
\item
Set \(a_{T+h}=a_{T+h-1}\).
\item
Increment \(a_{T+h}\) by one-month intervals until either \(\widehat{O}_{T+h|T} \le O^*\) or \(a_{T+h} - a_{T+h-1}=12\) months, where \(\widehat{O}_{T+h|T}\) denotes the mean of the simulated \(O_{T+h|T}\) values.
\end{enumerate}
We repeat these steps for \(h=2,\dots,H\). The second limit in Step 2 arises because the pension age should not jump more than 12 months between years, to prevent pension-age people losing their pension.
In addition to finding the target pension age based on the mean value of OADR, we also find `plausible' pension age schemes that could give the desired OADR level. Specifically, we find a range of pension age schemes for which \(O^*\) is contained within the 80\% prediction intervals of the simulated \(O_{T+H|T}\) values. Because of the monotonic relationship between pension age and OADR, we need only to find the upper (lower) boundary of plausible pension age schemes, at which the upper (lower) limit of 80\% prediction intervals of OADR is equal to the desired OADR level. Hence, any pension age scheme within those two boundaries is plausible. The range of plausible pension age schemes can be thought of as an 80\% confidence interval of pension age schemes where we are 80\% sure that they will give the desired outcome.
The algorithm to find the upper and lower boundary of plausible pension age schemes is the same as above, but the mean of \(O_{T+H|T}\) is replaced by the 10\% and 90\% quantiles.
Figure \ref{fig:comparisonofpensionages} shows the pension scheme produced by this algorithm, along with 80\% confidence intervals. In comparison, the previous, current and proposed schemes are also shown.
\begin{figure}
\centering
\includegraphics{PensionAge_anzjs_files/figure-latex/comparisonofpensionages-1.pdf}
\caption{\label{fig:comparisonofpensionages}The target pension age scheme along with the pension age schemes that have been set and proposed by the government. The shaded region shows 80\% confidence intervals for the target pension age.}
\end{figure}
\begin{figure}
\centering
\includegraphics{PensionAge_anzjs_files/figure-latex/OADRwithoptimal-1.pdf}
\caption{\label{fig:OADRwithoptimal}The mean forecast and 80\% prediction intervals for the OADR associated with the target pension age scheme, along with the desired OADR level at 0.23.}
\end{figure}
\begin{figure}
\centering
\includegraphics{PensionAge_anzjs_files/figure-latex/OADRboundaries-1.pdf}
\caption{\label{fig:OADRboundaries}The 80\% prediction intervals of OADR associated with the upper and lower boundary of plausible pension age schemes, along with desired OADR level at 0.23.}
\end{figure}
Figure \ref{fig:OADRwithoptimal} shows the OADR forecasts as a result of using the target age pension scheme shown in Figure \ref{fig:comparisonofpensionages}. The target level of 23\% and the mean forecast of the OADR corresponding to the target pension age almost overlap. Figure \ref{fig:OADRboundaries} shows the OADR forecasts corresponding to the upper and lower limits of the 80\% confidence intervals shown in Figure \ref{fig:comparisonofpensionages}, demonstrating that they correspond to having OADR forecast prediction interval boundaries that lie very close to the target OADR value.
Thus any pension age higher than the upper boundary shown in Figure \ref{fig:comparisonofpensionages} will give an OADR value that is significantly lower than the target value, while any pension age scheme lower than the lower boundary of the pension age reported in Figure \ref{fig:comparisonofpensionages} will give an OADR value that is significantly higher than the target value.
Figure \ref{fig:comparisonofpensionages} also compares the target pension age scheme and its 80\% confidence interval with the pension age scheme that has been proposed but not yet enacted. Clearly, it is outside the 80\% confidence interval and so is too high for our targeted OADR value of 23\%. Under our proposed scheme, the Australian pension age should be increased to 68 by 2030, 69 by 2036, and 70 by 2050, in order to maintain the old-age dependency ratio at 23\%.
\hypertarget{sec:conclusions}{%
\section{Conclusions}\label{sec:conclusions}}
The ageing of the population is an increasingly critical issue in Australia (and many other countries) due to the increase in life expectancy and the retirement of baby boomers. The fiscal cost of the age pension has increased as a result, and is likely to continue to be a financial burden unless there are changes in age pension schemes. Under current Australian government policy, the pension age will rise to 67 in 2023, and there is a proposal to further increase it to 70 by 2035. Our analysis suggests that this increase is too rapid unless the aim is to reduce the OADR to below 2018 values.
In this paper, we have suggested a statistical and demographic approach to address the issue. We attempt to quantify the effects of changes in the pension age on the OADR, which we take as a crude measure of the financial burden of the age pension. We have applied stochastic models to forecast the future population age structure and then used those future age structure projections to find the most appropriate pension age scheme that will ensure a stable OADR around the desired level. To forecast the population age structure, three demographic components of the population were modeled separately using functional time series methods \citep{HU07}, along with an extension of the product-ratio method \citep{HBY13}. Then the framework of \citet{HB08} was used to simulate future populations by age and sex. These were then used to simulate OADR values associated with different pension age schemes. Finally, we inverted the process to find a pension age scheme that leads to a specified OADR value.
We illustrate the approach based on assuming the desired level of OADR is around the 2018 level of 23\%. Using historical data from 1950 to 2019, the forecasts of future populations suggests a substantial increase in total population, and confirms the inherent ageing trend. It was found that pension age should not be fixed, but instead should grow at a rate that will ensure a stable OADR at a level which can be affordable to Australians of working age.
The empirical analysis presented in this paper shows the adjustment of the pension age to age 67 has been effective in stopping the increasing trend in the OADR values. However, a one-time adjustment is not sufficient to ensure a stable OADR in the long term, due to ongoing changes in the population age structure. Therefore, we recommend consistently reviewing the pension age according to the proposed statistical approach. This approach has been used to calculate the target pension age from now until 2050 along with its 80\% confidence interval. The target pension age found in this research increases to 67 by 2023, 68 by 2030, 69 by 2036 and 70 by 2050. This generates a stable OADR of around 23\%.
This research addresses the pension age issue from a purely demographic perspective. Further analysis should include an economic perspective as well including the impact of the superannuation guarantee and other economic measures to offset the fiscal burden of the age pension. There are several issues to be considered here. First, not all people aged above the pension age will receive the pension, and the ratio of those receiving the pension may decrease as more retiring people can access superannuation. Second, labour force participation is probably related to age as self-funded retirees may opt out of paid employment before they reach pension age. Hence, raising the pension age may not have as large an effect on OADR as we have estimated. Third, we have not considered the effect of changing per capita GDP which helps fund the pension. If per capita GDP declines, the fiscal burden of the pension may increase even with a stable OADR.
We have also not allowed for the impact of COVID-19 in this analysis as it is too early to properly estimate its impact on mortality rates and future age structures. It is known that the mortality rates from COVID-19 are greater amongst older people, so it should lead to a decrease in the OADR. However, the relatively small number of deaths in Australia mean that the effect of the pandemic will probably not substantially change our recommendations.
\hypertarget{reproducibility}{%
\section*{Reproducibility}\label{reproducibility}}
\addcontentsline{toc}{section}{Reproducibility}
The data and code to fully reproduce our results is available at \href{https://github.com/robjhyndman/PensionAge}{\texttt{https://github.com/robjhyndman/PensionAge}}. We used \textsf{R} \citep{RCore}, along with the \textsf{drake} \citep{Rdrake}, \textsf{tidyverse} \citep{Rtidyverse}, \textsf{demography} \citep{Rdemography} and \textsf{forecast} \citep{Rforecast} packages.
\bibliography{pensionage.bib}
\end{document}