This post is available as a PDF download here.
Summary
- Many retirement planning analyses rely on Monte Carlo simulations with static assumptions for withdrawals.
- Incorporating dynamic spending rules can more closely align the simulations with how investors would likely behave during times when the plan looked like it was on a path to failure.
- Even a modest reduction in withdrawals (e.g. 10%) can have a meaningful impact on reducing failure rates, nearly cutting it in half in a sample simulation.
- Combining dynamic spending rules with other marginal improvements, such as supplemental income and active risk management, can lead to more robust retirement plans and give investors a better understanding of the variables that are within their realm of control.
Monte Carlo simulations are a prevalent tool in financial planning, especially pertaining to retirement success calculations.
Under a typical framework of normally distributed portfolio returns and constant inflation-adjusted withdrawals, calculating the success of a given retirement portfolio is straightforward. But as with most tools in finance, the art lies both in the assumptions that go into the calculation and in the proper interpretation of the result.
If a client is told they have a 10% chance of running out of money over their projected retirement horizon, what does that mean for them?
They cannot make 9 copies of themselves to live out separate lives, with one copy (hopefully not the original) unfortunately burning through the account prematurely.
They also cannot create 9 parallel universes and ensure they do not choose whichever one does not work out.
We wrote previously how investors follow a single path (You Are Not a Monte-Carlo Simulation). If that path hits zero, the other hypothetical simulation paths don’t mean a thing.
A simulation path is only as valuable as the assumptions that go into creating it, and fortunately, we can make our simulations align more closely with investor behavior.
The best way to interpret the 10% failure rate is to think of it as a 10% chance of having to make an adjustment before it hits zero. Rarely would an investor stand by while their account went to zero. There are circumstances that are entirely out of investor control, but to the extent that there was something they could do to prevent that event, they would most likely do it.
Derek Tharp, on Michael Kitces’ blog, wrote a post a few years ago weighing the relative benefit of implementing small but permanent adjustments vs. large but temporary adjustments to retirement withdrawals and found that making small adjustments and leaving them in place led to greater likelihoods of success over retirement horizons (Dynamic Retirement Spending Adjustments: Small-But-Permanent Vs Large-But-Temporary).
In this week’s commentary, we want to dig a little deeper into some simple path dependent modifications that we can make to retirement Monte-Carlo simulations with the hope of creating a more robust toolset for financial planning.
The Initial Plan
Suppose an investor is 65 and holds a moderate portfolio of 60% U.S. stocks and 40% U.S. Treasuries. From 1871 until mid-2019, this portfolio would have returned an inflation-adjusted 5.1% per year with 10.6% volatility according to Global Financial Data.
Sticking with the rule-of-thumb 4% annual withdrawal of the initial portfolio balance and assuming a 30-year retirement horizon, this yields a predicted failure rate of 8% (plus or minus about 50 bps).
The financial plan is complete.
If you start with $1,000,000, simply withdraw $3,333/month and you should be fine 92% of the time.
But what if the portfolio drops 5% in the first month? (It almost did that in October 2018).
The projected failure rate over the next 29 years and 11 months has gone up to 11%. That violates a 10% threshold that may have been a target in the planning process.
Or what if it drops 30% in the first 6 months, like it would have in the second half of 1931?
Now the project failure rate is a staggering 46%. Retirement success has been reduced to a coin flip.
Admittedly, these are trying scenarios, but these numbers are a key driver for financial planning. If we can better understand the risks and spell out a course of action beforehand, then the risk of making a rash emotion-driven decision can be mitigated.
Aligning the Plan with Reality
When the market environment is challenging, investors can benefit by being flexible. The initial financial plan does not have to be jettisoned; agreed upon actions within it are implemented.
One of the simplest – and most impactful – modifications to make is an adjustment to spending. For instance, an investor might decide at the outset to scale back spending by a set amount when the probably of failure crosses a threshold.Source: Global Financial Data. Calculations by Newfound.
This reduction in spending would increase the probability of success going forward through the remainder of the retirement horizon.
And if we knew that this spending cut would likely happen if it was necessary, then we can quantify it as a rule in the initial Monte Carlo simulation used for financial planning.
Graphically, we can visualize this process by looking at the probabilities of failure for varying asset levels over time. For example, at 10 years after retirement, the orange line indicates that a portfolio value ~80% of the initial value would have about a 5% failure rate.
Source: Global Financial Data. Calculations by Newfound.
As long as the portfolio value remains above a given line, no adjustment would be needed based on a standard Monte Carlo analysis. Once a line is crossed, the probability of success is below that threshold.
This chart presents a good illustration of sequence risk: the lines are flatter initially after retirement and the slope progressively steepens as the time progresses. A large drawdown initially puts the portfolio below the threshold for making and adjustment.
For instance, at 5 years, the portfolio has more than a 10% failure rate if the value is below 86%. Assuming zero real returns, withdrawals alone would have reduced the value to 80%. Positive returns over this short time period would be necessary to feel secure in the plan.
Looking under the hood along the individual paths used for the Monte Carlo simulation, at 5 years, a quarter of them would be in a state requiring an adjustment to spending at this 10% failure level.
Source: Global Financial Data. Calculations by Newfound.
This belies the fact that some of the paths that would have crossed this 10% failure threshold prior to the 5-year mark improved before the 5-year mark was hit. 75% of the paths were below this 10% failure rate at some point prior to the 5-year mark. Without more appropriate expectations of a what these simulations mean, under this model, most investors would have felt like their plan’s failure rate was uncomfortable at some point in the first 5 years after retirement!
Dynamic Spending Rules
If the goal is ultimately not to run out of funds in retirement, the first spending adjustment case can substantially improve those chances (aside from a large negative return in the final periods prior to the last withdrawals).
Each month, we will compare the portfolio value to the 90% success value. If the portfolio is below that cutoff, we will size the withdrawal to hit improve the odds of success back to that level, if possible.
The benefit of this approach is greatly improved success along the different paths. The cost is forgone income.
But this can mean forgoing a lot of income over the life of the portfolio in a particularly bad state of the world. The worst case in terms of this total forgone income is shown below.
Source: Global Financial Data. Calculations by Newfound.
The portfolio gives up withdrawals totaling 74%, nearly 19 years’ worth. Most of this is given up in consecutive periods during the prolonged drawdown that occurs shortly after retirement.
This is an extreme case that illustrates how large of income adjustments could be required to ensure success under a Monte Carlo framework.
The median case foregoes 9 months of total income over the portfolio horizon, and the worst 5% of cases all give up 30% (7.5 years) of income based off the initial portfolio value.
That is still a bit extreme in terms of potential cutbacks.
As a more realistic scenario that is easier on the pocketbook, we will limit the total annual cutback to 30% of the withdrawal in the following manner:
- If the current chance of failure is greater than 20%, cut spending by 30%. This equates to reducing the annual withdrawal by $12,000 assuming a $1,000,000 initial balance.
- If the current chance of failure is between 15% and 20%, cut spending by 20%. This equates to reducing the annual withdrawal by $8,000 assuming a $1,000,000 initial balance.
- If the current chance of failure is between 10% and 15%, cut spending by 10%. This equates to reducing the annual withdrawal by $4,000 assuming a $1,000,000 initial balance.
These rules still increase the success rate to 99% but substantially reduce the amount of reductions in income.
Looking again at the worst-case scenario, we see that this case still “fails” (even though it lasts another 4.5 years) but that its reduction in come is now less than half of what it was in the extreme cutback case. This pattern is in line with the “lower for longer” reductions that Derek had looked at in the blog post.
Source: Global Financial Data. Calculations by Newfound.
On the 66% of sample paths where there was a cut in spending at some point, the average total cut amounted to 5% of the portfolio (a little over a year of withdrawals spread over the life of the portfolio).
Even moving to an even less extreme reduction regime where only 10% cuts are ever made if the probability of failure increases above 10%, the average reduction in the 66% of cases that required cuts was about 9 months of withdrawals over the 30-year period.
In these scenarios, the failure rate is reduced to 5% (from 8% with no dynamic spending rules).
Source: Global Financial Data. Calculations by Newfound.
Conclusion
Retirement simulations can be a powerful planning tool, but they are only as good as their inputs and assumptions. Making them align as closes with reality as possible can be a way to quantify the impact of dynamic spending rules in retirement.
While the magnitude of spending reductions necessary to guarantee success of a retirement plan in all potential states of the world is prohibitive. However, small modifications to spending can have a large impact on success.
For example, reducing withdrawal by 10% when the forecasted failure rate increases above 10% nearly cut the failure rate of the entire plan in half.
But dynamic spending rules do not exist in a vacuum; they can be paired with other marginal improvements to boost the likelihood of success:
- Seek out higher returns – small increases in portfolio returns can have a significant impact over the 30 -ear planning horizon.
- Supplement income – having supplements to income, even small ones, can offset spending during any market environment, improving the success rate of the financial plan.
- Actively manage risk – managing risk, especially early in retirement is a key factor to now having to reduce withdrawals in retirement.
- Plan for more flexibility – having the ability to reduce spending when necessary reduces the need to rely on the portfolio balance when the previous factors are not working.
While failure is certainly possible for investors, a “too big to fail” mentality is much more in line with the reality of retirement.
Even if absolute failure is unlikely, adjustments will likely be a requirement. These can be built into the retirement planning process and can shed light on stress testing scenarios and sensitivity.
From a retirement planning perspective, flexibility is simply another form of risk management.
Ensemble Multi-Asset Momentum
By Corey Hoffstein
On July 22, 2019
In Craftsmanship, Momentum, Popular, Risk & Style Premia, Risk Management, Weekly Commentary
This post is available as a PDF download here.
Summary
Early in the 2010s, a suite of index-linked products came to market that raised billions of dollars. These products – offered by just about every major bank – sought to simultaneously exploit the diversification benefits of modern portfolio theory and the potential for excess returns from the momentum anomaly.
While each index has its own bells and whistles, they generally follow the same approach:
And despite their differences, we can see in plotting their returns below that these indices generally share a common return pattern, indicating a common, driving style.
Source: Bloomberg.
Frequent readers will know that “monthly rebalance” is an immediate red flag for us here at Newfound: an indicator that timing luck is likely lurking nearby.
Replicating Multi-Asset Momentum
To test the impact of timing luck, we replicate a simple multi-asset momentum strategy based upon available index descriptions.
We rebalance the portfolio at the end of each month. Our optimization process seeks to identify the portfolio with a realized volatility less than 5% that would have maximized returns over the prior six months, subject to a number of position and asset-level limits. If the 5% volatility target is not achievable, the target is increased by 1% until a portfolio can be constructed that satisfies our constraints.
We use the following ETFs and asset class limits:
As a naïve test for timing luck, rather than assuming the index rebalances at the end of each month, we will simply assume the index rebalances every 21 trading days. In doing so, we can construct 21 different variations of the index, each representing the results from selecting a different rebalance date.
Source: CSI Analytics; Calculations by Newfound Research. Results are backtested and hypothetical. Results assume the reinvestment of all distributions. Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes, with the exception of underlying ETF expense ratios. Past performance is not an indicator of future results.
As expected, the choice of rebalance date has a meaningful impact. Annualized returns range from 4.7% to 5.5%, Sharpe ratios range from 0.6 to 0.9, and maximum drawdowns range from 9.9% to 20.8%.
On a year-by-year basis, the only thing that is consistent is the large spread between the worst and best-performing rebalance date. On average, the yearly spread exceeds 400 basis points.
Min
Max
-9.91%
0.85%
2.36%
4.59%
6.46%
9.65%
3.31%
10.15%
6.76%
10.83%
3.42%
6.13%
5.98%
10.60%
-5.93%
-2.51%
4.18%
8.45%
9.60%
11.62%
-6.00%
-2.53%
5.93%
10.01%
* Partial year starting 7/22/2018
We’ve said it in the past and we’ll say it again: timing luck can be the difference between hired and fired. And while we’d rather be on the side of good luck, the lack of control means we’d rather just avoid this risk all together.
If it isn’t nailed down for a reason, diversify it
The choice of when to rebalance is certainly not the only free variable of our multi-asset momentum strategy. Without an explicit view as to why a choice is made, our preference is always to diversify so as to avoid specification risk.
We will leave the constraints (e.g. volatility target and weight constraints) well enough alone in this example, but we should consider the process by which we’re measuring past returns as well as the horizon over which we’re measuring it. There is plenty of historical efficacy to using prior 6-month total returns for momentum, but no lack of evidence supporting other lookback horizons or measurements.
Therefore, we will use three models of momentum: prior total return, the distance of price from its moving average, and the distance of a short-term moving average from a longer-term moving average. We will vary the parameterization of these signals to cover horizons ranging from 3- to 15-months in length.
We will also vary which day of the month the portfolio rebalances on.
By varying the signal, the lookback horizon, and the rebalance date, we can generate hundreds of different portfolios, all supported by the same theoretical evidence but having slightly different realized results due to their particular specification.
Our robust portfolio emerges by calculating the weights for all these different variations and averaging them together, in many ways creating a virtual strategy-of-strategies.
Below we plot the result of this –ensemble approach– as compared to a –random sample of the underlying specifications–. We can see that while there are specifications that do much better, there are also those that do much worse. By employing an ensemble approach, we forgo the opportunity for good luck and avoid the risk of bad luck. Along the way, though, we may pick up some diversification benefits: the Sharpe ratio of the ensemble approach fell in the top quartile of specifications and its maximum drawdown was in the bottom quartile (i.e. lower drawdown).
Source: CSI Analytics; Calculations by Newfound Research. Results are backtested and hypothetical. Results assume the reinvestment of all distributions. Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes, with the exception of underlying ETF expense ratios. Past performance is not an indicator of future results.
Conclusion
In this commentary, we again demonstrate the potential risk of needless specification and the potential power of diversification.
Using a popular multi-asset momentum model as our example, we again find a significant amount of timing luck lurking in a monthly rebalance specification. By building a virtual strategy-of-strategies, we are able to manage this risk by partially rebalancing our portfolio on different days.
We go a step further, acknowledging that processrepresents another axis of risk. Specifically, we vary both how we measure momentum and the horizon over which it is measured. Through the variation of rebalance days, model specifications, and lookback horizons, we generate over 500 different strategy specifications and combine them into a virtual strategy-of-strategies to generate our robust multi-asset momentum model.
As with prior commentaries, we find that the robust model is able to effectively reduce the risk of both specification and timing luck. But perhaps most importantly, it was able to harvest the benefits of diversification, realizing a Sharpe ratio in the top quartile of specifications and a maximum drawdown in the lowest quartile.