This post is available for download here.
Summary
- While it may be tempting to time allocations to active strategies, it is generally best to hold them as long-term allocations.
- Despite this, some research has shown that there may be certain economic environments where trend following equity strategies are better suited.
- In this commentary, we replicate this data and find that a broad filter of recessionary periods does indeed show this for certain trend equity strategies but not for the style of trend equity in general.
- However, further decomposing the business cycle into contractions, recoveries, expansions, and slowdowns using leading economic indicators such as PMI and unemployment does show some promising relationships between the forecasted stage of the business cycle and trend following’s performance relative to buy-and-hold equities.
- Even if this data is not used to time trend equity strategies, it can be beneficial to investors for setting expectations and providing insight into performance differences.
Systematic active investing strategies are a way to achieve alternative return profiles that are not necessarily present when pursuing standard asset allocation and may therefore play an important role in developing well-diversified portfolios.
But these strategies are best viewed as allocations rather than trades.1 This is a topic we’ve written about a number of times with respect to factor investing over the past several years, citing the importance of weathering short-term pain for long-term gains. For active strategies to outperform, some underperformance is necessary. Or, as we like to say, “no pain, no premium.”
That being said, being tactical in our allocations to active strategies may have some value in certain cases. In one sense, we can view the multi-layered active decisions simply as another active strategy, distinct from the initial one.
An interesting post on Philosophical Economics looked at using a variety of recession indicators (unemployment, earnings growth, industrial production, etc.) as ways to systematically invest in either U.S. equities or a trend following strategy on U.S. equities. If the economic indicator was in a favorable trend, the strategy was 100% invested in equities. If the economic indicator was in an unfavorable trend, the strategy was invested in a trend following strategy applied to equities, holding cash when the market was in a downtrend.
The reasoning behind this strategy is intuitively appealing. Even if a recession indicator flags a likely recession, the market may still have room to run before turning south and warranting capital protection. On the other hand, when the recession indicator was favorable, purely investing in equities avoids some of the whipsaw costs that are inherent in trend following strategies.
In this commentary, we will first look at the general style of trend equity in the context of recessionary and non-recessionary periods and then get a bit more granular to see when trend following has worked historically through the economic cycle of Expansion, Slowdown, Contraction, and Recovery.
Replicating the Data
To get our bearings, we will first attempt to replicate some of the data from the Philosophical Economics post using only the classifications of “recession” and “not-recession”.
Keeping in line with the Philosophical Economics method, we will use whether the economic metric is above or below its 12-month moving average as the recession signal for the next month. We will use market data from the Kenneth French Data Library for the total U.S. stock market returns and the risk-free rate as the cash rate in the equity trend following model.
The following table shows the results of the trend following timing models using the United States ISM Purchasing Managers Index (PMI) and the Unemployment Rate as indicators.
U.S. Equities | 12mo MA Trend Equity | 12m MA Trend Timing Model (PMI) | 12mo MA Trend Timing Model (Unemployment) | |
Annualized Return | 11.3% | 11.1% | 11.3% | 12.2% |
Annualized Volatility | 14.7% | 11.2% | 11.9% | 12.4% |
Maximum Drawdown | 50.8% | 24.4% | 32.7% | 30.0% |
Sharpe Ratio | 0.49 | 0.62 | 0.61 | 0.66 |
Source: Quandl and U.S. Bureau of Labor Statistics. Calculations by Newfound Research. Results are hypothetical. Results assume the reinvestment of all distributions. Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Past performance is not an indicator of future results. You cannot invest in an index. Data is from Jan 1948 – Sep 2019.
With the trend timing model, we see an improvement in the absolute returns compared to the trend equity strategy alone. However, this comes at the expense of increasing the volatility and maximum drawdown.
In the case of unemployment, which was the strongest indicator that Philosophical Economics found, there is an improvement in risk-adjusted returns in the timing model.
Still, while there is a benefit, it may not be robust.
If we remove the dependence of the trend following model on a single metric or lookback parameter, the benefit of the macro-timing decreases. Specifically, if we replace our simple 12-month moving average trend equity rule with the ensemble approach utilized in the Newfound Trend Equity Index, we see very different results. This may indicate that one specific variant of trend following did well in this overall model, but the style of trend following might not lend itself well to this application.
U.S. Equities | Newfound Trend Equity Index | Trend Equity Index Blend (PMI) | Trend Equity Index Blend (Unemployment) | |
Annualized Return | 11.3% | 10.7% | 10.9% | 10.9% |
Annualized Volatility | 14.7% | 11.1% | 11.8% | 13.5% |
Maximum Drawdown | 50.8% | 25.8% | 36.1% | 36.0% |
Sharpe Ratio | 0.49 | 0.59 | 0.58 | 0.50 |
Source: Quandl and U.S. Bureau of Labor Statistics. Calculations by Newfound Research. Results are hypothetical. Results assume the reinvestment of all distributions. Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Past performance is not an indicator of future results. You cannot invest in an index. Data is from Jan 1948 – Sep 2019.
A more robust trend following model may already provide more upside capture during non-recessionary periods but at the expense of more downside capture during recessions. However, we cannot confidently assert that the lower level of down-capture in the single specification of the trend model is not partially due to luck.
If we desire to more thoroughly evaluate the style of trend following, we must get more granular with the economic cycles.
Breaking Down the Economic Cycle
Moving beyond the simple classification of “recession” and “not-recession”, we can follow MSCI’s methodology, which we used here previously, to classify the economic cycle into four primary states: Expansion, Slowdown, Contraction and Recovery.
We will focus on the 3-month moving average (“MA”) minus the 12-month MA for each indicator we examine according to the decision tree below. In the tree, we use the terms better or worse since lower unemployment rate and higher PMI values signal a stronger economy.
There is a decent amount of difference in the classifications using these two indicators, with the unemployment indicator signaling more frequent expansions and slowdowns. This should be taken as evidence that economic regimes are difficult to predict.
Source: Quandl and U.S. Bureau of Labor Statistics. Calculations by Newfound Research. Results are hypothetical. Results assume the reinvestment of all distributions. Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Past performance is not an indicator of future results. You cannot invest in an index. Data is from Jan 1948 – Sep 2019.
Once each indicator is in each state the transition probabilities are relatively close.
Source: Quandl and U.S. Bureau of Labor Statistics. Calculations by Newfound Research. Results are hypothetical. Past performance is not an indicator of future results.
This agrees with intuition when we consider the cyclical nature of these economic metrics. While not a perfect mathematical relationship, these states generally unfold sequentially without jumps from contractions to expansions or vice versa.
Trend Following in the Economic Cycle
Applying the four-part classification to the economic cycle shows where trend equity outperformed.
PMI Indicator | Unemployment Indicator | |||
U.S. Equities | Trend Equity | U.S. Equities | Trend Equity | |
Contraction | 7.6% | 10.3% | 1.0% | 7.3% |
Recovery | 12.2% | 9.3% | 15.4% | 15.0% |
Expansion | 14.3% | 14.4% | 13.9% | 11.3% |
Slowdown | 7.2% | 5.4% | 10.5% | 8.0% |
Source: Quandl and U.S. Bureau of Labor Statistics. Calculations by Newfound Research. Results are hypothetical. Results assume the reinvestment of all distributions. Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Past performance is not an indicator of future results. You cannot invest in an index. Data is from Jan 1948 – Sep 2019.
During contraction phases, regardless of indicators, trend equity outperformed buy-and-hold.
For the PMI indicator, trend equity was able to keep up during expansions, but this was not the case with the unemployment indicator. The reverse of this was true for recoveries: trend following was close to keeping up in the periods denoted by the unemployment indicator but not by the PMI indicator.
For both indicators, trend following underperformed during slowdowns.
This may seem contradictory at first, but these may be periods of more whipsaw as markets try to forecast future states. And since slowdowns typically occur after expansions and before contractions (at least in the idealized model), we may have to bear more of this whipsaw risk for the strategy to be adaptable enough to add value during the contraction.
The following two charts show the longest historical slowdowns for each indicator: the PMI indicator was for 11 months in late 2009 through much of 2010 and the unemployment rate indicator was for 16 months in 1984-85.
Source: Quandl and U.S. Bureau of Labor Statistics. Calculations by Newfound Research. Results are hypothetical. Results assume the reinvestment of all distributions. Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Past performance is not an indicator of future results. You cannot invest in an index.
In the first slowdown period, the trend equity strategy rode in tandem with equities as they continued to climb and then de-risked when equities declined. Equities quickly rebounded leaving the trend equity strategy underexposed to the rally.
In the second slowdown period, the trend equity strategy was heavily defensive going into the slowdown. This protected capital initially but then caused the strategy to lag once the market began to increase steadily.
The first period illustrates a time when the trend equity strategy was ready to adapt to changing market conditions and was unfortunately whipsawed. The second period illustrates a time when the trend equity strategy was already adapted to a supposedly oncoming contraction that did not materialize.
Using these historical patterns of performance, we can now explore how a strategy that systematically allocates to trend equity strategies might be constructed.
Timing Trend Following with the Economic Cycle
One simple way to apply a systematic timing strategy for shifting between equities and trend following is to only invest in equities when a slowdown is signaled.
The charts below show the returns and risk metrics for models using the PMI and unemployment rate individually and a model that blends the two allocations.
Source: Quandl and U.S. Bureau of Labor Statistics. Calculations by Newfound Research. Results are hypothetical. Results assume the reinvestment of all distributions. Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Past performance is not an indicator of future results. You cannot invest in an index. Data is from Jan 1948 – Sep 2019.
Source: Quandl and U.S. Bureau of Labor Statistics. Calculations by Newfound Research. Results are hypothetical. Results assume the reinvestment of all distributions. Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Past performance is not an indicator of future results. You cannot invest in an index. Data is from Jan 1948 – Sep 2019.
The returns increased slightly in every model relative to buy-and-hold, and the blended model performed consistently high across all metrics.
Blending multiple models generally produces benefits like these shown here, and in an actual implementation, utilizing additional economic indicators may make the strategy even more robust. There may be other ways to boost performance across the economic cycle, and we will explore these ideas in future research.
Conclusion
Should investors rotate in and out of active strategies?
Not in most cases, since the typical drivers are short-term underperformance that is a necessary component of active strategies.
However, there may be opportunities to make allocation tweaks based on the economic cycle.
The historical data suggests that a specification-neutral trend-equity strategy has outperformed buy-and-hold equities during economic contractions for both economic indicators. The performance during recoveries and expansions was mixed across indicators. It kept up with the buy-and-hold strategy during expansions denoted by PMI but not unemployment. This relationship was reversed for recoveries denoted by unemployment. In both models, trend equity has also lagged during economic slowdowns as whipsaw becomes more prevalent.
Based on the most recent PMI data, the current cycle is a contraction, indicating a favorable environment for trend equity under both cycle indicators. However, we should note that December 2018 through March 2019 was also labeled as a contraction according to PMI. Not all models are perfect.
Nevertheless, there may be some evidence that trend following can provide differentiated benefits based on the prevailing economic environment.
While an investor may not use this knowledge to shift around allocations to active trend following strategies, it can still provide insight into performance difference relative to buy-and-hold and set expectations going forward.
The Limit of Factor Timing
By Nathan Faber
On November 11, 2019
In Craftsmanship, Momentum, Risk & Style Premia, Weekly Commentary
This post is available as a PDF download here.
Summary
A few years ago, we began researching factor timing – moving among value, momentum, low volatility, quality, size etc. – with the hope of earning returns in excess not only of the equity market, but also of buy-and-hold factor strategies.
To time the factors, our natural first course of action was to exploit the behavioral biases that may create the factors themselves. We examined value and momentum across the factors and used these metrics to allocate to factors that we expected to outperform in the future.
The results were positive. However, taking into account transaction costs led to the conclusion that investors were likely better off simply holding a diversified factor portfolio.
We then looked at ways to time the factors using the business cycle.
The results in this case were even less convincing and were a bit too similar to a data-mined optimal solution to instill much faith going forward.
But this evidence does not necessarily remove the temptation to take a stab at timing the factors, especially since explicit transactions costs have been slashed for many investors accessing long-only factors through ETFs.Source: Kenneth French Data Library, AQR. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
After all, there is a lot to gain by choosing the right factors. For example, in the first 9 months of 2019, the spread between the best (Quality) and worst (Value) performing factors was nearly 1,000 basis points (“bps”). One month prior, that spread had been double!
In this research note, we will move away from devising a systematic approach to timing the factors (as AQR asserts, this is deceptively difficult) and instead focus on what a given method would have to overcome to achieve consistent outperformance.
Benchmarking Factor Timing
With all equity factor strategies, the goal is usually to outperform the market-cap weighted equity benchmark.
Since all factor portfolios can be thought of as a market cap weighted benchmark plus a long/short component that captures the isolated factor performance, we can focus our study solely on the long/short portfolio.
Using the common definitions of the factors (from Kenneth French and AQR), we can look at periods over which these self-financing factor portfolios generate positive returns to see if overlaying them on a market-cap benchmark would have added value over different lengths of time.1
We will also include the performance of an equally weighted basket of the four factors (“Blend”).
Source: Kenneth French Data Library, AQR. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions. Data from July 1957 – September 2019.
The persistence of factor outperformance over one-month periods is transient. If the goal is to outperform the most often, then the blended portfolio satisfies this requirement, and any timing strategy would have to be accurate enough to overcome this already existing spread.
Source: Kenneth French Data Library, AQR. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions. Data from July 1957 – September 2019.
The results for the blended portfolio are so much better than the stand-alone factors because the factors have correlations much lower than many other asset classes, allowing even naïve diversification to add tremendous value.
The blended portfolio also cuts downside risk in terms of returns. If the timing strategy is wrong, and chooses, for example, momentum in an underperforming month, then it could take longer for the strategy to climb back to even. But investors are used to short periods of underperformance and often (we hope) realize that some short-term pain is necessary for long-term gains.
Looking at the same analysis over rolling 1-year periods, we do see some longer periods of factor outperformance. Some examples are quality in the 1980s, value in the mid-2000s, momentum in the 1960s and 1990s, and size in the late-1970s.
However, there are also decent stretches where the factors underperform. For example, the recent decade for value, quality in the early 2010s, momentum sporadically in the 2000s, and size in the 1980s and 1990s. If the timing strategy gets stuck in these periods, then there can be a risk of abandoning it.
Source: Kenneth French Data Library, AQR. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions. Data from July 1957 – September 2019.
Again, a blended portfolio would have addressed many of these underperforming periods, giving up some of the upside with the benefit of reducing the risk of choosing the wrong factor in periods of underperformance.
Source: Kenneth French Data Library, AQR. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions. Data from July 1957 – September 2019.
And finally, if we extend our holding period to three years, which may be used for a slower moving signal based on either value or the business cycle, we see that the diversified portfolio still exhibits outperformance over the most rolling periods and has a strong ratio of upside to downside.
Source: Kenneth French Data Library, AQR. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions. Data from July 1957 – September 2019.
The diversified portfolio stands up to scrutiny against the individual factors but could a generalized model that can time the factors with a certain degree of accuracy lead to better outcomes?
Generic Factor Timing
To construct a generic factor timing model, we will consider a strategy that decides to hold each factor or not with a certain degree of accuracy.
For example, if the accuracy is 50%, then the strategy would essentially flip a coin for each factor. Heads and that factor is included in the portfolio; tails and it is left out. If the accuracy is 55%, then the strategy will hold the factor with a 55% probability when the factor return is positive and not hold the factor with the same probability when the factor return is negative. Just to be clear, this strategy is constructed with look-ahead bias as a tool for evaluation.
All factors included in the portfolio are equally weighted, and if no factors are included, then the returns is zero for that period.
This toy model will allow us to construct distributions to see where the blended portfolio of all the factors falls in terms of frequency of outperformance (hit rate), average outperformance, and average underperformance. The following charts show the percentiles of the diversified portfolio for the different metrics and model accuracies using 1,000 simulations.2
Source: Kenneth French Data Library, AQR. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions. Data from July 1957 – September 2019.
In terms of hit rate, the diversified portfolio behaves in the top tier of the models over all time periods for accuracies up to about 57%. Even with a model that is 60% accurate, the diversified portfolio was still above the median.
Source: Kenneth French Data Library, AQR. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions. Data from July 1957 – September 2019.
For average underperformance, the diversified portfolio also did very well in the context of these factor timing models. The low correlation between the factors leads to opportunities for the blended portfolio to limit the downside of individual factors.
Source: Kenneth French Data Library, AQR. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions. Data from July 1957 – September 2019.
For average outperformance, the diversified portfolio did much worse than the timing model over all time horizons. We can attribute this also to the low correlation between the factors, as choosing only a subset of factors and equally weighting them often leads to more extreme returns.
Overall, the diversified portfolio manages the risks of underperformance, both in magnitude and in frequency, at the expense of sacrificing outperformance potential. We saw this in the first section when we compared the diversified portfolio to the individual factors.
But if we want to have increased return potential, we will have to introduce some model risk to time the factors.
Checking in on Momentum
Momentum is one model-based way to time the factors. Under our definition of accuracy in the toy model, a 12-1 momentum strategy on the factors has an accuracy of about 56%. While the diversified portfolio exhibited some metrics in line with strategies that were even more accurate than this, it never bore concentration risk: it always held all four factors.
Source: Kenneth French Data Library, AQR. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions. Data from July 1957 – September 2019.
For the hit rate percentiles of the momentum strategy, we see a more subdued response. Momentum does not win as much as the diversified portfolio over the different time periods.
But not winning as much can be fine if you win bigger when you do win.
The charts below show that momentum does indeed have a higher outperformance percentile but with a worse underperformance percentile, especially for 1-month periods, likely due to mean reversionary whipsaw.
Source: Kenneth French Data Library, AQR. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions. Data from July 1957 – September 2019.
While momentum is definitely not the only way to time the factors, it is a good baseline to see what is required for higher average outperformance.
Now, turning back to our generic factor timing model, what accuracy would you need to beat momentum?
Sharpening our Signal
The answer is: not a whole lot. Most of the time, we only need to be about 53% accurate to beat the momentum-based factor timing.
Source: Kenneth French Data Library, AQR. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
The caveat is that this is the median performance of the simulations. The accuracy figure climbs closer to 60% if we use the 25th percentile as our target.
While these may not seem like extremely high requirements for running a successful factor timing strategy, it is important to observe that not many investors are doing this. True accuracy may be hard to discover, and sticking with the system may be even harder when the true accuracy can never be known.
Conclusion
If you made it this far looking for some rosy news on factor timing or the Holy Grail of how to do it skillfully, you may be disappointed.
However, for most investors looking to generate some modest benefits relative to market-cap equity, there is good news. Any signal for timing factors does not have to be highly accurate to perform well, and in the absence of a signal for timing, a diversified portfolio of the factors can lead to successful results by the metrics of average underperformance and frequency of underperformance.
For those investors looking for higher outperformance, concentration risk will be necessary.
Any timing strategy on low correlation investments will generally forego significant diversification in the pursuit of higher returns.
While this may be the goal when constructing the strategy, we should always pause and determine whether the potential benefits outweigh the costs. Transaction costs may be lower now. However, there are still operational burdens and the potential stress caused by underperformance when a system is not automated or when results are tracked too frequently.
Factor timing may be possible, but timing and tactical rotation may be better suited to scenarios where some of the model risk can be mitigated.