This post is available as a PDF download here.
Summary
- In this paper we discuss simple rules for timing exposure to 10-year U.S. Treasuries.
- We explore signals based upon the slope of the yield curve (“carry”), prior returns (“trend”), and prior equity returns (“hedge”).
- We implement long/short implementations of each strategy covering the time period of 1962-2018.
- We find that all three methods improve both total and risk-adjusted returns when compared to long-only exposure to excess bond returns.
- Naïve combination of both strategies and signals appears to improve realized risk-adjusted returns, promoting the benefits of process diversification.
Introduction
In this strategy brief, we discuss three trading rules for timing exposure to duration. Specifically, we seek to time the excess returns generated from owning 10-year U.S. Treasury bonds over short rates. This piece is meant as a companion to our prior, longer-form explorations Duration Timing with Style Premiaand Timing Bonds with Value, Momentum, and Carry. In contrast, the trading rules herein are simplistic by design in an effort to highlight the efficacy of the signals.
We explore three different signals in this piece:
- The slope of the yield curve (“term spread”);
- Prior realized excess bond returns; and
- Prior realized equity market returns.
In contrast to prior studies, we do not consider traditional value measures, such as real yields, or explicit estimates of the bond risk premium, as they are less easily calculated. Nevertheless, the signals studied herein capture a variety of potential influences upon bond markets, including inflation shocks, economic shocks, policy shocks, marginal utility shocks, and behavioral anomalies.
The strategies based upon our signals are implemented as dollar-neutral long/short portfolios that go long a constant maturity 10-year U.S. Treasury bond index and short a short-term U.S. Treasury index (assumed to be a 1-year index prior to 1982 and a 3-month index thereafter). We compare these strategies to a “long-only” implementation that is long the 10-year U.S. Treasury bond index and short the short-term U.S. Treasury index in order to capture the excess realized return associated with duration.
Implementing our strategies as dollar-neutral long/short portfolios allows them to be interpreted in a variety of useful manners. For example, one obvious interpretation is an overlay implemented on an existing bond portfolio using Treasury futures. However, another interpretation may simply be to guide investors as to whether to extend or contract their duration exposure around a more intermediate-term bond portfolio (e.g. a 5-year duration).
At the end of the piece, we explore the potential diversification benefits achieved by combining these strategies in both an integrated (i.e. signal combination) and composite (i.e. strategy combination) fashion.
Slope of the Yield Curve
In past research on timing duration, we considered explicit measures of the bond risk premium as well as valuation. In Duration Timing with Style Premiawe used a simple signal based upon real yield, which had the problem of being predominately long over the last several decades. In Timing Bonds with Value, Momentum, and Carry we compared a de-trended real yield against recent levels in an attempt to capture more short-term valuation fluctuations.
In both of these prior research pieces, we also explicitly considered the slope of the yield curve as a predictor of future excess bond returns. One complicating factor to carry signals is that rate steepness simultaneously captures both the expectation of rising short rates as well as an embedded risk premium. In particular, evidence suggests that mean-reverting rate expectations dominate steepness when short rates are exceptionally low or high. Anecdotally, this may be due to the fact that the front end of the curve is determined by central bank policy while the back end is determined by inflation expectations.
Thus, despite being a rather blunt measure, steepness may simultaneously be related to business cycles, credit cycles and monetary policy cycles. To quote Ilmanen (2011):
A steep [yield curve] coincides with high unemployment rate (correlation +0.45) and predictsfast economic growth. [Yield curve] countercyclicality may explain its ability to predict near-term bond and stock returns: high required premia near business cycle troughs result in a steep [yield curve], while low required premia near business cycle peaks result in an inverted [yield curve].
Therefore, while estimates of real yield may seek to be explicit measures of value, we may consider carry to be an ancillary measure as well, as a high carry tends to be associated with a high term premium. In Figure 1 we plot the annualized next month excess bond return based upon the quartile (using the prior 10 years of information) that the term spread falls into. We can see a significant monotonic improvement from the 1stto the 4thquartiles, indicating that higher levels of carry, relative to the past, are positive indicators of future returns.
Therefore, we construct our carry strategy as follows:
- At the end of each month, calculate the term spread between 10- and 1-year U.S. Treasuries.
- Calculate the realized percentile of this spread by comparing it against the prior 10-years of daily term spread measures.
- If the carry score is in the top two thirds, go long excess bond returns. If the carry score is in the bottom third, go short excess bond returns.
- Trade at the close of the 1sttrading day of the month.
Returns for this strategy are plotted in Figure 2. Our research suggests that the backtested results of this model can be significantly improved through the use of longer holding periods and portfolio tranching. Another potential improvement is to scale exposure linearly to the current percentile. We will leave these implementations as exercises to readers.
Figure 1
Source: Kenneth French Data Library, Federal Reserve of St. Louis. Calculations by Newfound Research. Returns are backtested and hypothetical. Return data relies on hypothetical indices and is exclusive of all fees and expenses. Returns assume the reinvestment of all distributions. The Carry Long/Short strategy does not reflect any strategy offered or managed by Newfound Research and was constructed exclusively for the purposes of this commentary. It is not possible to invest in an index. Past performance does not guarantee future results.
Figure 2
Data from 1972-2018
| Annualized Return | Annualized Volatility | Sharpe Ratio |
Long Only | 2.1% | 7.6% | 0.27 |
CARRY L/S | 2.6% | 7.7% | 0.33 |
Source: Kenneth French Data Library, Federal Reserve of St. Louis. Calculations by Newfound Research. Returns are backtested and hypothetical. Return data relies on hypothetical indices and is exclusive of all fees and expenses. Returns assume the reinvestment of all distributions. The Carry Long/Short strategy does not reflect any strategy offered or managed by Newfound Research and was constructed exclusively for the purposes of this commentary. It is not possible to invest in an index. Past performance does not guarantee future results.
Trend in Bond Returns
Momentum, in both its relative and absolute (i.e. “trend”) forms, has a long history among both practitioners and academics (see our summary piece Two Centuries of Momentum).
The literature covering momentum in bond returns, however, varies in precisely whatprior returns matter. There are three primary categories: (1) change in bond yields (e.g. Ilmanen (1997)), (2) total return of individual bonds (e.g. Kolanovic and Wei (2015) and Brooks and Moskowitz (2017)), and (3) total return of bond indices (or futures) (e.g. Asness, Moskowitz, and Pedersen (2013), Durham (2013), and Hurst, Ooi, Pedersen (2014))
In our view, the approaches have varying trade-offs:
- While empirical evidence suggests that nominal interest rates can exhibit secular trends, rate evolution is most frequently modeled as mean-reversionary. Our research suggests that very short-term momentum can be effective, but leads to a significant amount of turnover.
- The total return of individual bonds makes sense if we plan on running a cross-sectional bond model (i.e. identifying individual bonds), but is less applicable if we want to implement with a constant maturity index.
- The total return of a bond index may capture past returns that are attributable to securities that have been recently removed.
We think it is worth noting that the latter two methods can capture yield curve effects beyond shift, including roll return, steepening and curvature changes. In fact, momentum in general may even be able to capture other effects such as flight-to-safety and liquidity (supply-demand) factors.
In this piece, we elect to measure momentum as an exponentially-weighting average of prior log returns of the total return excess between long and short bond indices. We measure this average at the end of each month and go long duration when it is positive and short duration when it is negative. In Figure 4 we plot the results of this method based upon a variety of lookback periods that approximate 1-, 3-, 6-, and 12-month formation periods.
Figure 3
| MOM 21 | MOM 63 | MOM 126 | MOM 252 |
MOM 21 | 1.00 | 0.87 | 0.65 | 0.42 |
MOM 63 | 0.87 | 1.00 | 0.77 | 0.53 |
MOM 126 | 0.65 | 0.77 | 1.00 | 0.76 |
MOM 252 | 0.42 | 0.53 | 0.76 | 1.00 |
We see varying success in the methods, with only MOM 63 and MOM 256 exhibiting better risk-adjusted return profiles. Despite this long-term success, we can see that MOM 63 remains in a drawdown that began in the early 2000s, highlighting the potential risk of relying too heavily on a specific measure or formation period. In Figure 3 we calculate the correlation between the different momentum strategies. As we found in Measuring Process Diversification in Trend Following, diversification opportunities appear to be available by mixing both short- and long-term formation periods.
With this in mind, we elect for the following momentum implementation:
- At the end of each month, calculate both a 21- and 252-day exponentially-weighted moving average of realized daily excess log returns.
- When both signals are positive, go long duration; when both signals are negative, go short duration; when signals are mixed, stay flat.
- Rebalance at the close of the next trading day.
The backtested results of this strategy are displayed in Figure 5.
As with carry, we find that there are potential craftsmanship improvements that can be made with this strategy. For example, implementing with four tranches, weekly rebalances appears to significantly improve backtested risk-adjusted returns. Furthermore, there may be benefits that can be achieved by incorporating other means of measuring trends as well as other lookback periods (see Diversifying the What, When, and How of Trend Following and Measuring Process Diversification in Trend Following).
Figure 4
Data from 1963-2018
| Annualized Return | Annualized Volatility | Sharpe Ratio |
Long Only | 1.5% | 7.3% | 0.21 |
MOM 21 | 1.4% | 7.5% | 0.19 |
MOM 63 | 1.8% | 7.4% | 0.25 |
MOM 128 | 1.3% | 7.4% | 0.18 |
MOM 252 | 1.9% | 7.4% | 0.26 |
Source: Kenneth French Data Library, Federal Reserve of St. Louis. Calculations by Newfound Research. Returns are backtested and hypothetical. Return data relies on hypothetical indices and is exclusive of all fees and expenses. Returns assume the reinvestment of all distributions. The Momentum strategies do not reflect any strategies offered or managed by Newfound Research and were constructed exclusively for the purposes of this commentary. It is not possible to invest in an index. Past performance does not guarantee future results.
Figure 5

Data from 1963-2018
| Annualized Return | Annualized Volatility | Sharpe Ratio |
Long Only | 1.5% | 7.2% | 0.21 |
MOM L/S | 1.7% | 6.3% | 0.28 |
Source: Kenneth French Data Library, Federal Reserve of St. Louis. Calculations by Newfound Research. Returns are backtested and hypothetical. Return data relies on hypothetical indices and is exclusive of all fees and expenses. Returns assume the reinvestment of all distributions. The Momentum Long/Short strategy does not reflect any strategy offered or managed by Newfound Research and was constructed exclusively for the purposes of this commentary. It is not possible to invest in an index. Past performance does not guarantee future results.
Safe-Haven Premium
Stocks and bonds generally exhibit a positive correlation over time. One thesis for this long-term relationship is the present value model, which argues that declining yields, and hence increasing bond prices, increase the value of future discounted cash flows and therefore the fair value of equities. Despite this long-term relationship, shocks in economic growth, inflation, and even monetary policy can overwhelm the discount rate thesis and create a regime-varying correlation structure.
For example, empirical evidence suggests that high quality bonds can exhibit a safe haven premium during periods of economic stress. Using real equity prices as a proxy for wealth, Ilmanen (1995) finds that “wealth-dependent relative risk aversion appears to be an important source of bond return predictability.” Specifically, inverse wealth is a significant positive predictor of future excess bond returns at both world and local (U.S., Canada, Japan, Germany, France, and United Kingdom) levels. Ilmanen (2003) finds that, “stock-bond correlations are more likely to be negative when inflation is low, growth is slow, equities are weak, and volatility is high.”
To capitalize on this safe-haven premium, we derive a signal based upon prior equity returns. Specifically, we utilize an exponentially weighted average of prior log returns to estimate the underlying trend of equities. We then compare this estimate to a 10-year rolling window of prior estimates, calculating the current percentile.
In Figure 6 we plot the annualized excess bond return for the month following, assuming signals are generated at the close of each month and trades are placed at the close of the following trading day. We can see several effects. First, next month returns for 1st quartile equity momentum – i.e. very poor equity returns – tends to be significantly higher than other quartiles. Second, excess bond returns in the month following very strong equity returns tend to be poor. We would posit that these two effects are two sides of the same coin: the safe-haven premium during 1st quartile periods and an unwind of the premium in 4th quartile periods. Finally, we can see that 2nd and 3rd quartile returns tend to be positive, in line with the generally positive excess bond return over the measured period.
In an effort to isolate the safe-haven premium, we construct the following strategy:
- At the end of each month, calculate an equity momentum measure by taking a 63-day exponentially weighted average of prior daily log-returns.
- Calculate the realized percentile of this momentum measure by comparing it against the prior 10-years of daily momentum measures.
- If the momentum score is in the bottom quartile, go long excess bond returns. If the momentum score is in the top quartile, go short excess bond returns. Otherwise, remain flat.
- Trade at the close of the 1st trading day of the month.
Returns for this strategy are plotted in Figure 7. As expected based upon the quartile design, the strategy only spends 24% of its time long, 23% of its time short, and the remainder of its time flat. Despite this even split in time, approximately 2/3rds of the strategy’s return comes from the periods when the strategy is long.
Figure 6

Source: Kenneth French Data Library, Federal Reserve of St. Louis. Calculations by Newfound Research. Returns are backtested and hypothetical. Return data relies on hypothetical indices and is exclusive of all fees and expenses. Returns assume the reinvestment of all distributions. The Equity Momentum Long/Short strategy does not reflect any strategy offered or managed by Newfound Research and was constructed exclusively for the purposes of this commentary. It is not possible to invest in an index. Past performance does not guarantee future results.
Figure 7

Data from 1962-2018
| Annualized Return | Annualized Volatility | Sharpe Ratio |
Long Only | 1.5% | 7.2% | 0.21 |
Equity Mom L/S | 1.9% | 5.7% | 0.34 |
Source: Kenneth French Data Library, Federal Reserve of St. Louis. Calculations by Newfound Research. Returns are backtested and hypothetical. Return data relies on hypothetical indices and is exclusive of all fees and expenses. Returns assume the reinvestment of all distributions. The Equity Momentum Long/Short strategy does not reflect any strategy offered or managed by Newfound Research and was constructed exclusively for the purposes of this commentary. It is not possible to invest in an index. Past performance does not guarantee future results.
Combining Signals
Despite trading the same underlying instrument, variation in strategy construction means that we can likely benefit from process diversification in constructing a combined strategy. Figure 8 quantifies the available diversification by measuring full-period correlations among the strategies from joint inception (1972). We can also see that the strategies exhibit low correlation to the Long Only implementation, suggesting that they may introduce diversification benefits to a strategic duration allocation as well.
Figure 8
| LONG ONLY | CARRY L/S | MOM L/S | EQ MOM L/S |
LONG ONLY | 1.00 | 0.42 | 0.33 | -0.09 |
CARRY L/S | 0.42 | 1.00 | 0.40 | -0.09 |
MOM L/S | 0.33 | 0.40 | 1.00 | -0.13 |
EQ MOM L/S | -0.10 | -0.10 | -0.19 | 1.00 |
We explore two different implementations of a diversified strategy. In the first, we simply combine the three strategies in equal-weight, rebalancing on a monthly basis. This implementation can be interpreted as three sleeves of a larger portfolio construction. In the second implementation, we combine underlying long/short signals. When the net signal is positive, the strategy goes 100% long duration and when the signal is negative, it goes 100% short. This can be thought of as an integrated approach that takes a majority-rules voting approach. Results for these strategies are plotted in Figure 9. We note the substantial increase in the backtested Sharpe Ratio of these diversified approaches in comparison to their underlying components outlined in prior sections.
It is important to note that despite strong total and risk-adjusted returns, the strategies spend only approximately 54% of their time net-long duration, with 19% of their time spent flat and 27% of their time spent short. While slightly biased long, this breakdown provides evidence that strategies are not simply the beneficiaries of a bull market in duration over the prior several decades.
Figure 9

Data from 1972-2018
| Annualized Return | Annualized Volatility | Sharpe Ratio |
Long Only | 2.1% | 7.6% | 0.27 |
Combined L/S | 2.5% | 4.3% | 0.58 |
Integrated L/S | 3.5% | 7.1% | 0.49 |
Source: Kenneth French Data Library, Federal Reserve of St. Louis. Calculations by Newfound Research. Returns are backtested and hypothetical. Return data relies on hypothetical indices and is exclusive of all fees and expenses. Returns assume the reinvestment of all distributions. Neither the Combined Long/Short or Integrated Long/Short strategies reflect any strategy offered or managed by Newfound Research and were constructed exclusively for the purposes of this commentary. It is not possible to invest in an index. Past performance does not guarantee future results.
Conclusion
In this research brief, we continued our exploration of duration timing strategies. We aimed to implement several signals that were simple by construction. Specifically, we evaluated the impact of term spread, prior excess bond returns, and prior equity returns on next month’s excess bond returns. Despite their simplicity, we find that all three signals can potentially offer investors insight for tactical timing decisions.
While we believe that significant craftsmanship improvements can be made in all three strategies, low hanging improvement may simply come from combining the approaches. We find a meaningful improvement in Sharpe Ratio by naively combining these strategies in both a sleeve-based and integrated signal fashion.
Bibliography
Asness, Clifford S. and Moskowitz, Tobias J. and Pedersen, Lasse Heje, Value and Momentum Everywhere (June 1, 2012). Chicago Booth Research Paper No. 12-53; Fama-Miller Working Paper. Available at SSRN: https://ssrn.com/abstract=2174501 or http://dx.doi.org/10.2139/ssrn.2174501
Brooks, Jordan and Moskowitz, Tobias J., Yield Curve Premia (July 1, 2017). Available at SSRN: https://ssrn.com/abstract=2956411 or http://dx.doi.org/10.2139/ssrn.2956411
Durham, J. Benson, Momentum and the Term Structure of Interest Rates (December 1, 2013). FRB of New York Staff Report No. 657. Available at SSRN: https://ssrn.com/abstract=2377379 or http://dx.doi.org/10.2139/ssrn.2377379
Hurst, Brian and Ooi, Yao Hua and Pedersen, Lasse Heje, A Century of Evidence on Trend-Following Investing (June 27, 2017). Available at SSRN: https://ssrn.com/abstract=2993026 or http://dx.doi.org/10.2139/ssrn.2993026
Ilmanen, Antti, Time-Varying Expected Returns in International Bond Markets, Journal of Finance, Vol. 50, No. 2, 1995, pp. 481-506.
Ilmanen, Antti, Forecasting U.S. Bond Returns, Journal of Fixed Income, Vol. 7, No. 1, 1997, pp. 22-37.
Ilmanen, Antti, Stock-Bond Correlations, Journal of Fixed Income, Vol. 13, No. 2, 2003, pp. 55-66.
Ilmanen, Antti. Expected Returns an Investor’s Guide to Harvesting Market Rewards. John Wiley, 2011.
Kolanovic, Marko, and Wei, Zhen, Momentum Strategies Across Asset Classes (April 2015). Available at https://www.cmegroup.com/education/files/jpm-momentum-strategies-2015-04-15-1681565.pdf
Measuring the Benefit of Diversification
By Nathan Faber
On November 5, 2018
In Portfolio Construction, Risk Management, Weekly Commentary
This post is available as a PDF download here.
Summary
Introduction
Diversification is a standard risk management tool in any portfolio. Reducing the impact of idiosyncratic risks in individual investments by holding a suite of stocks, asset classes, strategies, etc. produces a smoother investment ride most of the time and reduces the risk of negative surprises.
But in a world where we only experience one outcome out of the multitude of possibilities, gauging the benefit of diversification is difficult. It is even hard to do in hindsight, not so much because we can’t but more often that we won’t. The results already happened.
Over a single time period with no rebalancing, a diversified portfolio will underperform the best asset that it holds. This is a mathematical fact when there is any dispersion in the returns of the assets and it is why we have said that diversification will always disappoint. Our natural behavioral tendencies can often get the better of us, despite the fact that diversification might be doing a great job, especially when examined through the appropriate lens and measured in the context of what could have happened.
Last summer, we published a presentation entitled Building an Unconstrained Sleeve. In it, we looked at ways to combine traditional and non-traditional assets and strategies to target specific objectives: equity hedging, absolute return, and equity-like with downside management.
Now that we have 15 months of subsequent data for all the underlying strategies, we want to revisit that piece and explore the benefit of diversification in the context of hindsight.
A Recap of the Process
As a quick refresher, we included seven strategies and asset classes in the construction of our unconstrained sleeves:
While these strategies are surely not exhaustive, they cover a range of factors (value, momentum, low volatility, etc.) and a global set of asset classes (equities, bonds, commodities, and currencies) commonly included in unconstrained sleeves. They were also selected because many of these strategies are conveniently packaged as ETFs or mutual funds, making the resulting sleeves more easily implementable.
Over the 15 months, world equity was by far the best performer and the spread between best-performing and worst-performing positions exceeded 20 percentage points. If you wanted high returns – and going back to our statement about how diversification will always disappoint – you could have just held world equities and been quite content.
But putting ourselves back in June 2017, we did not know a priori that simply holding equities would have generated the highest returns. Looking at this type of chart in November 2008 would have led to a very different emotional conclusion.
The aim of our original study was to develop unconstrained sleeves that would meet their objectives regardless of how the future played out. Therefore, we employed a simulation-based method that aimed to preserve some of the unique correlation structure between the strategies across different market environments and reduce the risk of overfitting to a single realization of history. With this approach, we constructed portfolios that targeted three different objectives that investors might be interested in:
(Note: Greater detail about portfolio construction process, strategy descriptions, and performance attributes of each strategy can be found in our original presentation.)
But were our constructed portfolios successful in achieving their objectives out-of-sample? To analyze this question, as well as explore the benefits/detractors of diversification for each objective, we will calculate the distribution of what could have happened. The hope is that, each strategy would perform well relative to all other possible portfolios that could have been chosen for the sleeve.
Saying exactly what portfolios we could have chosen is where a little art comes into play. For example, in the equity-like strategies, it is difficult to say that a 100% bond portfolio would have ever been a viable option and therefore may not be an apt out-of-sample comparison.
However, since our original process did not have any specific override for these intuitive constraints, and since we do not wish to assert after-the-fact which portfolios would have been rejected, we will allow the entire potential allocation space to be fair game in our comparison.
There are a number of ways to sample the set of allocations over the 7 asset classes that could have formed the portfolios for each sleeve. Perhaps the most obvious choice would be to sample uniformly over the possible allocations. The issue to balance in this case is coverage of the space (a 6-dimensional simplex) with the number of samples. To be 95% confident that we sampled an allocation above 95% for only a single asset class would require nearly 200 million samples. We have used modified Sobol sequences in the past to ensure coverage of more of the space with fewer points. However, in the current case, to mimic the rounding that is often found in portfolio allocations, we will use a lattice of points spaced 2.5% apart covering the entire space. This requires just under 10 million points in the simulations.
Equity Hedge
This sleeve was designed to offset significant equity losses by limiting downside capture. The resulting optimized portfolio was relatively concentrated in two main positions that historically have exhibited low-to-negative correlations to equities and exhibited potential crisis alpha during significant and prolonged drawdowns.
Source: St. Louis Federal Reserve, MSCI, Salient, HFRI, CSI Analytics. Calculations by Newfound Research.
The down capture this portfolio during the out-of-sample period was 0.44. This result falls in the 70th percentile (that is, better than 70% of the other sample portfolios and where lower down-capture is better) when compared to the 10 million possible other portfolios we could have originally selected. Not surprisingly, the 100% intermediate-term Treasury portfolio had the best down capture (-0.05) over the out-of-sample. Of the portfolios with better down capture, Intermediate Treasuries and Macro – Income were generally the highest allocations.
This does not come as much of a surprise to anyone who has followed the managed futures space for the last 15 months. The category largely remains in a multi-year drawdown (peaking in early 2014), but it has also done little to offset the rapid sell-offs seen in equities in 2018. Therefore, with the full benefit of hindsight, any allocation to Macro – Trend in the original portfolio would be a detriment realizing our out-of-sample objective.
Yet even with this lackluster performance, an out-of-sample realized 70th percentile result over a short, 15-month horizon is a result to be pleased with.
Absolute Return
This sleeve was designed to seek a stable and consistent return stream in all market environments. We aimed to accomplish this by utilizing a risk parity approach. As expected, this sleeve holds all asset classes and is very well diversified across them.
To measure the success of the risk parity over the live period, we will look at the Gini coefficient for each of the ten million potential portfolios we could have initially selected. The Gini coefficient quantifies the equality of the distribution, with a value of 1 representing 100% concentration and 0 representing perfect equality.
The Gini coefficient of the actual portfolio was 0.25 which was in the 99.8th percentile of possible outcomes (i.e. highly diversified on a relative basis). Here, the percentile estimate is padded by the fact that many of the simulated portfolios (e.g. the 100% ones) would clearly not be close to equal risk contribution.
Did our original portfolio achieve its out-of-sample goal? Here, we can evaluate success as to whether the realized contribution to risk of each exposure was close to equivalent; i.e. did we actually achieve risk parity as desired? We can see below that indeed we did, with the main exception of Macro – Trend, which was the most volatile asset class over the period.
Over the sample space of potential portfolios, the portfolio with the minimum out-of-sample Gini coefficient (0.08) was tilted toward the less volatile and more diversifying asset classes (Intermediate Treasuries and Macro – Income). Even so, due to the limited granularity of the sampled portfolios, the risk contribution of Macro – Income was still half of that for each of the other strategies.
It is also worth noting how similar this solution is – generated with the complete benefit of hindsight – to our originally constructed portfolio.
Source: St. Louis Federal Reserve, MSCI, Salient, HFRI, CSI Analytics. Calculations by Newfound Research.
Equity-like with Downside Management
This sleeve was designed in an effort to capture equity market growth while managing the risk of severe and prolonged drawdowns. It was tilted toward the equity-like exposures with a split among risk management styles (trend, minimum volatility, macro strategies, etc.). The allocation to U.S. Treasuries is very small.
For this portfolio, we have two variables to analyze: the up capture relative to global equities and the Ulcer index, a measure of the severity and duration of drawdowns. In the construction of the sleeve, the target was to keep the Ulcer index less than 25% of the value for global equities. The joint distribution of these quantities over the live period is shown below with the actual values over the live period for the sleeve indicated.
The realized Ulcer level was 68% of that of world equity – a far cry from the 25% that the portfolio was optimized for – and was in the 42nd percentile while the up capture of 0.60 was in the 93rd percentile.
With the explicit goal of achieving a relative Ulcer level, a comparison against the entire potential allocation space of 10 million portfolios is not appropriate. Therefore, we reduce the set of 10 million comparative portfolios to only those that would have given a relative Ulcer index less than 25% compared to world equities, eliminating approximately 40% of possible portfolios.
The distributions of allocations to each of the strategies in the acceptable subset are shown below. We can see that the more diversifying strategies take on a larger range of allocations.
Interestingly, looking only over this subset of the original 10 million portfolios improves the out-of-sample up capture of our originally constructed portfolio to the 99th percentile but does not change the percentile of the Ulcer index over the live period. Why is this?
The correlation of the relative Ulcer index over the live period with that over the historical period is only 0.1, indicating that the out of sample data did not line up with our expectations at first glance. However, this makes sense when we recall that the optimization was carried out using data from much more extreme market environments (think 2001 and 2008). It is a good reminder that, just because you optimize for a certain parameter value does not mean you will get it over the live data.
Higher up-capture typically goes hand-in-hand with a higher Ulcer index, as higher return often requires bearing more risk. Therefore, one way to standardize our measures across the potential set of portfolios is to calculate the ratio of up-capture to the Ulcer index. With this transformation, the risk-adjusted up capture falls in the 87th percentile over the set of sample allocations, indicating a very high realized risk-adjusted return.
Conclusion
We only experience one path of the world and do not know the infinite alternate course history could have taken. But it is exactly this infinitude of alternate states that diversification is meant to address.
Diversification generally has no apparent benefit unless we envision what could have happened. Unfortunately our innate natures make this difficult. We do not often value our realized path in this context. After all, none of these alternate states actually happened, so it is difficult to picture what we did not experience.
A quantitative approach can yield a systematic way to evaluate the benefit (or detriment) of diversification. This way, we are not relying as much on intuition – how did our performance feel? – and are looking through a more objective lens at our initial decisions.
In the examples using the Unconstrained Sleeves, diversification focused on more than just returns. The objectives that initially went in to the portfolio construction were the parameters of interest.
Taking a systematic approach does not fully remove the art of the analysis, as was evident in the construction of the potential sample of portfolios used in the comparisons, but having a process can remove some of the behavioral biases that make sticking with a portfolio difficult in the first place.