The Research Library of Newfound Research

Category: Momentum Page 2 of 4

Risk-Adjusted Momentum: A Momentum and Low-Volatility Barbell?

This post is available as a PDF download here.

Summary

  • After the Great Financial Crisis, the Momentum factor has exhibited positive returns, but those returns have been largely driven by the short side of the portfolio.
  • One research note suggests that this is driven by increased risk aversion among investors, using the correlation of high volatility and low momentum baskets as evidence.
  • In contradiction to this point, the iShares Momentum ETF (MTUM) has generated positive excess annualized returns against its benchmark since inception. The same note suggests that this is due to the use of risk-adjusted momentum measures.
  • We explore whether risk-adjusting momentum scores introduces a meaningful and structural tilt towards low-volatility equities.
  • For the examples tested, we find that it does not, and risk-adjusted momentum portfolios behave very similarly to momentum portfolios.

A research note recently crossed my desk that aimed to undress the post-Global Financial Crisis (GFC) performance of the momentum factor in U.S. equities.  Not only have we witnessed a significant reduction in the factor’s return, but the majority of the return has been generated by the short side of the strategy, which can be more difficult for long-only investors to access.

Source: Sharadar.  Calculations by Newfound Research.  Past performance is not an indicator of future results.  Performance is backtested and hypothetical.  Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes.  Performance assumes the reinvestment of all distributions.  The Long (Alpha) strategy is a monthly rebalanced portfolio that goes long, in equal weight, the top 50 securities in the S&P 500 ranked on 12-1 month momentum and shorts an equal-weight S&P 500 portfolio.  The Short (Alpha) strategy is a monthly rebalanced portfolio that goes long an equal-weight S&P 500 portfolio and shorts, in equal weight, the bottom 50 securities in the S&P 500 ranked on 12-1 month momentum.

The note makes the narratively-appealing argument that the back-to-back recessions of the dot-com bubble and the Great Financial Crisis amplified investor risk aversion to downside losses.  The proposed evidence of this fact is the correlation of the cumulative alpha generated from shorting low momentum stocks and the cumulative alpha generated from shorting high volatility stocks.

While correlation does not imply causation, one argument might be that in a heightened period of risk aversion, investors may consistently punish higher risk stocks, causing them to become persistent losers.  Or, conversely, losers may be rapidly sold, creating both persistence and high levels of volatility.  We can arguably see this in the convergence of holdings in low momentum and high volatility stocks during “risk off” regimes.

Source: Sharadar.  Calculations by Newfound Research.  Past performance is not an indicator of future results.  Performance is backtested and hypothetical.  Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes.  Performance assumes the reinvestment of all distributions.  The HI VOL (Alpha) strategy is a monthly rebalanced portfolio that goes long an equal-weight S&P 500 portfolio and shorts, in equal weight, the bottom 50 securities in the S&P 500 ranked on trailing 252-day realized volatility.  The LO MOM (Alpha) strategy is a monthly rebalanced portfolio that goes long an equal-weight S&P 500 portfolio and shorts, in equal weight, the bottom 50 securities in the S&P 500 ranked on 12-1 month momentum.

Given these facts, we would expect long-only momentum investors to have harvested little out-performance in recent years.  Yet we find that the popular iShares Momentum ETF (MTUM) has out-performed the S&P 500 by 290 basis points per year since its inception in 2013.

The answer to this conundrum, as proposed by the research note, is that MTUM’s use of risk-adjusted momentum is the key.

If we think of risk-adjusted momentum as simply momentum divided by volatility (which is how MTUM defines it), we might interpret it as an integrated signal of both the momentum and low-volatility factors.  Therefore, risk-adjusting creates a multi-factor portfolio that tilts away from high volatility stocks.

And hence the out-performance.

Except if we actually create a risk-adjusted momentum portfolio, that does not appear to really be the case at all.

Source: Sharadar.  Calculations by Newfound Research.  Past performance is not an indicator of future results.  Performance is backtested and hypothetical.  Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes.  Performance assumes the reinvestment of all distributions.  The alpha of the risk-adjusted momentum strategy is defined as the return of a monthly rebalanced portfolio that goes long, in equal weight, the top 50 securities in the S&P 500 ranked on risk-adjusted momentum (12-1 month momentum divided by 252-day realized volatility) and shorts an equal-weight S&P 500 portfolio.

To be fair, MTUM’s construction methodology differs quite a bit from that employed herein.  We are simply equally-weighting the top 50 stocks in the S&P 500 when ranked by risk-adjusted momentum, whereas MTUM uses a blend of 6- and 12-month risk-adjusted momentum scores and then tilts market-capitalization weights based upon those scores.

Nevertheless, if we look at actual holdings overlap over time of our Risk-Adjusted Momentum portfolio versus Momentum and Low Volatility portfolios, not only do we see persistently higher overlap with the Momentum portfolio, but we see fairly low average overlap with the Low Volatility portfolio.

For the latter point, it is worth first anchoring ourselves to the standard overlap between Momentum and Low Volatility (green line below).  While we can see that the Risk-Adjusted Momentum portfolio does indeed have a higher average overlap with Low Volatility than does the Momentum portfolio, the excess tilt to Low Volatility due to the use of risk-adjusted momentum (i.e. the orange line minus the green line) appears rather small.  In fact, on average, it is just 10%.

Source: Sharadar.  Calculations by Newfound Research.  Past performance is not an indicator of future results.  Performance is backtested and hypothetical.  Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes.  Performance assumes the reinvestment of all distributions.  The risk-adjusted momentum strategy is a monthly rebalanced portfolio that goes long, in equal weight, the top 50 securities in the S&P 500 ranked on risk-adjusted momentum (12-1 month momentum divided by 252-day realized volatility).  The momentum strategy is a monthly rebalanced portfolio that goes long, in equal weight, the top 50 securities in the S&P 500 ranked on 12-1 month momentum.  The low volatility strategy is a monthly rebalanced portfolio that goes long, in equal weight, the top 50 securities in the S&P 500 ranked on trailing 252-day realized volatility.

This is further evident by looking at the actual returns of the strategies themselves:

Source: Sharadar.  Calculations by Newfound Research.  Past performance is not an indicator of future results.  Performance is backtested and hypothetical.  Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes.  Performance assumes the reinvestment of all distributions.  The risk-adjusted momentum strategy is a monthly rebalanced portfolio that goes long, in equal weight, the top 50 securities in the S&P 500 ranked on risk-adjusted momentum (12-1 month momentum divided by 252-day realized volatility).  The momentum strategy is a monthly rebalanced portfolio that goes long, in equal weight, the top 50 securities in the S&P 500 ranked on 12-1 month momentum.  The low volatility strategy is a monthly rebalanced portfolio that goes long, in equal weight, the top 50 securities in the S&P 500 ranked on trailing 252-day realized volatility.

The Risk-Adjusted Momentum portfolio performance tracks that of the Momentum portfolio very closely.

As it turns out, the step of adjusting for risk creates far less of a low volatility factor tilt in our top-decile portfolio than one might initially suspect.  (Or, at least, I’ll speak for myself: it created far less of a tilt than I expected.)

To understand this point, we will first re-write our risk-adjusted momentum signal as:

While trivial algebra, re-writing risk-adjusted momentum as the product of momentum and inverse volatility is informative to understanding why risk-adjusted momentum appears to load much more heavily on momentum than low volatility.

At a given point in time, it would appear as if Momentum and Low Volatility should have an equal influence on the rank of a given security.  However, we need to dig a level deeper and consider how changes in these variables impact change in risk-adjusted momentum.

Fortunately, the product makes this a trivial exercise: holding INVVOL constant, changes in MOM are scaled by INVVOL and vice versa.  This scaling effect can cause large changes in risk-adjusted momentum – and therefore ordinal ranking – particularly as MOM crosses the zero level.

Consider a trivial example where INVVOL is a very large number (e.g. 20) due to a security having a very low volatility profile (e.g. 5%).  This would appear, at first glance, to give a security a structural advantage and hence create a low volatility tilt in the portfolio.  However, a move from positive prior returns to negative prior returns would shift the security from ranking among the best to ranking among the worst in risk-adjusted momentum.1

A first order estimate of change in risk-adjusted momentum is:

So which term ultimately has more influence on the change in scores over time?

To get a sense of relative scale, we plot the cross-sectional mean absolute difference between the two terms over time.  This should, at least partially, capture interaction effects between the two terms.

Source: Sharadar.  Calculations by Newfound Research.

We can see that the term including the change in MOM has a much more significant influence on changes in risk-adjusted momentum than changes in INVVOL do.  Thus, we might expect a portfolio driven entirely by changes in momentum to share more in common with our risk-adjusted momentum portfolio than one driven entirely by changes in volatility.

This is somewhat evident when we plot the return of MTUM versus our top 50 style portfolios.  The correlation of daily returns between MTUM and our Momentum, Low Volatility, and Risk-Adjusted Momentum portfolios is 0.93, 0.72, and 0.93 respectively, further suggesting that MTUM is driven more by momentum than volatility.

Source: Sharadar.  Calculations by Newfound Research.  Past performance is not an indicator of future results.  Performance is backtested and hypothetical.  Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes.  Performance assumes the reinvestment of all distributions.  The risk-adjusted momentum strategy is a monthly rebalanced portfolio that goes long, in equal weight, the top 50 securities in the S&P 500 ranked on risk-adjusted momentum (12-1 month momentum divided by 252-day realized volatility).  The momentum strategy is a monthly rebalanced portfolio that goes long, in equal weight, the top 50 securities in the S&P 500 ranked on 12-1 month momentum.  The low volatility strategy is a monthly rebalanced portfolio that goes long, in equal weight, the top 50 securities in the S&P 500 ranked on trailing 252-day realized volatility.

This is only one part of the equation, however, as it is possible that changes to the risk-adjusted momentum score are so small – despite being largely driven by momentum – that relative rankings never actually change.  Or, because we have constructed our portfolios by choosing only the top 50 ranked securities, that momentum does drive the majority of change across the entire universe, but the top 50 are always structurally advantaged by the non-linear scaling of low volatility.

To create a more accurate picture, we can rank-weight the entire S&P 500 and evaluate the holdings overlap over time.

Source: Sharadar.  Calculations by Newfound Research.

Note that by now including all securities, and not just selecting the top 50, the overlap with both the Momentum and Low Volatility portfolios naturally appears higher on average.  Nonetheless, we can see that the overlap with the Momentum portfolio is consistently higher than that of the Low Volatility portfolio, again suggesting that momentum has a larger influence on the overall portfolio composition than volatility does.

Conclusion

Without much deep thought, it would be easy to assume that a risk-adjusted momentum measure – i.e. prior returns divided by realized volatility – would tilt a portfolio towards both prior winners and low-volatility securities, resulting in a momentum / low-volatility barbell.

Upon deeper consideration, however, the picture complicates quickly.  For example, momentum can be both positive and negative; dividing by volatility creates a non-linear impact; and momentum tends to change more rapidly than volatility.

We do not attempt to derive a precise, analytical equation that determines which of the two variables ultimately drives portfolio composition, but we do construct long-only example portfolios for empirical study.  We find that a high-concentration risk-adjusted momentum portfolio has significantly more overlap in holdings with a traditional momentum portfolio than a low-volatility portfolio, resulting in a more highly correlated return stream.

The most important takeaway from this note is that intuition can be deceiving: it is important to empirically test our assumptions to ensure we truly understand the impact of our strategy construction choices.

 


 

Yield Curve Trades with Trend and Momentum

This post is available as a PDF download here.

Summary­

  • Yield curve changes over time can be decomposed into Level, Slope, and Curvature changes, and these changes can be used to construct portfolios.
  • Market shocks, monetary policy, and preferences of different segments of investors (e,g. pensions) may create trends within these portfolios that can be exploited with absolute and relative momentum.
  • In this commentary, we investigate these two factors in long/short and long/flat implementations and find evidence of success with some structural caveats.
  • Despite this, we believe the results have potential applications as either a portable beta overlay or for investors who are simply trying to figure out how to position their duration exposure.
  • Translating these quantitative signals into a forecast about yield-curve behavior may allow investors to better position their fixed income portfolios.

It has been well established in fixed income literature that changes to the U.S. Treasury yield curve can be broken down into three primary components: a level shift, a slope change, and a curvature twist.

A level change occurs when rates increase or decrease across the entire curve at once.  A slope change occurs when short-term rates decrease (increase) while long-term rates increase (decrease).  Curvature defines convexity and concavity changes to the yield curve, capturing the bowing that occurs towards the belly of the curve.

Obviously these three components do not capture 100% of changes in the yield curve, but they do capture a significant portion of them. From 1962-2019 they explain 99.5% of the variance in daily yield curve changes.

We can even decompose longer-term changes in the yield curve into these three components.  For example, consider how the yield curve has changed in the three years from 6/30/2016 to 6/30/2019.

Source: Federal Reserve of St. Louis.

We can see that there was generally a positive increase across the entire curve (i.e. a positive level shift), the front end of the curve increased more rapidly (i.e. a flattening slope change) and the curve flipped from concave to convex (i.e. an inverted bowing of the curve).

Using the historical yield curve changes, we can mathematically estimate these stylized changes using principal component analysis.  We plot the loadings of the first three components below for this three-year change.

Source: Federal Reserve of St. Louis.  Calculations by Newfound Research.

We can see that –PC1– has generally positive loadings across the entire curve, and therefore captures our level shift component.  –PC2– exhibits negative loadings on the front end of the curve and positive loadings on the back, capturing our slope change.  Finally, –PC3– has positive loadings from the 1-to-5-year part of the curve, capturing the curvature change of the yield curve itself.

Using a quick bit of linear algebra, we can find the combination of these three factors that closely matches the change in the curve from 6/30/2016 to 6/30/2019.  Comparing our model versus the actual change, we see a reasonably strong fit.

Source: Federal Reserve of St. Louis.  Calculations by Newfound Research.

So why might this be useful information?

First of all, we can interpret our principal components as if they are portfolios.  For example, our first principal component is saying, “buy a portfolio that is long interest rates across the entire curve.”  The second component, on the other hand, is better expressed as, “go short rates on the front end of the curve and go long rates on the back end.”

Therefore, insofar as we believe changes to the yield curve may exhibit absolute or relative momentum, we may be able to exploit this momentum by constructing a portfolio that profits from it.

As a more concrete example, if we believe that the yield curve will generally steepen over the next several years, we might buy 2-year U.S. Treasury futures and short 10-year U.S. Treasury futures.  The biggest wrinkle we need to deal with is the fact that 2-year U.S. Treasury futures will exhibit very different sensitivity to rate changes than 10-year U.S. Treasury futures, and therefore we must take care to duration-adjust our positions.

Why might such changes exhibit trends or relative momentum?

  • During periods where arbitrage capital is low, trends may emerge. We might expect this during periods of extreme market shock (e.g. recessions) where we might also see the simultaneous influence of monetary policy.
  • Effects from monetary policy may exhibit autocorrelation. If investors exhibit any anchoring to prior beliefs, they might discount future policy changes.
  • Segmented market theory suggests that different investors tend to access different parts of the curve (e.g. pensions may prefer the far end of the curve for liability hedging purposes). Information flow may therefore be segmented, or even impacted by structural buyers/sellers, creating autocorrelation in curve dynamics.

In related literature, Fan et al (2019) find that the net hedging or speculative position has strong cross-sectional explanatory power for agricultural and currency futures returns, but not in fixed income markets.  To quote,

“In sharp contrast, we find no evidence of a significant speculative pressure premium in the interest rate and fixed income futures markets. Thus, albeit from the lens of different research questions, our paper reaffirms Bessembinder (1992) and Moskowitz et al. (2012) in establishing that fixed income futures markets behave differently from other futures markets as regards the information content of the net positions of hedgers or speculators.  A hedgers-to-speculators risk transfer in fixed income futures markets would be obscured if agents choose to hedge their interest rate risk with other strategies (i.e. immunization, temporary change in modified duration).”

Interestingly, Markowitz et al. (2012) suggest that speculators may be profiting from time-series momentum at the expense of hedgers, suggesting that they earn a premium for providing liquidity.  Such does not appear to be the case for fixed income futures, however.

As far as we are aware, it has not yet been tested in the literature whether the net speculator versus hedger position has been tested for yield curve trades, and it may be possible that a risk transfer does not exist at the individual maturity basis, but rather exists for speculators willing to bear level, slope, or curvature risk.

Stylized Component Trades

While we know the exact loadings of our principal components (i.e. which maturities make up the principal portfolios), to avoid the risk of overfitting our study we will capture level, slope, and curvature changes with three different stylized portfolios.

To implement our portfolios, we will buy a basket of 2-, 5-, and 10-year U.S. Treasury futures contracts (“UST futures”).  We will assume that the 5-year contract has 2.5x the duration of the 2-year contract and the 10-year contract has 5x the duration of the 2-year contract.

To capture a level shift in the curve, we will go long across all the contracts.  Specifically, for every dollar of 2-year UST futures exposure we purchase, we will buy $0.4 of 5-year UST futures and $0.20 of 10-year UST futures.  This creates equal duration exposure across the entire curve.

To capture slope change, we will go short 2-year UST futures and long the 10-year UST futures, holding zero position in the 5-year UST futures.  As before, we will duration-adjust our positions such that for each $1 short of the 2-year UST futures position, we are $0.20 long the 10-year UST futures.

Finally, to capture curvature change we will construct a butterfly trade where we short the 2- and 10-year UST futures and go long the 5-year UST futures.  For each $1 long in the 5-year UST futures, we will short $1.25 of 2-year UST futures and $0.25 of 10-year UST futures.

Note that the slope and curvature portfolios are implemented such that they are duration neutral (based upon our duration assumptions) so a level shift in the curve will generate no profit or loss.

An immediate problem with our approach arises when we actually construct these portfolios.  Unless adjusted, the volatility exhibited across these trades will be meaningfully different.  Therefore, we target a constant 10% volatility for all three portfolios by adjusting the notional exposure of each portfolio based upon an exponentially-weighted estimate of prior 3-month realized volatility.

Source: Stevens Futures.  Calculations by Newfound Research.  Past performance is not an indicator of future results.  Performance is backtested and hypothetical.  Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes.  Performance assumes the reinvestment of all distributions.

It appears, at least to the naked eye, that changes in the yield curve – and therefore the returns of these portfolios – may indeed exhibit positive autocorrelation.  For example, –Slope– appears to exhibit significant trends from 2000-2004, 2004-to 2007, and 2007-2012.

Whether those trends can be identified and exploited is another matter entirely.  Thus, with our stylized portfolios in hand, we can begin testing.

Trend Signals

We begin our analysis by exploring the application of time-series momentum signals across all three of the portfolios.  We evaluate lookback horizons ranging from 21-to-294 trading days (or, approximately 1-to-14 months).  Portfolios assume a 21-trading-day holding period and are implemented using 21 overlapping portfolios to control for timing luck.

Source: Stevens Futures.  Calculations by Newfound Research.  Past performance is not an indicator of future results.  Performance is backtested and hypothetical.  Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes.  Performance assumes the reinvestment of all distributions.

Some observations:

  • Time-series momentum appears to generate positive returns for the Level portfolio. Over the period tested, longer-term measures (e.g. 8-to-14-month horizons) offer more favorable results.
  • Time-series momentum on the Level portfolio does, however, underperform naïve buy-and-hold. The returns of the strategy also do not offer a materially improved Sharpe ratio or drawdown profile.
  • Time-series momentum also appears to capture trends in the Slope portfolio. Interestingly, both short- and long-term lookbacks are less favorable over the testing period than intermediate-term (e.g. 4-to-8 month) ones.
  • Finally, time-series momentum appeared to offer no edge in timing curvature trades.

Here we should pause to acknowledge that we are blindly throwing strategies at data without much forethought.  If we consider, however, that we might reasonably expect duration to be a positively compensated risk premium, as well as the fact that we would expect the futures to capture a generally positive roll premium (due to a generally upward sloping yield curve), then explicitly shorting duration risk may not be a keen idea.

In other words, it may make more sense to implement our level trade as a long/flat rather than a long/short.  When implemented in this fashion, we see that the annualized return versus buy-and-hold is much more closely maintained while volatility and maximum drawdown are significantly reduced.

Source: Stevens Futures.  Calculations by Newfound Research.  Past performance is not an indicator of future results.  Performance is backtested and hypothetical.  Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes.  Performance assumes the reinvestment of all distributions.

Taken together, it would appear that time-series momentum may be effective for trading the persistence in Level and Slope changes, though not in Curvature.

Momentum Signals

If we treat each stylized portfolio as a separate asset, we can also consider the returns of a cross-sectional momentum portfolio.  For example, each month we can rank the portfolios based upon their prior returns.  The top-ranking portfolio is held long; the 2nd ranked portfolio is held flat; and the 3rd ranked portfolio is held short.

As before, we will evaluate lookback horizons ranging from 21-to-294 trading days (approximately 1-to-14 months) and assuming a 21-trading-day holding period, implemented with 21 overlapping portfolios.

Results – as well as example allocations from the 7-month lookback portfolio – are plotted below.

Source: Stevens Futures.  Calculations by Newfound Research.  Past performance is not an indicator of future results.  Performance is backtested and hypothetical.  Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes.  Performance assumes the reinvestment of all distributions.

Here we see very strong performance results except in the 1- and 2-month lookback periods.  The allocation graph appears to suggest that results are not merely the byproduct of consistently being long or short a particular portfolio and the total return level appears to suggest that the portfolio is able to simultaneously profit from both legs.

If we return back to the graph of the stylized portfolios, we can see a significant negative correlation between the Level and Slope portfolios from 1999 to 2011.  The negative correlation appears to disappear after this point, almost precisely coinciding with a 6+ year drawdown in the cross-sectional momentum strategy.

This is due to a mixture of construction and the economic environment.

From a construction perspective, consider that the Level portfolio is long the 2-, the 5-, and the 10-year UST futures while the Slope portfolio is short 2-year and long the 10-year UST futures.  Since the positions are held in a manner that targets equivalent duration exposure, when the 2-year rate moves more than the 10-year rate, we end up in a scenario where the two trades have negative correlation, since one strategy is short and the other is long the 2-year position.  Conversely, if the 10-year rate moves more than the 2-year rate, we end up in a scenario of positive correlation, since both strategies are long the 10-year.

Now consider the 1999-2011 environment.  We had an easing cycle during the dot-com bust, a tightening cycle during the subsequent economic expansion, and another easing cycle during the 2008 crisis.  This caused significantly more directional movement in the 2-year rate than the 10-year rate.  Hence, negative correlation.

After 2008, however, the front end of the curve became pinned to zero.  This meant that there was significantly more movement in the 10-year than the 2-year, leading to positive correlation in the two strategies.  With positive correlation there is less differentiation among the two strategies and so we see a considerable increase in strategy turnover – and effectiveness – as momentum signals become less differentiated.

With that in mind, had we designed our Slope portfolio to be long 2-year UST futures and short 10-year UST futures (i.e. simply inverted the sign of our allocations), we would have seen positive correlation between Level and Slope from 1999 to 2011, resulting in a very different set of allocations and returns.  In actually testing this step, we find that the 1999-2011 period is no longer dominated by Level versus Slope trades, but rather Slope versus Curvature.  Performance of the strategy is still largely positive, but the spread among specifications widens dramatically.

Taken all together, it is difficult to conclude that the success of this strategy was not, in essence, driven almost entirely by autocorrelation in easing and tightening cycles with a relatively stable back end of the curve.1   Given that there have only been a handful of full rate cycles in the last 20 years, we’d be reluctant to rely too heavily on the equity curve of this strategy as evidence of a robust strategy.

Conclusion

In this research note, we explored the idea of generating stylized portfolios designed to isolate and profit from changes to the form of the yield curve.  Specifically, using 2-, 5-, and 10-year UST futures we design portfolios that aim to profit from level, slope, and curvature changes to the US Treasury yield curve.

With these portfolios in hand, we test whether we can time exposure to these changes using time-series momentum.

We find that while time-series momentum generates positive performance for the Level portfolio, it fails to keep up with buy & hold.  Acknowledging that level exposure may offer a positive long-term risk premium, we adjust the strategy from long/short to long/flat and are able to generate a substantially improved risk-adjusted return profile.

Time-series momentum also appears effective for the Slope portfolio, generating meaningful excess returns above the buy-and-hold portfolio.

Applying time-series momentum to the Curvature portfolio does not appear to offer any value.

We also tested whether the portfolios can be traded employing cross-sectional momentum.  We find significant success in the approach but believe that the results are an artifact of (1) the construction of the portfolios and (2) a market regime heavily influenced by monetary policy.  Without further testing, it is difficult to determine if this approach has merit.

Finally, even though our study focused on portfolios constructed using U.S. Treasury futures, we believe the results have potential application for investors who are simply trying to figure out how to position their duration exposure.  For example, a signal to be short (or flat) the Level portfolio and long the Slope portfolio may imply a view of rising rates with a flattening curve.  Translating these quantitative signals into a forecast about yield-curve behavior may allow investors to better position their fixed income portfolios.

Since this study utilized U.S. Treasury futures, these results translate well to implementing a portable beta strategy. For example, if you were an investor with a desired risk profile on par with 100% equities, you could add bond exposure on top of the higher risk portfolio. This would add a (generally) diversifying return source with only a minor cash drag to the extent that margin requirements dictate.

 


 

Ensemble Multi-Asset Momentum

This post is available as a PDF download here.

Summary­

  • We explore a representative multi-asset momentum model that is similar to many bank-based indexes behind structured products and market-linked CDs.
  • With a monthly rebalance cycle, we find substantial timing luck risk.
  • Using the same basic framework, we build a simple ensemble approach, diversifying both process and rebalance timing risk.
  • We find that the virtual strategy-of-strategies is able to harvest diversification benefits, realizing a top-quartile Sharpe ratio with a bottom-quartile maximum drawdown.

Early in the 2010s, a suite of index-linked products came to market that raised billions of dollars.  These products – offered by just about every major bank – sought to simultaneously exploit the diversification benefits of modern portfolio theory and the potential for excess returns from the momentum anomaly.

While each index has its own bells and whistles, they generally follow the same approach:

  • A global, multi-asset universe covering equities, fixed income, and commodities.
  • Implemented using highly liquid ETFs.
  • Asset class and position-level allocation limits.
  • A monthly rebalance schedule.
  • A portfolio optimization that seeks to maximize weighted prior returns (e.g. prior 6 month returns) while limiting portfolio volatility to some maximum threshold (e.g. 5%).

And despite their differences, we can see in plotting their returns below that these indices generally share a common return pattern, indicating a common, driving style.

Source: Bloomberg.

Frequent readers will know that “monthly rebalance” is an immediate red flag for us here at Newfound: an indicator that timing luck is likely lurking nearby.

Replicating Multi-Asset Momentum

To test the impact of timing luck, we replicate a simple multi-asset momentum strategy based upon available index descriptions.

We rebalance the portfolio at the end of each month.  Our optimization process seeks to identify the portfolio with a realized volatility less than 5% that would have maximized returns over the prior six months, subject to a number of position and asset-level limits.  If the 5% volatility target is not achievable, the target is increased by 1% until a portfolio can be constructed that satisfies our constraints.

We use the following ETFs and asset class limits:

As a naïve test for timing luck, rather than assuming the index rebalances at the end of each month, we will simply assume the index rebalances every 21 trading days. In doing so, we can construct 21 different variations of the index, each representing the results from selecting a different rebalance date.

Source: CSI Analytics; Calculations by Newfound Research.  Results are backtested and hypothetical.  Results assume the reinvestment of all distributions.  Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes, with the exception of underlying ETF expense ratios.  Past performance is not an indicator of future results. 

As expected, the choice of rebalance date has a meaningful impact.  Annualized returns range from 4.7% to 5.5%, Sharpe ratios range from 0.6 to 0.9, and maximum drawdowns range from 9.9% to 20.8%.

On a year-by-year basis, the only thing that is consistent is the large spread between the worst and best-performing rebalance date.  On average, the yearly spread exceeds 400 basis points.

Min

Max

2008*

-9.91%

0.85%

2009

2.36%

4.59%

2010

6.46%

9.65%

2011

3.31%

10.15%

2012

6.76%

10.83%

2013

3.42%

6.13%

2014

5.98%

10.60%

2015

-5.93%

-2.51%

2016

4.18%

8.45%

2017

9.60%

11.62%

2018

-6.00%

-2.53%

2019 YTD

5.93%

10.01%

* Partial year starting 7/22/2018

We’ve said it in the past and we’ll say it again: timing luck can be the difference between hired and fired.  And while we’d rather be on the side of good luck, the lack of control means we’d rather just avoid this risk all together.

If it isn’t nailed down for a reason, diversify it

The choice of when to rebalance is certainly not the only free variable of our multi-asset momentum strategy.  Without an explicit view as to why a choice is made, our preference is always to diversify so as to avoid specification risk.

We will leave the constraints (e.g. volatility target and weight constraints) well enough alone in this example, but we should consider the process by which we’re measuring past returns as well as the horizon over which we’re measuring it.  There is plenty of historical efficacy to using prior 6-month total returns for momentum, but no lack of evidence supporting other lookback horizons or measurements.

Therefore, we will use three models of momentum: prior total return, the distance of price from its moving average, and the distance of a short-term moving average from a longer-term moving average.  We will vary the parameterization of these signals to cover horizons ranging from 3- to 15-months in length.

We will also vary which day of the month the portfolio rebalances on.

By varying the signal, the lookback horizon, and the rebalance date, we can generate hundreds of different portfolios, all supported by the same theoretical evidence but having slightly different realized results due to their particular specification.

Our robust portfolio emerges by calculating the weights for all these different variations and averaging them together, in many ways creating a virtual strategy-of-strategies.

Below we plot the result of this –ensemble approach– as compared to a –random sample of the underlying specifications–.  We can see that while there are specifications that do much better, there are also those that do much worse.  By employing an ensemble approach, we forgo the opportunity for good luck and avoid the risk of bad luck.   Along the way, though, we may pick up some diversification benefits: the Sharpe ratio of the ensemble approach fell in the top quartile of specifications and its maximum drawdown was in the bottom quartile (i.e. lower drawdown).

Source: CSI Analytics; Calculations by Newfound Research.  Results are backtested and hypothetical.  Results assume the reinvestment of all distributions.  Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes, with the exception of underlying ETF expense ratios.  Past performance is not an indicator of future results.

Conclusion

In this commentary, we again demonstrate the potential risk of needless specification and the potential power of diversification.

Using a popular multi-asset momentum model as our example, we again find a significant amount of timing luck lurking in a monthly rebalance specification.  By building a virtual strategy-of-strategies, we are able to manage this risk by partially rebalancing our portfolio on different days.

We go a step further, acknowledging that processrepresents another axis of risk. Specifically, we vary both how we measure momentum and the horizon over which it is measured.  Through the variation of rebalance days, model specifications, and lookback horizons, we generate over 500 different strategy specifications and combine them into a virtual strategy-of-strategies to generate our robust multi-asset momentum model.

As with prior commentaries, we find that the robust model is able to effectively reduce the risk of both specification and timing luck.  But perhaps most importantly, it was able to harvest the benefits of diversification, realizing a Sharpe ratio in the top quartile of specifications and a maximum drawdown in the lowest quartile.

Value and the Credit Spread

This post is available as a PDF download here.

Summary­

  • We continue our exploration of quantitative signals in fixed income.
  • We use a measure of credit curve steepness as a valuation signal for timing exposure between corporate bonds and U.S. Treasuries.
  • The value signal generates a 0.84% annualized return from 1950 to 2019 but is highly regime dependent with meaningful drawdowns.
  • Introducing a naïve momentum strategy significantly improves the realized Sharpe ratio and drawdown profile, but does not reduce the regime-based nature of the returns.
  • With a combined return of just 1.0% annualized, this strategy may not prove effective after appropriate discounting for hindsight bias, costs, and manager fees. The signal itself, however, may be useful in other contexts.

In the last several weeks, we have been exploring the application of quantitative signals to fixed income.

Recent cross-sectional studies also build off of further research we’ve done in the past on applying trend, value, carry, and explicit measures of the bond risk premium as duration timing mechanisms (see Duration Timing with Style Premia; Timing Bonds with Value, Momentum, and Carry; and A Carry-Trend-Hedge Approach to Duration Timing).

Broadly, our studies have found:

  • Value (measured as deviation from real yield), momentum (prior 12-month returns), and carry (yield-to-worst) were all profitable factors in cross-section municipal bond sector long/short portfolios.
  • Value (measured as deviation from real yield), trend (measured as prior return), and carry (measured as term spread + roll yield) have historically been effective timing signals for U.S. duration exposure.
  • Prior short-term equity returns proved to be an effective signal for near-term returns in U.S. Treasuries (related to the “flight-to-safety premium”).
  • Short-term trend proved effective for high yield bond timing, but the results were vastly determined by performance in 2000-2003 and 2008-2009. While the strategy appeared to still be able to harvest relative carry between high-yield bonds and core fixed income in other environments, a significant proportion of returns came from avoiding large drawdowns in high yield.
  • Short-term cross-section momentum (prior total returns), value (z-score of loss-adjusted yield-to-worst), carry (loss-adjusted yield-to-worst), and 3-year reversals all appeared to offer robust signals for relative selection in fixed income sectors. The time period covered in the study, however, was limited and mostly within a low-inflation regime.
  • Application of momentum, value, carry, and reversal as timing signals proved largely ineffective for generating excess returns.

In this week’s commentary, we want to further contribute to research by introducing a value timing signal for credit.

Finding Value in Credit

Identifying a value signal requires some measure or proxy of an asset’s “fair” value. What can make identifying value in credit so difficult is that there are a number of moving pieces.

Conceptually, credit spreads should be proportional to default rates, recovery rates, and aggregate risk appetite, making determining whether spreads are cheap or expensive rather complicated.  Prior literature typically tackles the problem with one of three major categories of models:

  • Econometric: “Fair value” of credit spreads is modeled through a regression that typically explicitly accounts for default and recovery rates. Inputs are often related to economic and market variables, such as equity market returns, 10-year minus 2-year spreads, corporate leverage, and corporate profitability.  Bottom-up analysis may use metrics such as credit quality, maturity, supply, and liquidity.
  • Merton Model: Based upon the idea the bond holders have sold a put on a company’s asset value. Therefore, options pricing models can be used to calculate a credit spread.  Inputs include the total asset value, asset volatility, and leverage of the firm under analysis.
  • Spread Signal: A simple statistical model derived from credit spread themselves. For example, a rolling z-score of option-adjusted spreads or deviations from real yield.  Other models (e.g. Haghani and Dewey (2016)) have used spread plus real yield versus a long-run constant (e.g. “150 basis points”).

The first method requires a significant amount of economic modeling.  The second approach requires a significant amount of extrapolation from market data.  The third method, while computationally (and intellectually) less intensive, requires a meaningful historical sample that realistically needs to cover at least one full market cycle.

While attractive for its simplicity, there are a number of factors that complicate the third approach.

First, if spreads are measured against U.S. Treasuries, the metric may be polluted by information related to Treasuries due to their idiosyncratic behavior (e.g. scarcity effects and flight-to-safety premiums).  Structural shifts in default rates, recovery rates, and risk appetites may also cause a problem, as spreads may appear unduly thin or wide compared to past regimes.

In light of this, in this piece we will explore a similarly simple-to-calculate spread signal, but one that hopefully addresses some of these short-comings.

Baa vs. Aaa Yields

In order to adjust for these problems, we propose looking at the steepness of the credit curve itself by comparing prime / high-grade yield versus lower-medium grade yields.  For example, we could compare Moody’s Season Aaa Corporate Bond Yield and Moody’s Season Baa Corporate Bond Yield.  In fact, we will use these yields for the remainder of this study.

We may be initially inclined to measure the steepness of the credit curve by taking the difference in yield spreads, which we plot below.

Source: Federal Reserve of St. Louis.  Calculations by Newfound Research.

We can find a stronger mean-reverting signal, however, if we calculate the log-difference in yields.

Source: Federal Reserve of St. Louis.  Calculations by Newfound Research.

We believe this transformation is appropriate for two reasons.  First, the log transformation helps control for the highly heteroskedastic and skewed nature of credit spreads.

Second, it helps capture both the steepness andthe level of the credit curve simultaneously.  For example, a 50-basis-point premium when Aaa yield is 1,000 basis points is very different than when Aaa yield is 100 basis points.  In the former case, investors may not feel any pressure to bear excess risk to achieve their return objectives, and therefore a 50-basis-point spread may be quite thin.  In the latter case, 50 basis points may represent a significant step-up in relative return level in an environment where investors have either low default expectations, high recovery expectations, high risk appetite, or some combination thereof.

Another way of interpreting our signal is that it informs us about the relative decisions investors must make about their expected dispersion in terminal wealth.

Constructing the Value Strategy

With our signal in hand, we can now attempt to time credit exposure.  When our measure signals that the credit curve is historically steep, we will take credit risk.  When our signal indicates that the curve is historically flat we will avoid it.

Specifically, we will construct a dollar-neutral long/short portfolio using the Dow Jones Corporate Bond Index (“DJCORP”) and a constant maturity 5-year U.S. Treasury index (“FV”).   We will calculate a rolling z-score of our steepness measure and go long DJCORP and short FV when the z-score is positive and place the opposite trade when the z-score is negative.

In line with prior studies, we will apply an ensemble approach.  Portfolios are reformed monthly using formation ranging from 3-to-6 years with holding periods ranging from 1-to-6 months.  Portfolio weights for the resulting strategy are plotted below.

Source: Federal Reserve of St. Louis and Global Financial Data.  Calculations by Newfound Research.

We should address the fact that while both corporate bond yield and index data is available back to the 1930s, we have truncated our study to ignore dates prior to 12/1949 to normalize for a post-war period.  It should be further acknowledged that the Dow Jones Corporate Bond index used in this study did not technically exist until 2002.  Prior to that date, the index return tracks a Dow Jones Bond Aggregate, which was based upon four sub-indices: high-grade rails, second-grade rails, public utilities, and industries.  This average existed from 1915 to 1976, when it was replaced with a new average at that point when the number of railway bonds was no longer sufficient to maintain the average.

Below we plot the returns of our long/short strategy.

Source: Federal Reserve of St. Louis and Global Financial Data.  Calculations by Newfound Research. Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

The strategy has an annualized return of 0.84% with a volatility of 3.89%, generating a Sharpe ratio of 0.22.  Of course, long-term return statistics belie investor and manager experience, with this strategy exhibiting at least two periods of decade-plus-long drawdowns.  In fact, the strategy really has just four major return regimes: 1950 to 1970 (-0.24% annualized), 1970 to 1987 (2.59% annualized), 1987 to 2002 (-0.33%), and 2002 to 2019 (1.49% annualized).

Try the strategy out in the wrong environment and we might be in for a lot of pain.

Momentum to the Rescue?

It is no secret that value and momentum go together like peanut butter and jelly. Instead of tweaking our strategy to death in order to improve it, we may just find opportunity in combining it with a negatively correlated signal.

Using an ensemble model, we construct a dollar-neutral long/short momentum strategy that compares prior total returns of DJCORP and FV.  Rebalanced monthly, the portfolios use formation periods ranging from 9-to-15 months and holding periods ranging from 1-to-6 months.

Below we plot the growth of $1 in our value strategy, our momentum strategy, and a 50/50 combination of the two strategies that is rebalanced monthly.

Source: Federal Reserve of St. Louis and Global Financial Data.  Calculations by Newfound Research. Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

The first thing we note is – even without calculating any statistics – the meaningful negative correlation we see in the equity curves of the value and momentum strategies.  This should give us confidence that there is the potential for significant improvement through diversification.

The momentum strategy returns 1.11% annualized with a volatility of 3.92%, generating a Sharpe ratio of 0.29.  The 50/50 combination strategy, however, returns 1.03% annualized with a volatility of just 2.16% annualized, resulting in a Sharpe ratio of 0.48.

While we still see significant regime-driven behavior, the negative regimes now come at a far lower cost.

Conclusion

In this study we introduce a simple value strategy based upon the steepness of the credit curve.  Specifically, we calculated a rolling z-score on the log-difference between Moody’s Seasoned Baa and Aaa yields.  We interpreted a positive z-score as a historically steep credit curve and therefore likely one that would revert.  Similarly, when z-scores were negative, we interpreted the signal as a flat credit curve, and therefore a period during which taking credit risk is not well compensated.

Employing an ensemble approach, we generated a long/short strategy that would buy the Dow Jones Corporate Bond Index and short 5-year U.S. Treasuries when credit appeared cheap and place the opposite trade when credit appeared expensive.  We found that this strategy returned 0.84% annualized with a volatility of 3.89% from 1950 to 2019.

Unfortunately, our value signal generated significantly regime-dependent behavior with decade-long drawdowns.  This not only causes us to question the statistical validity of the signal, but also the practicality of implementing it.

Fortunately, a naively constructed momentum signal provides ample diversification.  While a combination strategy is still highly regime-driven, the drawdowns are significantly reduced.  Not only do returns meaningfully improve compared to the stand-alone value signal, but the Sharpe ratio more-than-doubles.

Unfortunately, our study leveraged a long/short construction methodology.  While this isolates the impact of active returns, long-only investors must cut return expectations of the strategy in half, as a tactical timing model can only half-implement this trade without leverage.  A long-only switching strategy, then, would only be expected to generate approximately 0.5% annualized excess return above a 50% Dow Jones Corporate Bond Index / 50% 5-Year U.S. Treasury index portfolio.

And that’s before adjustments for hindsight bias, trading costs, and manager fees.

Nevertheless, more precise implementation may lead to better results.  For example, our indices neither perfectly matched the credit spreads we evaluated, nor did they match each other’s durations.  Furthermore, while this particular implementation may not survive costs, this signal may still provide meaningful information for other credit-based strategies.

Quantitative Styles and Multi-Sector Bonds

This post is available as a PDF download here.

Summary­

  • In this commentary we explore the application of several quantitative signals to a broad set of fixed income exposures.
  • Specifically, we explore value, momentum, carry, long-term reversals, and volatility signals.
  • We find that value, 3-month momentum, carry, and 3-year reversals all create attractive quantile profiles, potentially providing clues for how investors might consider pursuing higher returns or lower risk.
  • This study is by no means comprehensive and only intended to invite further research and conversation around the application of quantitative styles across fixed income exposures.

In Navigating Municipal Bonds with Factors, we employed momentum, value, carry, and low-volatility signals to generate a sector-based approach to navigating municipal bonds.

In this article, we will introduce an initial data dive into applying quantitative signals to a broader set of fixed income exposures.  Specifically, we will incorporate 17 different fixed income sectors, spanning duration, credit, and geographic exposure.

  • U.S. Treasuries: Near (3-Month), short (1-3 Year), mid (3-5 Year) intermediate (7-10 Year), and long (20+ Year).
  • Investment-Grade Corporates: Short-term, intermediate-term, and Floating Rate corporate bonds.
  • High Yield: Short- and intermediate-term high yield.
  • International Government Bonds: Currency hedged and un-hedged government bonds.
  • Emerging Market: Local and US dollar denominated.
  • TIPs: Short- and intermediate-term TIPs.
  • Mortgage-Backed: Investment grade mortgage-backed bonds.

In this study, each exposure is represented by a corresponding ETF.  We extend our research prior to ETF launch by employing underlying index data the ETF seeks to track.

The quantitative styles we will explore are:

  • Momentum: Buy recent winners and sell recent losers.
  • Value: Buy cheap and sell expensive.
  • Carry: Buy high carry and sell low carry.
  • Reversal: Buy long-term losers and sell long-term winners.
  • Volatility: Buy high volatility and sell low volatility.1

The details of each style are explained in greater depth in each section below.

Note that the analysis herein is by no means meant to be prescriptive in any manner, nor is it a comprehensive review.  Rather, it is meant as a launching point for further commentaries we expect to write.

At the risk of spoiling the conclusion, below we plot the annualized returns and volatility profiles of dollar-neutral long-short portfolios.2  We can see that short-term Momentum, Value, Carry, and Volatility signals generate positive excess returns over the testing period.

Curiously, longer-term Momentum does not seem to be a profitable strategy, despite evidence of this approach being rather successful for many other asset classes.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

However, these results are not achievable by most investors who may be constrained to a long-only implementation.  Even when interpreted as over- and under-weight signals, the allocations in the underlying long/short portfolios differ so greatly from benchmark exposures, they would be nearly impossible to implement.

For a long-only investor, then, what is more relevant is how these signals forecast performance of different rank orderings of portfolios.  For example, how does a portfolio of the best-ranking 3-month momentum exposures compare to a portfolio of the worst-ranking?

In the remainder of this commentary, we explore the return and risk profiles of quintile portfolios formed on each signal.  To construct these portfolios, we rank order our exposures based on the given quantitative signal and equally-weight the exposures falling within each quintile.

Momentum

We generate momentum signals by computing 12-, 6- and 3- month prior total returns to reflect slow, intermediate, and fast momentum signals.  Low-ranking exposures are those with the lowest prior total returns, while high ranking exposures have the highest total returns.

The portfolios assume a 1-month holding period for momentum signals.  To avoid timing luck, four sub-indexes are used, each rebalancing on a different week of the month.

Annualized return and volatility numbers for the quintiles are plotted below.

A few interesting data-points stand out:

  • For 12-month prior return, the lowest quintile actually had the highest total return.However, it has a dramatically lower Sharpe ratio than the highest quintile, which only slightly underperforms it.
  • Total returns among the highest quintile increase by 150 basis points (“bps”) from 12-month to 3-month signals, and 3-month rankings create a more consistent profile of increasing total return and Sharpe ratio. This may imply that short-term signals are more effective for fixed income.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

Carry

Carry is the expected excess return of an asset assuming price does not change.  For our fixed income universe, we proxy carry using yield-to-worst minus the risk-free rate.  For non-Treasury holdings, we adjust this figure for expected defaults and recovery.

For reasonably efficient markets, we would expect higher carry to imply higher return, but not necessarily higher risk-adjusted returns.  In other words, we earn higher carry as a reward for bearing more risk.

Therefore, we also calculate an alternate measure of carry: carry-to-risk.  Carry-to-risk is calculated by taking our carry measure and dividing it by recent realized volatility levels.  One way of interpreting this figure is as forecast of Sharpe ratio.  Our expectation is that this signal may be able to identify periods when carry is episodically cheap or rich relative to prevailing market risk.

The portfolios assume a 12-month holding period for carry signals.  To avoid timing luck, 52 sub-indexes are used, each rebalancing on a different week of the year.

We see:

  • Higher carry implies a higher return as well as a higher volatility. As expected, no free lunch here.
  • Carry-to-risk does not seem to provide a meaningful signal. In fact, low carry-to-risk outperforms high carry-to-risk by 100bps annualized.
  • Volatility meaningfully declines for carry-to-risk quintiles, potentially indicating that this integrated carry/volatility signal is being too heavily driven by volatility.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

Value

In past commentaries, we have used real yield as our value proxy in fixed income.  In this commentary, we deviate from that methodology slightly and use a time-series z-score of carry as our value of measure. Historically high carry levels are considered to be cheap while historically low carry levels are considered to be expensive.

The portfolios assume a 12-month holding period for value signals.  To avoid timing luck, 52 sub-indexes are used, each rebalancing on a different week of the year.

We see not only a significant increase in total return in buying cheap versus expensive holdings, but also an increase in risk-adjusted returns.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions. 

Reversal

Reversal signals are the opposite of momentum: we expect past losers to outperform and past winners to underperform.  Empirically, reversals tend to occur over very short time horizons (e.g. 1 month) and longer-term time horizons (e.g. 3- to 5-years).  In many ways, long-term reversals can be thought of as a naive proxy for value, though there may be other behavioral and structural reasons for the historical efficacy of reversal signals.

We must be careful implementing reversal signals, however, as exposures in our universe have varying return dynamics (e.g. expected return and volatility levels).

To illustrate this problem, consider the simple two-asset example of equities and cash.  A 3-year reversal signal would sell the asset that has had the best performance over the prior 3-years and buy the asset that has performed the worst.  The problem is that we expect stocks to outperform cash due to the equity risk premium. Naively ranking on prior returns alone would have us out of equities during most bull markets.

Therefore, we must be careful in ranking assets with meaningfully different return dynamics.

(Why, then, can we do it for momentum?  In a sense, momentum is explicitly trying to exploit the relative time-series properties over a short-term horizon.  Furthermore, in a universe that contains low-risk, low-return assets, cross-sectional momentum can be thought of as an integrated process between time-series momentum and cross-sectional momentum, as the low-risk asset will bubble to the top when absolute returns are negative.)

To account for this, we use a time-series z-score of prior returns to create a reversal signal.  For example, at each point in time we calculate the current 3-year return and z-score it against all prior rolling 3-year periods.

Note that in this construction, high z-scores will reflect higher-than-normal 3-year numbers and low z-scores will reflect lower-than-normal 3-year returns. Therefore, we negate the z-score to generate our signal such that low-ranked exposures reflect those we want to sell and high-ranked exposures reflect those we want to buy.

The portfolios assume a 12-month holding period for value signals.  To avoid timing luck, 52 sub-indexes are used, each rebalancing on a different week of the year.

Plotting the results below for 1-, 3-, and 5-year reversal signals, we see that 3- and 5-year signals see a meaningful increase in both total return and Sharpe ratio between the lowest quintile.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

Volatility

Volatility signals are trivial to generate: we simply sort assets based on prior realized volatility.  Unfortunately, exploiting the low-volatility anomaly is difficult without leverage, as the empirically higher risk-adjusted return exhibited by low-volatility assets typically coincides with lower total returns.

For example, in the tests below the low quintile is mostly comprised of short-term Treasuries and floating rate corporates.  The top quintile is allocated across local currency emerging market debt, long-dated Treasuries, high yield bonds, and unhedged international government bonds.

As a side note, for the same reason we z-scored reversal signals, we also hypothesized that z-scoring may work on volatility.  Beyond these two sentences, the results were nothing worth writing about.

Nevertheless, we can still attempt to confirm the existence of the low-volatility anomaly in our investable universe by ranking assets on their past volatility.

The portfolios assume a 1-month holding period for momentum signals.  To avoid timing luck, four sub-indexes are used, each rebalancing on a different week of the month.

Indeed, in plotting results we see that the lowest volatility quintiles have significantly higher realized Sharpe ratios.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

Of the results plotted above, our eyes might be drawn to the results in the short-term volatility measure. It would appear that the top quintile has both a lower total return and much higher volatility than the 3rd and 4th quintiles.  This might suggest that we could improve our portfolios risk-adjusted returns without sacrificing total return by avoiding those top-ranked assets.

Unfortunately, this is not so clear cut.  Unlike the other signals where the portfolios had meaningful turnover, these quintiles are largely stable.  This means that the results are driven more by the composition of the portfolios than the underlying signals.  For example, the 3rd and 4th quintiles combine both Treasuries and credit exposure, which allows the portfolio to realize lower volatility due to correlation.  The highest volatility quintile, on the other hand, holds both local currency emerging market debt and un-hedged international government bonds, introducing (potentially uncompensated) currency risk into the portfolio.

Thus, the takeaway may be more strategic than tactical: diversification is good and currency exposure is going to increase your volatility.

Oh – and allocating to zero-to-negatively yielding foreign bonds isn’t going to do much for your return unless currency changes bail you out.

Conclusion

In this study, we explored the application of value, momentum, carry, reversal, and volatility signals across fixed income exposures.  We found that value, 3-month momentum, carry, and 3-year reversal signals may all provide meaningful information about forward expected returns and risk.

Our confidence in this analysis, however, is potentially crippled by several points:

  • The time horizon covered is, at best, two decades, and several economic variables are constant throughout it.
  • The inflation regime over the time period was largely uniform.
  • A significant proportion of the period covered had near-zero short-term Treasury yields and negative yields in foreign government debt.
  • Reversal signals require a significant amount of formation data. For example, the 3-year reversal signal requires 6 years (i.e. 3-years of rolling 3-year returns) of data before a signal can be generated. This represents nearly 1/3rd of the data set.
  • The dispersion in return dynamics (e.g. volatility and correlation) of the underlying assets can lead to the emergence of unintended artifacts in the data that may speak more to portfolio composition than the value-add from the quantitative signal.
  • We did not test whether certain exposures or certain time periods had an outsized impact upon results.
  • We did not thoroughly test stability regions for different signals.
  • We did not test the impact of our holding period assumptions.
  • Holdings within quantile portfolios were assumed to be equally weighted.

Some of these points can be addressed simply.  Stability concerns, for example, can be addressed by testing the impact of varying signal parameterization.

Others are a bit trickier and require more creative thinking or more computational horsepower.

Testing for the outsized impact of a given exposure or a given time period, for example, can be done through sub-sampling and cross-validation techniques.  We can think of this as the application of randomness to efficiently cover our search space.

For example, below we re-create our 3-month momentum quintiles, but do so by randomly selecting only 10 of the exposures and 75% of the return period to test.   We repeat this resampling 10,000 times for each quintile and plot the distribution of annualized returns below.

Even without performing an official difference-in-means test, the separation between the low and high quintile annualized return distributions provides a clue that the performance difference between these two is more likely to be a pervasive effect rather than due to an outlier holding or outlier time period.

We can make this test more explicit by using this subset resampling technique to bootstrap a distribution of annualized returns for a top-minus-bottom quintile long/short portfolio.  Specifically, we randomly select a subset of assets and generate our 3-month momentum signals.  We construct a dollar-neutral long/short portfolio by going long assets falling in the top quintile and short assets falling in the bottom quintile.  We then select a random sub-period and calculate the annualized return.

Only 207 of the 10,000 samples fall below 0%, indicating a high statistical likelihood that the outperformance of recent winners over recent losers is not an effect dominated by a specific subset of assets or time-periods.

While this commentary provides a first step towards analyzing quantitative style signals across fixed income exposures, more tests need to be run to develop greater confidence in their efficacy.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

 


 

Page 2 of 4

Powered by WordPress & Theme by Anders Norén