The Research Library of Newfound Research

Category: Risk & Style Premia Page 7 of 16

Ensemble Multi-Asset Momentum

This post is available as a PDF download here.

Summary­

  • We explore a representative multi-asset momentum model that is similar to many bank-based indexes behind structured products and market-linked CDs.
  • With a monthly rebalance cycle, we find substantial timing luck risk.
  • Using the same basic framework, we build a simple ensemble approach, diversifying both process and rebalance timing risk.
  • We find that the virtual strategy-of-strategies is able to harvest diversification benefits, realizing a top-quartile Sharpe ratio with a bottom-quartile maximum drawdown.

Early in the 2010s, a suite of index-linked products came to market that raised billions of dollars.  These products – offered by just about every major bank – sought to simultaneously exploit the diversification benefits of modern portfolio theory and the potential for excess returns from the momentum anomaly.

While each index has its own bells and whistles, they generally follow the same approach:

  • A global, multi-asset universe covering equities, fixed income, and commodities.
  • Implemented using highly liquid ETFs.
  • Asset class and position-level allocation limits.
  • A monthly rebalance schedule.
  • A portfolio optimization that seeks to maximize weighted prior returns (e.g. prior 6 month returns) while limiting portfolio volatility to some maximum threshold (e.g. 5%).

And despite their differences, we can see in plotting their returns below that these indices generally share a common return pattern, indicating a common, driving style.

Source: Bloomberg.

Frequent readers will know that “monthly rebalance” is an immediate red flag for us here at Newfound: an indicator that timing luck is likely lurking nearby.

Replicating Multi-Asset Momentum

To test the impact of timing luck, we replicate a simple multi-asset momentum strategy based upon available index descriptions.

We rebalance the portfolio at the end of each month.  Our optimization process seeks to identify the portfolio with a realized volatility less than 5% that would have maximized returns over the prior six months, subject to a number of position and asset-level limits.  If the 5% volatility target is not achievable, the target is increased by 1% until a portfolio can be constructed that satisfies our constraints.

We use the following ETFs and asset class limits:

As a naïve test for timing luck, rather than assuming the index rebalances at the end of each month, we will simply assume the index rebalances every 21 trading days. In doing so, we can construct 21 different variations of the index, each representing the results from selecting a different rebalance date.

Source: CSI Analytics; Calculations by Newfound Research.  Results are backtested and hypothetical.  Results assume the reinvestment of all distributions.  Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes, with the exception of underlying ETF expense ratios.  Past performance is not an indicator of future results. 

As expected, the choice of rebalance date has a meaningful impact.  Annualized returns range from 4.7% to 5.5%, Sharpe ratios range from 0.6 to 0.9, and maximum drawdowns range from 9.9% to 20.8%.

On a year-by-year basis, the only thing that is consistent is the large spread between the worst and best-performing rebalance date.  On average, the yearly spread exceeds 400 basis points.

Min

Max

2008*

-9.91%

0.85%

2009

2.36%

4.59%

2010

6.46%

9.65%

2011

3.31%

10.15%

2012

6.76%

10.83%

2013

3.42%

6.13%

2014

5.98%

10.60%

2015

-5.93%

-2.51%

2016

4.18%

8.45%

2017

9.60%

11.62%

2018

-6.00%

-2.53%

2019 YTD

5.93%

10.01%

* Partial year starting 7/22/2018

We’ve said it in the past and we’ll say it again: timing luck can be the difference between hired and fired.  And while we’d rather be on the side of good luck, the lack of control means we’d rather just avoid this risk all together.

If it isn’t nailed down for a reason, diversify it

The choice of when to rebalance is certainly not the only free variable of our multi-asset momentum strategy.  Without an explicit view as to why a choice is made, our preference is always to diversify so as to avoid specification risk.

We will leave the constraints (e.g. volatility target and weight constraints) well enough alone in this example, but we should consider the process by which we’re measuring past returns as well as the horizon over which we’re measuring it.  There is plenty of historical efficacy to using prior 6-month total returns for momentum, but no lack of evidence supporting other lookback horizons or measurements.

Therefore, we will use three models of momentum: prior total return, the distance of price from its moving average, and the distance of a short-term moving average from a longer-term moving average.  We will vary the parameterization of these signals to cover horizons ranging from 3- to 15-months in length.

We will also vary which day of the month the portfolio rebalances on.

By varying the signal, the lookback horizon, and the rebalance date, we can generate hundreds of different portfolios, all supported by the same theoretical evidence but having slightly different realized results due to their particular specification.

Our robust portfolio emerges by calculating the weights for all these different variations and averaging them together, in many ways creating a virtual strategy-of-strategies.

Below we plot the result of this –ensemble approach– as compared to a –random sample of the underlying specifications–.  We can see that while there are specifications that do much better, there are also those that do much worse.  By employing an ensemble approach, we forgo the opportunity for good luck and avoid the risk of bad luck.   Along the way, though, we may pick up some diversification benefits: the Sharpe ratio of the ensemble approach fell in the top quartile of specifications and its maximum drawdown was in the bottom quartile (i.e. lower drawdown).

Source: CSI Analytics; Calculations by Newfound Research.  Results are backtested and hypothetical.  Results assume the reinvestment of all distributions.  Results are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes, with the exception of underlying ETF expense ratios.  Past performance is not an indicator of future results.

Conclusion

In this commentary, we again demonstrate the potential risk of needless specification and the potential power of diversification.

Using a popular multi-asset momentum model as our example, we again find a significant amount of timing luck lurking in a monthly rebalance specification.  By building a virtual strategy-of-strategies, we are able to manage this risk by partially rebalancing our portfolio on different days.

We go a step further, acknowledging that processrepresents another axis of risk. Specifically, we vary both how we measure momentum and the horizon over which it is measured.  Through the variation of rebalance days, model specifications, and lookback horizons, we generate over 500 different strategy specifications and combine them into a virtual strategy-of-strategies to generate our robust multi-asset momentum model.

As with prior commentaries, we find that the robust model is able to effectively reduce the risk of both specification and timing luck.  But perhaps most importantly, it was able to harvest the benefits of diversification, realizing a Sharpe ratio in the top quartile of specifications and a maximum drawdown in the lowest quartile.

Decomposing the Credit Curve

This post is available as a PDF download here.

Summary­

  • In this research note, we continue our exploration of credit.
  • Rather than test a quantitative signal, we explore credit changes through the lens of statistical decomposition.
  • As with the Treasury yield curve, we find that changes in the credit spread curve can be largely explained by Level, Slope, and Curvature (so long as we adjust for relative volatility levels).
  • We construct stylized portfolios to reflect these factors, adjusting position weights such that they contribute an equal amount of credit risk. We then neutralize interest rate exposure such that the return of these portfolios represents credit-specific information.
  • We find that the Level trade suggests little-to-no realized credit premium over the last 25 years, and Slope suggests no realized premium of junk-minus-quality within credit either. However, results may be largely affected by idiosyncratic events (e.g.  LTCM in 1998) or unhedged risks (e.g. sector differences in credit indices).

In this week’s research note, we continue our exploration of credit with a statistical decomposition of the credit spread curve.  Just as the U.S. Treasury yield curve plots yields versus maturity, the credit spread curve plots excess yield versus credit quality, providing us insight into how much extra return we demand for the risks of declining credit quality.

Source: Federal Reserve of St. Louis; Bloomberg.  Calculations by Newfound Research. 

Our goal in analyzing the credit spread curve is to gain a deeper understanding of the principal drivers behind its changes.  In doing so, we hope to potentially gain intuition and ideas for trading signals between low- and high-quality credit.

To begin our, we must first construct our credit spread curve.  We will use the following index data to represent our different credit qualities.

  • Aaa: Bloomberg U.S. Corporate Aaa Index (LCA3TRUU)
  • Aa: Bloomberg U.S. Corporate Aa Index (LCA2TRUU)
  • A:Bloomberg U.S. Corporate A Index (LCA1TRUU)
  • Baa: Bloomberg U.S. Corporate Baa Index (LCB1TRUU)
  • Ba: Bloomberg U.S. Corporate HY Ba Index (BCBATRUU)
  • B: Bloomberg U.S. Corporate HY B Index (BCBHTRUU)
  • Caa: Bloomberg U.S. Corporate HY Caa Index (BCAUTRUU)

Unfortunately, we cannot simply plot the yield-to-worst for each index, as spread captures the excess yield.  Which raises the question: excess to what?  As we want to isolate the credit component of the yield, we need to remove the duration-equivalent Treasury rate.

Plotting the duration of each credit index over time, we can immediately see why incorporating this duration data will be important.  Not only do durations vary meaningfully over time (e.g. Aaa durations varying between 4.95 and 11.13), but they also deviate across quality (e.g. Caa durations currently sit near 3.3 while Aaa durations are north of 11.1).

Source: Bloomberg.

To calculate our credit spread curve, we must first calculate the duration-equivalent Treasury bond yield for each index at each point in time.  For each credit index at each point in time, we use the historical Treasury yield curve to numerically solve for the Treasury maturity that matches the credit index’s duration.  We then subtract that matching rate from the credit index’s reported yield-to-worst to estimate the credit spread.

We plot the spreads over time below.

Source: Federal Reserve of St. Louis; Bloomberg.  Calculations by Newfound Research.

Statistical Decomposition: Eigen Portfolios

With our credit spreads in hand, we can now attempt to extract the statistical drivers of change within the curve.  One method of achieving this is to:

  • Calculate month-to-month differences in the curve.
  • Calculate the correlation matrix of the differences.
  • Calculate an eigenvalue decomposition of the correlation matrix.

Stopping after just the first two steps, we can begin to see some interesting visual patterns emerge in the correlation matrix.

  • There is not a monotonic decline in correlation between credit qualities. For example, Aaa is not more highly correlated to Aa than Ba and A is more correlated to B than it is Aa.
  • Aaa appears to behave rather uniquely.
  • Baa, Ba, B, and to a lesser extent Caa, appear to visually cluster in behavior.
  • Ba, B, and Caa do appear to have more intuitive correlation behavior, with correlations increasing as credit qualities get closer.

Step 3 might seem foreign for those unfamiliar with the technique, but in this context eigenvalue decomposition has an easy interpretation.   The process will take our universe of credit indices and return a universe of statistically independent factor portfolios, where each portfolio is made up of a combination of credit indices.

As our eigenvalue decomposition was applied to the correlation matrix of credit spread changes, the factors will explain the principal vectors of variance in credit spread changes.  We plot the weights of the first three factors below.

Source: Federal Reserve of St. Louis; Bloomberg.  Calculations by Newfound Research.

For anyone who has performed an eigenvalue decomposition on the yield curve before, three familiar components emerge.

We can see that Factor #1 applies nearly equal-weights across all the credit indices. Therefore, we label this factor “level” as it represents a level shift across the entire curve.

Factor #2 declines in weight from Aaa through Caa.  Therefore, we label this factor “slope,” as it controls steepening and flattening of the credit curve.

Factor #3 appears as a barbell: negative weights in the wings and positive weights in the belly.  Therefore, we call this factor “curvature,” as it will capture convexity changes in the curve.

Together, these three factors explain 80% of the variance in credit spread changes. Interestingly, the 4thfactor – which brings variance explained up to 87.5% – also looks very much like a curvature trade, but places zero weight on Aaa and barbells Aa/Caa against A/Baa.  We believe this serves as further evidence as to the unique behavior of Aaa credit.

Tracking Credit Eigen Portfolios

As we mentioned, each factor is constructed as a combination of exposure to our Aaa-Caa credit universe; in other words, they are portfolios!  This means we can track their performance over time and see how these different trades behave in different market regimes.

To avoid overfitting and estimation risk, we decided to simplify the factor portfolios into more stylized trades, whose weights are plotted below (though ignore, for a moment, the actual weights, as they are meant only to represent relative weighting within the portfolio and not absolute level).  Note that the Level trade has a cumulative positive weight while the Slope and Curvature trades sum to zero.

To actually implement these trades, we need to account for the fact that each credit index will have a different level of credit duration.

Akin to duration, which measure’s a bond’s sensitivity to interest rate changes, credit duration measures a bond’s sensitivity to changes in its credit spread. As with Treasuries, we need to adjust the weights of our trades to account for this difference in credit durations across our indices.

For example, if we want to place a trade that profits in a steepening of the Treasury yield curve, we might sell 10-year US Treasuries and buy 2-year US Treasuries. However, we would not buy and sell the same notional amount, as that would leave us with a significantly negative duration position.  Rather, we would scale each leg such that their durations offset.  In the end, this causes us to buy significantly more 2s than we sell 10s.

To continue, therefore, we must calculate credit spread durations.

Without this data on hand, we employ a statistical approach.  Specifically, we take monthly total return data and subtract yield return and impact from interest rate changes (employing the duration-matched rates we calculated above).  What is left over is an estimate of return due to changes in credit spreads. We then regress these returns against changes in credit spreads to calculate credit spread durations, which we plot below.

Source: Federal Reserve of St. Louis; Bloomberg.  Calculations by Newfound Research.

The results are a bit of a head scratcher.  Unlike duration in the credit curve which typically increases monotonically across maturities, we get a very different effect here.  Aaa credit spread duration is 10.7 today while Caa credit spread duration is 2.8.  How is that possible?  Why is lower-quality credit not more sensitiveto credit changes than higher quality credit?

Here we run into a very interesting empirical result in credit spreads: spread change is proportional to spread level.  Thus, a true “level shift” rarely occurs in the credit space; e.g. a 1bp change in the front-end of the credit spread curve may actually manifest as a 10bp change in the back end.  Therefore, the lower credit spread duration of the back end of the curve is offset by larger changes.

There is some common-sense intuition to this effect.  Credit has a highly non-linear return component: defaults.  If we enter an economic environment where we expect an increase in default rates, it tends to happen in a non-linear fashion across the curve.  To offset the larger increase in defaults in lower quality credit, investors will demand larger corresponding credit spreads.

(Side note: this is why we saw that the Baa–Aaa  spread did not appear to mean-revert as cleanly as the log-difference of spreads did in last week’s commentary, Value and the Credit Spread.)

While our credit spread durations may be correct, we still face a problem: weighting such that each index contributes equal credit spread duration will create an outsized weight to the Caa index.

DTS Scaling

Fortunately, some very smart folks thought about this problem many years ago. Recognizing the stability of relative spread changes, Dor, Dynkin, Hyman, Houweling, van Leeuwen, and Penninga (2007)recommend the measure of duration times spread (“DTS”) for credit risk.

With a more appropriate measure of credit sensitivity, we can now scale our stylized factor portfolio weights such that each position contributes an equal level of DTS.  This will have two effects: (1) the relative weights in the portfolios will change over time, and (2) the notional size of the portfolios will change over time.

We scale each position such that (1) they contribute an equal level of DTS to the portfolio and (2) each leg of the portfolio has a total DTS of 500bps.  The Level trade, therefore, represents a constant 500bps of DTS risk over time, while the Slope and Curvature trades represent 0bps, as the longs and short legs net out.

One problem still remains: interest rate risk.  As we plotted earlier in this piece, the credit indices have time-varying – and sometimes substantial – interest rate exposure.  This creates an unintended bet within our portfolios.

Fortunately, unlike the credit curve, true level shift does empirically apply in the Treasury yield curve.  Therefore, to simplify matters, we construct a 5-year zero-coupon bond, which provides us with a constant duration instrument.  At each point in time, we calculate the net duration of our credit trades and use the 5-year ZCB to neutralize the interest rate risk.  For example, if the Level portfolio has a duration of 1, we would take a -20% notional position in the 5-year ZCB.

Source: Federal Reserve of St. Louis; Bloomberg.  Calculations by Newfound Research.

Some things we note when evaluating the portfolios over time:

  • In all three portfolios, notional exposure to higher credit qualities is substantially larger than lower credit qualities. This captures the meaningfully higher exposure that lower credit quality indices have to credit risk than higher quality indices.
  • The total notional exposure of each portfolio varies dramatically over time as market regimes change. In tight spread environments, DTS is low, and therefore notional exposures increase. In wide spread environments – like 2008 – DTS levels expand dramatically and therefore only a little exposure is necessary to achieve the same risk target.
  • 2014 highlights a potential problem with our approach: as Aaa spreads reached just 5bps, DTS dipped as low as 41bps, causing a significant swing in notional exposure to maintain the same DTS contribution.

Conclusion

The fruit of our all our labor is the graph plotted below, which shows the growth of $1 in our constant DTS, stylized credit factor portfolios.

What can we see?

First and foremost, constant credit exposure has not provided much in the last 25 years until recently.  It would appear that investors did not demand a high enough premium for the risks that were realized over the period, which include the 1998 LTCM blow-up, the burst of the dot-com bubble, and the 2008 recession.

From 12/31/2008 lows through Q1 2019, however, a constant 500bps DTS exposure generated a 2.0% annualized return with 2.4% annualized volatility, reflecting a nice annual premium for investors willing to bear the credit risk.

Slope captures the high-versus-low-quality trade.  We can see that junk meaningfully out-performed quality in the 1990s, after which there really did not appear to be a meaningful difference in performance until 2013 when oil prices plummeted and high yield bond prices collapsed.  This result does highlight a potential problem in our analysis: the difference in sector composition of the underlying indices. High yield bonds had an outsized reaction compared to higher quality investment grade credit due to more substantial exposure to the energy sector, leading to a lop-sided reaction.

What is also interesting about the Slope trade is that the market did not seem to price a meaningful premium for holding low-quality credit over high-quality credit.

Finally, we can see that Curvature (“barbell versus belly”) – trade was rather profitable for the first decade, before deflating pre-2008 and going on a mostly-random walk ever since.  However, as mentioned when the curvature trade was initially introduced, the 4th factor in our decomposition also appeared to reflect a similar trade but shorts Aa and Caa versus a long position in A and Baa.  This trade has been a fairly consistent money-loser since the early 2000s, indicating that a barbell of high quality (just not Aaa) and junk might do better than the belly of the curve.

It is worth pointing out that these trades represent a significant amount of compounding estimation – from duration-matching Treasury rates to credit spread durations – which also means a significant risk of compounding estimation error.  Nevertheless, we believe there are a few takeaways worth exploring further:

  • The Level trade appears highly regime dependent (in positive and negative economic environments), suggesting a potential opportunity for on/off credit trades.
  • The 4th factor is a consistent loser, suggesting a potential structural tilt that can be made by investors by holding quality and junk (e.g. QLTA + HYG) rather than the belly of the curve (LQD).  Implementing this in a long-only fashion would require more substantial analysis of duration trade-offs, as well as a better intuition as to whythe returns are emerging as they are.
  • Finally, a recognition that maintaining a constant credit risk level requires reducing notional exposure as rates go up, as rate changes are proportional to rate levels. This is an important consideration for strategic asset allocation.

 

Value and the Credit Spread

This post is available as a PDF download here.

Summary­

  • We continue our exploration of quantitative signals in fixed income.
  • We use a measure of credit curve steepness as a valuation signal for timing exposure between corporate bonds and U.S. Treasuries.
  • The value signal generates a 0.84% annualized return from 1950 to 2019 but is highly regime dependent with meaningful drawdowns.
  • Introducing a naïve momentum strategy significantly improves the realized Sharpe ratio and drawdown profile, but does not reduce the regime-based nature of the returns.
  • With a combined return of just 1.0% annualized, this strategy may not prove effective after appropriate discounting for hindsight bias, costs, and manager fees. The signal itself, however, may be useful in other contexts.

In the last several weeks, we have been exploring the application of quantitative signals to fixed income.

Recent cross-sectional studies also build off of further research we’ve done in the past on applying trend, value, carry, and explicit measures of the bond risk premium as duration timing mechanisms (see Duration Timing with Style Premia; Timing Bonds with Value, Momentum, and Carry; and A Carry-Trend-Hedge Approach to Duration Timing).

Broadly, our studies have found:

  • Value (measured as deviation from real yield), momentum (prior 12-month returns), and carry (yield-to-worst) were all profitable factors in cross-section municipal bond sector long/short portfolios.
  • Value (measured as deviation from real yield), trend (measured as prior return), and carry (measured as term spread + roll yield) have historically been effective timing signals for U.S. duration exposure.
  • Prior short-term equity returns proved to be an effective signal for near-term returns in U.S. Treasuries (related to the “flight-to-safety premium”).
  • Short-term trend proved effective for high yield bond timing, but the results were vastly determined by performance in 2000-2003 and 2008-2009. While the strategy appeared to still be able to harvest relative carry between high-yield bonds and core fixed income in other environments, a significant proportion of returns came from avoiding large drawdowns in high yield.
  • Short-term cross-section momentum (prior total returns), value (z-score of loss-adjusted yield-to-worst), carry (loss-adjusted yield-to-worst), and 3-year reversals all appeared to offer robust signals for relative selection in fixed income sectors. The time period covered in the study, however, was limited and mostly within a low-inflation regime.
  • Application of momentum, value, carry, and reversal as timing signals proved largely ineffective for generating excess returns.

In this week’s commentary, we want to further contribute to research by introducing a value timing signal for credit.

Finding Value in Credit

Identifying a value signal requires some measure or proxy of an asset’s “fair” value. What can make identifying value in credit so difficult is that there are a number of moving pieces.

Conceptually, credit spreads should be proportional to default rates, recovery rates, and aggregate risk appetite, making determining whether spreads are cheap or expensive rather complicated.  Prior literature typically tackles the problem with one of three major categories of models:

  • Econometric: “Fair value” of credit spreads is modeled through a regression that typically explicitly accounts for default and recovery rates. Inputs are often related to economic and market variables, such as equity market returns, 10-year minus 2-year spreads, corporate leverage, and corporate profitability.  Bottom-up analysis may use metrics such as credit quality, maturity, supply, and liquidity.
  • Merton Model: Based upon the idea the bond holders have sold a put on a company’s asset value. Therefore, options pricing models can be used to calculate a credit spread.  Inputs include the total asset value, asset volatility, and leverage of the firm under analysis.
  • Spread Signal: A simple statistical model derived from credit spread themselves. For example, a rolling z-score of option-adjusted spreads or deviations from real yield.  Other models (e.g. Haghani and Dewey (2016)) have used spread plus real yield versus a long-run constant (e.g. “150 basis points”).

The first method requires a significant amount of economic modeling.  The second approach requires a significant amount of extrapolation from market data.  The third method, while computationally (and intellectually) less intensive, requires a meaningful historical sample that realistically needs to cover at least one full market cycle.

While attractive for its simplicity, there are a number of factors that complicate the third approach.

First, if spreads are measured against U.S. Treasuries, the metric may be polluted by information related to Treasuries due to their idiosyncratic behavior (e.g. scarcity effects and flight-to-safety premiums).  Structural shifts in default rates, recovery rates, and risk appetites may also cause a problem, as spreads may appear unduly thin or wide compared to past regimes.

In light of this, in this piece we will explore a similarly simple-to-calculate spread signal, but one that hopefully addresses some of these short-comings.

Baa vs. Aaa Yields

In order to adjust for these problems, we propose looking at the steepness of the credit curve itself by comparing prime / high-grade yield versus lower-medium grade yields.  For example, we could compare Moody’s Season Aaa Corporate Bond Yield and Moody’s Season Baa Corporate Bond Yield.  In fact, we will use these yields for the remainder of this study.

We may be initially inclined to measure the steepness of the credit curve by taking the difference in yield spreads, which we plot below.

Source: Federal Reserve of St. Louis.  Calculations by Newfound Research.

We can find a stronger mean-reverting signal, however, if we calculate the log-difference in yields.

Source: Federal Reserve of St. Louis.  Calculations by Newfound Research.

We believe this transformation is appropriate for two reasons.  First, the log transformation helps control for the highly heteroskedastic and skewed nature of credit spreads.

Second, it helps capture both the steepness andthe level of the credit curve simultaneously.  For example, a 50-basis-point premium when Aaa yield is 1,000 basis points is very different than when Aaa yield is 100 basis points.  In the former case, investors may not feel any pressure to bear excess risk to achieve their return objectives, and therefore a 50-basis-point spread may be quite thin.  In the latter case, 50 basis points may represent a significant step-up in relative return level in an environment where investors have either low default expectations, high recovery expectations, high risk appetite, or some combination thereof.

Another way of interpreting our signal is that it informs us about the relative decisions investors must make about their expected dispersion in terminal wealth.

Constructing the Value Strategy

With our signal in hand, we can now attempt to time credit exposure.  When our measure signals that the credit curve is historically steep, we will take credit risk.  When our signal indicates that the curve is historically flat we will avoid it.

Specifically, we will construct a dollar-neutral long/short portfolio using the Dow Jones Corporate Bond Index (“DJCORP”) and a constant maturity 5-year U.S. Treasury index (“FV”).   We will calculate a rolling z-score of our steepness measure and go long DJCORP and short FV when the z-score is positive and place the opposite trade when the z-score is negative.

In line with prior studies, we will apply an ensemble approach.  Portfolios are reformed monthly using formation ranging from 3-to-6 years with holding periods ranging from 1-to-6 months.  Portfolio weights for the resulting strategy are plotted below.

Source: Federal Reserve of St. Louis and Global Financial Data.  Calculations by Newfound Research.

We should address the fact that while both corporate bond yield and index data is available back to the 1930s, we have truncated our study to ignore dates prior to 12/1949 to normalize for a post-war period.  It should be further acknowledged that the Dow Jones Corporate Bond index used in this study did not technically exist until 2002.  Prior to that date, the index return tracks a Dow Jones Bond Aggregate, which was based upon four sub-indices: high-grade rails, second-grade rails, public utilities, and industries.  This average existed from 1915 to 1976, when it was replaced with a new average at that point when the number of railway bonds was no longer sufficient to maintain the average.

Below we plot the returns of our long/short strategy.

Source: Federal Reserve of St. Louis and Global Financial Data.  Calculations by Newfound Research. Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

The strategy has an annualized return of 0.84% with a volatility of 3.89%, generating a Sharpe ratio of 0.22.  Of course, long-term return statistics belie investor and manager experience, with this strategy exhibiting at least two periods of decade-plus-long drawdowns.  In fact, the strategy really has just four major return regimes: 1950 to 1970 (-0.24% annualized), 1970 to 1987 (2.59% annualized), 1987 to 2002 (-0.33%), and 2002 to 2019 (1.49% annualized).

Try the strategy out in the wrong environment and we might be in for a lot of pain.

Momentum to the Rescue?

It is no secret that value and momentum go together like peanut butter and jelly. Instead of tweaking our strategy to death in order to improve it, we may just find opportunity in combining it with a negatively correlated signal.

Using an ensemble model, we construct a dollar-neutral long/short momentum strategy that compares prior total returns of DJCORP and FV.  Rebalanced monthly, the portfolios use formation periods ranging from 9-to-15 months and holding periods ranging from 1-to-6 months.

Below we plot the growth of $1 in our value strategy, our momentum strategy, and a 50/50 combination of the two strategies that is rebalanced monthly.

Source: Federal Reserve of St. Louis and Global Financial Data.  Calculations by Newfound Research. Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

The first thing we note is – even without calculating any statistics – the meaningful negative correlation we see in the equity curves of the value and momentum strategies.  This should give us confidence that there is the potential for significant improvement through diversification.

The momentum strategy returns 1.11% annualized with a volatility of 3.92%, generating a Sharpe ratio of 0.29.  The 50/50 combination strategy, however, returns 1.03% annualized with a volatility of just 2.16% annualized, resulting in a Sharpe ratio of 0.48.

While we still see significant regime-driven behavior, the negative regimes now come at a far lower cost.

Conclusion

In this study we introduce a simple value strategy based upon the steepness of the credit curve.  Specifically, we calculated a rolling z-score on the log-difference between Moody’s Seasoned Baa and Aaa yields.  We interpreted a positive z-score as a historically steep credit curve and therefore likely one that would revert.  Similarly, when z-scores were negative, we interpreted the signal as a flat credit curve, and therefore a period during which taking credit risk is not well compensated.

Employing an ensemble approach, we generated a long/short strategy that would buy the Dow Jones Corporate Bond Index and short 5-year U.S. Treasuries when credit appeared cheap and place the opposite trade when credit appeared expensive.  We found that this strategy returned 0.84% annualized with a volatility of 3.89% from 1950 to 2019.

Unfortunately, our value signal generated significantly regime-dependent behavior with decade-long drawdowns.  This not only causes us to question the statistical validity of the signal, but also the practicality of implementing it.

Fortunately, a naively constructed momentum signal provides ample diversification.  While a combination strategy is still highly regime-driven, the drawdowns are significantly reduced.  Not only do returns meaningfully improve compared to the stand-alone value signal, but the Sharpe ratio more-than-doubles.

Unfortunately, our study leveraged a long/short construction methodology.  While this isolates the impact of active returns, long-only investors must cut return expectations of the strategy in half, as a tactical timing model can only half-implement this trade without leverage.  A long-only switching strategy, then, would only be expected to generate approximately 0.5% annualized excess return above a 50% Dow Jones Corporate Bond Index / 50% 5-Year U.S. Treasury index portfolio.

And that’s before adjustments for hindsight bias, trading costs, and manager fees.

Nevertheless, more precise implementation may lead to better results.  For example, our indices neither perfectly matched the credit spreads we evaluated, nor did they match each other’s durations.  Furthermore, while this particular implementation may not survive costs, this signal may still provide meaningful information for other credit-based strategies.

Quantitative Styles and Multi-Sector Bonds

This post is available as a PDF download here.

Summary­

  • In this commentary we explore the application of several quantitative signals to a broad set of fixed income exposures.
  • Specifically, we explore value, momentum, carry, long-term reversals, and volatility signals.
  • We find that value, 3-month momentum, carry, and 3-year reversals all create attractive quantile profiles, potentially providing clues for how investors might consider pursuing higher returns or lower risk.
  • This study is by no means comprehensive and only intended to invite further research and conversation around the application of quantitative styles across fixed income exposures.

In Navigating Municipal Bonds with Factors, we employed momentum, value, carry, and low-volatility signals to generate a sector-based approach to navigating municipal bonds.

In this article, we will introduce an initial data dive into applying quantitative signals to a broader set of fixed income exposures.  Specifically, we will incorporate 17 different fixed income sectors, spanning duration, credit, and geographic exposure.

  • U.S. Treasuries: Near (3-Month), short (1-3 Year), mid (3-5 Year) intermediate (7-10 Year), and long (20+ Year).
  • Investment-Grade Corporates: Short-term, intermediate-term, and Floating Rate corporate bonds.
  • High Yield: Short- and intermediate-term high yield.
  • International Government Bonds: Currency hedged and un-hedged government bonds.
  • Emerging Market: Local and US dollar denominated.
  • TIPs: Short- and intermediate-term TIPs.
  • Mortgage-Backed: Investment grade mortgage-backed bonds.

In this study, each exposure is represented by a corresponding ETF.  We extend our research prior to ETF launch by employing underlying index data the ETF seeks to track.

The quantitative styles we will explore are:

  • Momentum: Buy recent winners and sell recent losers.
  • Value: Buy cheap and sell expensive.
  • Carry: Buy high carry and sell low carry.
  • Reversal: Buy long-term losers and sell long-term winners.
  • Volatility: Buy high volatility and sell low volatility.1

The details of each style are explained in greater depth in each section below.

Note that the analysis herein is by no means meant to be prescriptive in any manner, nor is it a comprehensive review.  Rather, it is meant as a launching point for further commentaries we expect to write.

At the risk of spoiling the conclusion, below we plot the annualized returns and volatility profiles of dollar-neutral long-short portfolios.2  We can see that short-term Momentum, Value, Carry, and Volatility signals generate positive excess returns over the testing period.

Curiously, longer-term Momentum does not seem to be a profitable strategy, despite evidence of this approach being rather successful for many other asset classes.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

However, these results are not achievable by most investors who may be constrained to a long-only implementation.  Even when interpreted as over- and under-weight signals, the allocations in the underlying long/short portfolios differ so greatly from benchmark exposures, they would be nearly impossible to implement.

For a long-only investor, then, what is more relevant is how these signals forecast performance of different rank orderings of portfolios.  For example, how does a portfolio of the best-ranking 3-month momentum exposures compare to a portfolio of the worst-ranking?

In the remainder of this commentary, we explore the return and risk profiles of quintile portfolios formed on each signal.  To construct these portfolios, we rank order our exposures based on the given quantitative signal and equally-weight the exposures falling within each quintile.

Momentum

We generate momentum signals by computing 12-, 6- and 3- month prior total returns to reflect slow, intermediate, and fast momentum signals.  Low-ranking exposures are those with the lowest prior total returns, while high ranking exposures have the highest total returns.

The portfolios assume a 1-month holding period for momentum signals.  To avoid timing luck, four sub-indexes are used, each rebalancing on a different week of the month.

Annualized return and volatility numbers for the quintiles are plotted below.

A few interesting data-points stand out:

  • For 12-month prior return, the lowest quintile actually had the highest total return.However, it has a dramatically lower Sharpe ratio than the highest quintile, which only slightly underperforms it.
  • Total returns among the highest quintile increase by 150 basis points (“bps”) from 12-month to 3-month signals, and 3-month rankings create a more consistent profile of increasing total return and Sharpe ratio. This may imply that short-term signals are more effective for fixed income.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

Carry

Carry is the expected excess return of an asset assuming price does not change.  For our fixed income universe, we proxy carry using yield-to-worst minus the risk-free rate.  For non-Treasury holdings, we adjust this figure for expected defaults and recovery.

For reasonably efficient markets, we would expect higher carry to imply higher return, but not necessarily higher risk-adjusted returns.  In other words, we earn higher carry as a reward for bearing more risk.

Therefore, we also calculate an alternate measure of carry: carry-to-risk.  Carry-to-risk is calculated by taking our carry measure and dividing it by recent realized volatility levels.  One way of interpreting this figure is as forecast of Sharpe ratio.  Our expectation is that this signal may be able to identify periods when carry is episodically cheap or rich relative to prevailing market risk.

The portfolios assume a 12-month holding period for carry signals.  To avoid timing luck, 52 sub-indexes are used, each rebalancing on a different week of the year.

We see:

  • Higher carry implies a higher return as well as a higher volatility. As expected, no free lunch here.
  • Carry-to-risk does not seem to provide a meaningful signal. In fact, low carry-to-risk outperforms high carry-to-risk by 100bps annualized.
  • Volatility meaningfully declines for carry-to-risk quintiles, potentially indicating that this integrated carry/volatility signal is being too heavily driven by volatility.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

Value

In past commentaries, we have used real yield as our value proxy in fixed income.  In this commentary, we deviate from that methodology slightly and use a time-series z-score of carry as our value of measure. Historically high carry levels are considered to be cheap while historically low carry levels are considered to be expensive.

The portfolios assume a 12-month holding period for value signals.  To avoid timing luck, 52 sub-indexes are used, each rebalancing on a different week of the year.

We see not only a significant increase in total return in buying cheap versus expensive holdings, but also an increase in risk-adjusted returns.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions. 

Reversal

Reversal signals are the opposite of momentum: we expect past losers to outperform and past winners to underperform.  Empirically, reversals tend to occur over very short time horizons (e.g. 1 month) and longer-term time horizons (e.g. 3- to 5-years).  In many ways, long-term reversals can be thought of as a naive proxy for value, though there may be other behavioral and structural reasons for the historical efficacy of reversal signals.

We must be careful implementing reversal signals, however, as exposures in our universe have varying return dynamics (e.g. expected return and volatility levels).

To illustrate this problem, consider the simple two-asset example of equities and cash.  A 3-year reversal signal would sell the asset that has had the best performance over the prior 3-years and buy the asset that has performed the worst.  The problem is that we expect stocks to outperform cash due to the equity risk premium. Naively ranking on prior returns alone would have us out of equities during most bull markets.

Therefore, we must be careful in ranking assets with meaningfully different return dynamics.

(Why, then, can we do it for momentum?  In a sense, momentum is explicitly trying to exploit the relative time-series properties over a short-term horizon.  Furthermore, in a universe that contains low-risk, low-return assets, cross-sectional momentum can be thought of as an integrated process between time-series momentum and cross-sectional momentum, as the low-risk asset will bubble to the top when absolute returns are negative.)

To account for this, we use a time-series z-score of prior returns to create a reversal signal.  For example, at each point in time we calculate the current 3-year return and z-score it against all prior rolling 3-year periods.

Note that in this construction, high z-scores will reflect higher-than-normal 3-year numbers and low z-scores will reflect lower-than-normal 3-year returns. Therefore, we negate the z-score to generate our signal such that low-ranked exposures reflect those we want to sell and high-ranked exposures reflect those we want to buy.

The portfolios assume a 12-month holding period for value signals.  To avoid timing luck, 52 sub-indexes are used, each rebalancing on a different week of the year.

Plotting the results below for 1-, 3-, and 5-year reversal signals, we see that 3- and 5-year signals see a meaningful increase in both total return and Sharpe ratio between the lowest quintile.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

Volatility

Volatility signals are trivial to generate: we simply sort assets based on prior realized volatility.  Unfortunately, exploiting the low-volatility anomaly is difficult without leverage, as the empirically higher risk-adjusted return exhibited by low-volatility assets typically coincides with lower total returns.

For example, in the tests below the low quintile is mostly comprised of short-term Treasuries and floating rate corporates.  The top quintile is allocated across local currency emerging market debt, long-dated Treasuries, high yield bonds, and unhedged international government bonds.

As a side note, for the same reason we z-scored reversal signals, we also hypothesized that z-scoring may work on volatility.  Beyond these two sentences, the results were nothing worth writing about.

Nevertheless, we can still attempt to confirm the existence of the low-volatility anomaly in our investable universe by ranking assets on their past volatility.

The portfolios assume a 1-month holding period for momentum signals.  To avoid timing luck, four sub-indexes are used, each rebalancing on a different week of the month.

Indeed, in plotting results we see that the lowest volatility quintiles have significantly higher realized Sharpe ratios.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

Of the results plotted above, our eyes might be drawn to the results in the short-term volatility measure. It would appear that the top quintile has both a lower total return and much higher volatility than the 3rd and 4th quintiles.  This might suggest that we could improve our portfolios risk-adjusted returns without sacrificing total return by avoiding those top-ranked assets.

Unfortunately, this is not so clear cut.  Unlike the other signals where the portfolios had meaningful turnover, these quintiles are largely stable.  This means that the results are driven more by the composition of the portfolios than the underlying signals.  For example, the 3rd and 4th quintiles combine both Treasuries and credit exposure, which allows the portfolio to realize lower volatility due to correlation.  The highest volatility quintile, on the other hand, holds both local currency emerging market debt and un-hedged international government bonds, introducing (potentially uncompensated) currency risk into the portfolio.

Thus, the takeaway may be more strategic than tactical: diversification is good and currency exposure is going to increase your volatility.

Oh – and allocating to zero-to-negatively yielding foreign bonds isn’t going to do much for your return unless currency changes bail you out.

Conclusion

In this study, we explored the application of value, momentum, carry, reversal, and volatility signals across fixed income exposures.  We found that value, 3-month momentum, carry, and 3-year reversal signals may all provide meaningful information about forward expected returns and risk.

Our confidence in this analysis, however, is potentially crippled by several points:

  • The time horizon covered is, at best, two decades, and several economic variables are constant throughout it.
  • The inflation regime over the time period was largely uniform.
  • A significant proportion of the period covered had near-zero short-term Treasury yields and negative yields in foreign government debt.
  • Reversal signals require a significant amount of formation data. For example, the 3-year reversal signal requires 6 years (i.e. 3-years of rolling 3-year returns) of data before a signal can be generated. This represents nearly 1/3rd of the data set.
  • The dispersion in return dynamics (e.g. volatility and correlation) of the underlying assets can lead to the emergence of unintended artifacts in the data that may speak more to portfolio composition than the value-add from the quantitative signal.
  • We did not test whether certain exposures or certain time periods had an outsized impact upon results.
  • We did not thoroughly test stability regions for different signals.
  • We did not test the impact of our holding period assumptions.
  • Holdings within quantile portfolios were assumed to be equally weighted.

Some of these points can be addressed simply.  Stability concerns, for example, can be addressed by testing the impact of varying signal parameterization.

Others are a bit trickier and require more creative thinking or more computational horsepower.

Testing for the outsized impact of a given exposure or a given time period, for example, can be done through sub-sampling and cross-validation techniques.  We can think of this as the application of randomness to efficiently cover our search space.

For example, below we re-create our 3-month momentum quintiles, but do so by randomly selecting only 10 of the exposures and 75% of the return period to test.   We repeat this resampling 10,000 times for each quintile and plot the distribution of annualized returns below.

Even without performing an official difference-in-means test, the separation between the low and high quintile annualized return distributions provides a clue that the performance difference between these two is more likely to be a pervasive effect rather than due to an outlier holding or outlier time period.

We can make this test more explicit by using this subset resampling technique to bootstrap a distribution of annualized returns for a top-minus-bottom quintile long/short portfolio.  Specifically, we randomly select a subset of assets and generate our 3-month momentum signals.  We construct a dollar-neutral long/short portfolio by going long assets falling in the top quintile and short assets falling in the bottom quintile.  We then select a random sub-period and calculate the annualized return.

Only 207 of the 10,000 samples fall below 0%, indicating a high statistical likelihood that the outperformance of recent winners over recent losers is not an effect dominated by a specific subset of assets or time-periods.

While this commentary provides a first step towards analyzing quantitative style signals across fixed income exposures, more tests need to be run to develop greater confidence in their efficacy.

Source: Bloomberg; Tiingo.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all management fees, transaction fees, and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

 


 

Tactical Credit

This post is available as a PDF download here.

Summary­

  • In this commentary we explore tactical credit strategies that switch between high yield bonds and core fixed income exposures.
  • We find that short-term momentum signals generate statistically significant annualized excess returns.
  • We use a cross-section of statistically significant strategy parameterizations to generate an ensemble strategy.Consistent with past research, we find that this ensemble approach helps reduce idiosyncratic specification risk and dramatically increases the strategy’s information ratio above the median underlying strategy information ratio.
  • To gain a better understanding of the strategy, we attempt to determine the source of strategy returns. We find that a significant proportion of returns are generated as price returns occurring during periods when credit spreads are above their median value and are expanding.
  • Excluding the 2000-2003 and 2008-2009 sub-periods reduces gross-of-cost strategy returns from 2.9% to 1.5%, bringing into question how effective post-of-cost implementation can be if we do not necessarily expect another crisis period to unfold.

There is a certain class of strategies we get asked about quite frequently but have never written much on: tactical credit.

The signals driving these strategies can vary significantly (including momentum, valuation, carry, macro-economic, et cetera) and implementation can range from individual bonds to broad index exposure to credit default swaps.  The simplest approach we see, however, are high yield switching strategies.  The strategies typically allocate between high yield corporate bonds and core fixed income (or short-to-medium-term U.S. Treasuries) predominately based upon some sort of momentum-driven signal.

It is easy to see why this seemingly naïve approach has been attractive.  Implementing a simple rotation between –high-yield corporates– and –core U.S. fixed income– with a 3-month lookback with 1-month hold creates a fairly attractive looking –tactical credit– strategy.

Source: Tiingo.  Calculations by Newfound Research.   Tactical Credit strategy returns are hypothetical and backtested.  Returns gross of all management fees and taxes, but net of underlying fund fees.  HY Corporates represents the Vanguard High-Yield Corporate Fund (VWEHX).  Core Bonds is represented by the Vanguard Total. Bond Market Index Fund (VBMFX).  Returns assume the reinvestment of all distributions.

Visualizing the ratio of the equity curves over time, we see a return profile that is reminiscent of past writings on tactical and trend equity strategies. The tactical credit strategy tends to outperform core bonds during most periods, with the exception of periods of economic stress (e.g. 2000-2002 or 2008).  On the other hand, the tactical credit strategy tends to underperform high yield corporates in most environments, but has historically added significant value in those same periods of economic stress.

Source: Tiingo.  Calculations by Newfound Research.   Tactical Credit strategy returns are hypothetical and backtested.  Returns gross of all management fees and taxes, but net of underlying fund fees.  HY Corporates represents the Vanguard High-Yield Corporate Fund (VWEHX).  Core Bonds is represented by the Vanguard Total. Bond Market Index Fund (VBMFX).  Returns assume the reinvestment of all distributions.

This is akin to tactical equity strategies, which have historically out-performed the safety asset (e.g. cash) during periods of equity market tailwinds, but under-performed buy-and-hold equity during those periods due to switching costs and whipsaw. As the most aggressive stance the tactical credit strategy can take is a 100% position in high yield corporates, it would be unrealistic for us to expect such a strategy to out-perform in an environment that is conducive to strong high yield performance.

What makes this strategy different than tactical equity, however, is that the vast majority of total return in these asset classes comes from income rather than growth.  In fact, since the 1990s, the price return of high yield bonds has annualized at -0.8%.  This loss reflects defaults occurring within the portfolio offset by recovery rates.1

This is potentially problematic for a tactical strategy as it implies a significant potential opportunity cost of switching out of high yield.  However, we can also see that the price return is volatile.  In years like 2008, the price return was -27%, more than offsetting the 7%+ yield you would have achieved just holding the fund.

Source: Tiingo.  Calculations by Newfound Research.   Returns gross of all management fees and taxes, but net of underlying fund fees.

Like trend equity, we can think of this tactical credit strategy as being a combination of two portfolios:

  • A fixed-mix of 50% high yield corporates and 50% core bonds; and
  • 50% exposure to a dollar-neutral long/short portfolio that captures the tactical bet.

For example, when the tactical credit portfolio is 100% in high yield corporates, we can think of this as being a 50/50 strategy portfolio with a 50% overlay that is 100% long high yield corporates and 100% short core bonds, leading to a net exposure that is 100% long high yield corporates.

Thinking in this manner allows us to isolate the active returns of the portfolio actually being generated by the tactical signals and determine value-add beyond a diversified buy-and-hold core.  Thus, for the remainder of this commentary we will focus our exploration on the long/short component.

Before we go any further, we do want to address that a naïve comparison between high yield corporates and core fixed income may be plagued by changing composition in the underlying portfolios as well as unintended bets.  For example, without specifically duration matching the legs of the portfolio, it is likely that a dollar-neutral long/short portfolio will have residual interest rate exposure and will not represent an isolated credit bet.  Thus, naïve total return comparisons will capture both interest rate and credit-driven effects.

This is further complicated by the fact that sensitivity to these factors will change over time due both to the math of fixed income (e.g. interest rate sensitivity changing over time due to higher order effects like convexity) as well as changes in the underlying portfolio composition.  If we are not going to specifically measure and hedge out these unintended bets, we will likely want to rely on faster signals such that the bet our portfolio was attempting to capture is no longer reflected by the holdings.

We will begin by first evaluating the stability of our momentum signals.  We do this by varying formation period (i.e. lookback) and holding period of our momentum rotation strategy and calculating the corresponding t-statistic of the equity curve’s returns.  We plot the t-statistics below and specifically highlight those regions were t-statistics exceed 2, a common threshold for significance.

Source: Tiingo.  Calculations by Newfound Research.

It should be noted that data for this study only goes back to 1990, so achieving statistical significance is more difficult as the sample size is significantly reduced. Nevertheless, unlike trend equity which tends to exhibit strong significance across formation periods ranging 6-to-18 months, we see a much more limited region with tactical credit. Only formation periods from 3-to-5 months appear significant, and only with holding periods where the total period (formation plus holding period) is less than 6-months.

Note that our original choice of 63-day (approximately 3 months) formation and 21-day (approximately 1 month) hold falls within this region.

We can also see that very short formation and holding period combinations (e.g. less than one month) also appear significant.  This may be due to the design of our test.  To achieve the longest history for this study, we employed mutual funds.  However, mutual funds holding less liquid underlying securities tend to exhibit positive autocorrelation. While we adjusted realized volatility levels for this autocorrelation effect in an effort to create more realistic t-statistics, it is likely that positive results in this hyper short-term region emerge from this effect.

Finally, we can see another rather robust region representing the same formation period of 3-to-5 months, but a much longer holding length of 10-to-12 months.  For the remainder of this commentary, we’ll ignore this region, though it warrants further study.

Assuming formation and holding periods going to a daily granularity, the left-most region represents over 1,800 possible strategy combinations.  Without any particular reason for choosing one over another, we will embrace an ensemble approach, calculating the target weights for all possible combinations and averaging them together in a virtual portfolio-of-portfolios configuration.

Below we plot the long/short allocations as well as the equity curve for the ensemble long/short tactical credit strategy.

Source: Tiingo.  Calculations by Newfound Research.   Tactical Credit strategy returns are hypothetical and backtested.  Returns gross of all management fees and taxes, but net of underlying fund fees.  Returns assume the reinvestment of all distributions.

Note that each leg of the long/short portfolio does not necessarily equal 100% notional.  This reflects conflicting signals in the underlying portfolios, causing the ensemble strategy to reduce its gross allocation as a reflection of uncertainty.

As a quick aside, we do want to highlight how the performance of the ensemble compares to the performance of the underlying strategies.

Below we plot the annualized return, annualized volatility, maximum drawdown, and information ratio of all the underlying equity curves of the strategies that make up the ensemble.  We also identify the –ensemble approach–.  While we can see that the ensemble approach brings the annualized return in-line with the median annualized return, its annualized volatility is in the 14thpercentile and its maximum drawdown is in the 8thpercentile.

Source: Tiingo.  Calculations by Newfound Research.   Tactical Credit strategy returns are hypothetical and backtested.  Returns gross of all management fees and taxes, but net of underlying fund fees.  Returns assume the reinvestment of all distributions.

By maintaining the median annualized return and significantly reducing annualized volatility, the ensemble has an information ratio in the 78thpercentile.  As we’ve demonstrated in prior commentaries, by diversifying idiosyncratic specification risk, the ensemble approach is able to generate an information ratio significantly higher than the median without having to explicitly choose which specification we believe will necessarily outperform.

Given this ensemble implementation, we can now ask, “what is the driving force of strategy returns?”  In other words, does the strategy create returns by harvesting price return differences or through carry (yield) differences?

One simple way of evaluating this question is by evaluating the strategy’s sensitivity to changes in credit spreads.  Specifically, we can calculate daily changes in the ICE BofAML US High Yield Master II Option-Adjusted Spread and multiply it against the strategy’s exposure to high yield bonds on the prior day.

By accumulating these weighted changes over time, we can determine how much spread change the strategy has captured.  We can break this down further by isolating positive and negative change days and trying to figure out whether the strategy has benefited from avoiding spread expansion or from harvesting spread contraction.

In the graph below, we can see that the strategy harvested approximately 35,000 basis points (“bps”) from 12/1996 to present (the period for which credit spread data was available). Point-to-point, credit spreads actually widened by 100bps over the period, indicating that tactical changes were able to harvest significant changes in spreads.

Source: St. Louis Federal Reserve.  Calculations by Newfound Research.  

We can see that over the full period, the strategy predominately benefited from harvesting contracting spreads, as exposure to expanding spreads had a cumulative net zero impact.  This analysis is incredibly regime dependent, however, and we can see that periods like 2000-2003 and 2008 saw a large benefit from short-exposure in high yield during a period when spreads were expanding.

We can even see that in the case of post-2008, switching to long high yield exposure allowed the strategy to benefit from subsequent credit spread declines.

While this analysis provides some indication that the strategy benefits from harvesting credit spread changes, we can dig deeper by taking a regime-dependent view of performance. Specifically, we can look at strategy returns conditional upon whether spreads are above or below their long-term median, as well as whether they expand or contract in a given month.

Source: St. Louis Federal Reserve.  Calculations by Newfound Research.  Tactical Credit strategy returns are hypothetical and backtested.  Returns gross of all management fees and taxes, but net of underlying fund fees.  Returns assume the reinvestment of all distributions.

Most of the strategy return appears to occur during times when spreads are above their long-term median. Calculating regime-conditional annualized returns confirms this view.

Above

Below

Expanding

10.88%

-2.79%

Contracting

1.59%

4.22%

 

The strategy appears to perform best during periods when credit spreads are expanding above their long-term median level (e.g. crisis periods like 2008).  The strategy appears to do its worst when spreads are below their median and begin to expand, likely representing periods when the strategy is generally long high yield but has not had a chance to make a tactical switch.

This all points to the fact that the strategy harvests almost all of its returns in crisis periods.  In fact, if we remove 2000-2003 and 2008-2009, we can see that the captured credit spread declines dramatically.

Source: St. Louis Federal Reserve.  Calculations by Newfound Research.  

Capturing price returns due to changes in credit spreads are not responsible for all of the strategy’s returns, however.

Below we explicitly calculate the yield generated by the long/short strategy over time.  As high yield corporates tend to offer higher yields, when the strategy is net long high yield, the strategy’s yield is positive.  On the other hand, when the strategy is net short high yield, the strategy’s yield is negative.

This is consistent with our initial view about why these sorts of tactical strategies can be so difficult.  During the latter stages of the 2008 crisis, the long/short strategy had a net negative yield of close to -0.5% per month.2   Thus, the cost of carrying this tactical position is rather expensive and places a larger burden on the strategy accurately timing price return.

Source: Tiingo.  Calculations by Newfound Research.  Tactical Credit strategy returns are hypothetical and backtested.  Returns gross of all management fees and taxes, but net of underlying fund fees.

From this graph, we believe there are two interesting things worth calling out:

  • The long-run average yield is positive, representing the strategy’s ability to capture carry differences between high yield and core bonds.
  • In the post-crisis environments, the strategy generates yields in excess of one standard deviation of the full-period sample, indicating that the strategy may have benefited from allocating to high yield when yields were abnormally large.

To better determine whether capturing changes in credit spreads or carry differences had a larger impact on strategy returns, we can explicitly calculate the –price– and –total return– indices of the ensemble strategy.

Source: St. Louis Federal Reserve.  Calculations by Newfound Research.  Tactical Credit strategy returns are hypothetical and backtested.  Returns gross of all management fees and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

The –price return– and –total return– series return 2.1% and 2.9% annualized respectively, implying that capturing price return effects account for approximately 75% of the strategy’s total return.

This is potentially concerning, because we have seen that the majority of the price return comes from a single regime: when credit spreads are above their long-term median and expanding.  As we further saw, simply removing the 2000-2003 and 2008-2009 periods significantly reduced the strategy’s ability to harvest these credit spread changes.

While the strategy may appear to be supported by nearly 30-years of empirical evidence, in reality we have a situation where the vast majority of the strategy’s returns were generated in just two regimes.

If we remove 2000-2003 and 2008-2009 from the return series, however, we can see that the total return of the strategy only falls to 0.7% and. 1.6% annualized for –price return– and –total return– respectively.  While this may appear to be a precipitous decline, it indicates that there may be potential to capture both changes in credit spread and net carry differences even in normal market environments so long as implementation costs are kept low enough.

Source: Tiingo.  Calculations by Newfound Research.  Tactical Credit strategy returns are hypothetical and backtested.  Returns gross of all management fees and taxes, but net of underlying fund fees.  Total return series assumes the reinvestment of all distributions.

Conclusion

In this commentary, we explored a tactical credit strategy that switched between high yield corporate bonds and core fixed income.  We decompose these strategies into a 50% high yield / 50% core fixed income portfolio that is overlaid with 50% exposure to a dollar-neutral long/short strategy that captures the tactical tilts.  We focus our exploration on the dollar-neutral long/short portfolio, as it isolates the active bets of the strategy.

Using cross-sectional momentum, we found that short-term signals with formation periods ranging from 3-to-5 months were statistically significant, so long as the holding period was sufficiently short.

We used this information to construct an ensemble strategy made out of more than 1,800 underlying strategy specifications.  Consistent with past research, we found that the ensemble closely tracked the median annualized return of the underlying strategies, but had significantly lower volatility and maximum drawdown, leading to a higher information ratio.

We then attempted to deconstruct where the strategy generated its returns from.  We found that a significant proportion of total returns were achieved during periods when credit spreads were above their long-term median and expanding.  This is consistent with periods of economic volatility such as 2000-2003 and 2008-2009.

The strategy also benefited from harvesting net carry differences between high yield and core fixed income.  Explicitly calculating strategy price and total return, we find that this carry component accounts for approximately 25% of strategy returns.

The impact of the 2000-2003 and 2008-2009 periods on strategy returns should not be understated.   Removing these time periods reduced strategy returns from 2.9% to 1.6% annualized. Interestingly, however, the proportion of total return explained by net carry only increased from 25% to 50%, potentially indicating that the strategy was still able to harvest some opportunities in changing credit spreads.

For investors evaluating these types of strategies, cost will be an important component.  While environments like 2008 may lead to opportunities for significant out-performance, without them the strategy may offer anemic returns.  This is especially true when we recall that a long-only implementation only has 50% implicit exposure to the long/short strategy we evaluated in this piece.

Thus, the 2.9% annualized return is really closer to a 1.5% annualized excess return above the 50/50 portfolio.  For the ex-crisis periods, the number is closer to 0.8% annualized.  When we consider that this analysis was done without explicit consideration for management costs or trading costs and we have yet to apply an appropriate expectation haircut given the fact that this analysis was all backtested, there may not be sufficient juice to squeeze.

That said, we only evaluated a single signal in this piece.  Combining momentum with valuation, carry, or even macro-economic signals may lead to significantly better performance.  Further, high yield corporates is a space where empirical evidence suggests that security selection can make a large difference.  Careful selection of funds may lead to meaningfully better performance than just broad asset class exposure.

 


 

Page 7 of 16

Powered by WordPress & Theme by Anders Norén