The Research Library of Newfound Research

Category: Weekly Commentary Page 1 of 21

Liquidity Cascades: The Coordinated Risk of Uncoordinated Market Participants

This paper is unlike any research we’ve shared in the past. Within we dive into the circumstantial evidence surrounding the “weird” behavior many investors believe markets are exhibiting. We tackle narratives such as the impact of central bank intervention, the growing scale of passive / indexed investing, and asymmetric liquidity provisioning.

Spoiler: Individually, the evidence for these narratives may be nothing more than circumstantial. In conjunction, however, they share pro-cyclical patterns that put pressure upon the same latent risk: liquidity.

In the last part of the paper we discuss some ideas for how investors might try to build portfolios that can both seek to exploit these dynamics as well as remain resilient to them.

Read it now.

Heads I Win, Tails I Hedge

This post is available as a PDF download here.

Summary

  • For hedging strategies, there is often a trade-off between degree, certainty, and cost.
  • Put options have high certainty and typically offer a high degree of protection, making them costly to hold and roll over the long run.
  • In this note, we briefly explore the application of different tactical signals to a 9-month, 25-delta rolling put strategy in an effort to reduce long-term costs.
  • We find that signals based upon volatility appear to perform significantly better than signals based upon price changes, likely due, at least in part, to the nature of the put we are purchasing, which has significant sensitivity to changes in implied volatilities.
  • These results must be taken with a significant grain of salt due to the low number of actual crisis events to measure against. Furthermore, these results are not applicable for investors for whom a certain degree of loss would be disruptive to their financial plan or needs (e.g. impacting withdrawal / spending plans or forcing the liquidation of assets).  For other investors, however, the tactical application of put options may represent an interesting pay-off profile.

In managing risk, there are three primary trade-offs to consider: degree, cost, and certainty.

Degree measures how much protection we are looking too get.  Rather than thinking of degree as how much of our portfolio we’re looking to protect (e.g. 10% vs 100% of our notional exposure), we want to think of this more in terms of the loss level we want the protection to begin at.  For example, degree captures whether we want to protect against all losses or just losses greater than 30%.

Cost captures how much we must pay for our protection.  This cost can be explicit (i.e. we pay a known, up-front premium) or implicit (e.g. whipsaw cost in trend following).

Finally, certainty captures how reliable the hedge is.  A centrally cleared put option, for example, has a very high degree of certainty.  Buying a call option on Treasury bonds (perhaps to benefit from the materialization of a flight-to-safety trade or as a bet on Fed policy during a crisis) carries with it some basis risk if our primary goal is to protect against equity losses.

Like many trade-offs in life, this is one of those “pick two” cases.  You can have a high degree of protection with high reliability, but it will cost you a lot.  If you want to reduce the cost, you’ll need to either reduce the degree of protection or the certainty.

Rather than trying to find the holy grail of high degree, high certainty, and low cost, our time is likely better spent first considering the axis by which we are constrained.  For example, if a 50% loss represents a catastrophic outcome (e.g. impacting withdrawal / spending plans and potentially having knock-on effects in creating forced asset sales), then we can seek to maximize certainty and minimize cost for this specific scenario.  On the other hand, if we cannot afford to spend more than 300 basis points a year on risk management, then we can try to maximize degree and certainty for that budget.

Put options, by definition, have a high degree of certainty, and therefore tend to carry a fairly substantial cost.  For example, below we plot the return of a put option strategy that rolls 9-month, 25-delta puts each month, purchasing enough puts to cover 100% of the S&P 500.

Source: DiscountOptionsData.com.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

Despite offering meaningful returns during the 2008 economic crisis and the recent March 2020 COVID-19 panic, this strategy has still lost -2.3% annualized.

To be fair, this is a very naïve tail hedging strategy.  There are no considerations for generating offsetting carry (e.g. a put ratio approach), pro-active monetization, trade conversion (e.g. puts to put spreads), reasonable basis risk trades, or exchanging between linear and non-linear trades.

And it may not be wholly fair to evaluate the returns of a tail risk strategy in isolation.  After all, it may help increase the geometric returns of an equity portfolio substantially if appropriately rebalanced.

Nevertheless, this example highlights that if we want to combine a high degree of protection with certainty, it should carry relatively high cost.

In this commentary we will explore a few ideas for dynamically employing put options, attempting to maintain relatively high certainty while minimizing cost.

Tactical Signals

Using tactical signals to identify when to buy put options is akin to waiting to smell smoke before calling your agent to buy fire insurance.  It may save significant cost over the long run, but you risk failing to have protection in periods where you cannot get to the phone fast enough or by the time that you do, the cost of insurance is prohibitive.

Nevertheless, in cases where a tail hedge is not necessary (i.e. true knock-out conditions) but simply preferred, tactical tail hedging may provide an attractive payoff.

Below we explore a variety of signals which may indicate elevated risk going forward.  At the core of our approach will be the 9-month 25-delta put strategy we introduced above.  For each of our signals, when the signal indicates rising risk, we will buy into the put strategy.  Otherwise, we will assume a 0% return cash position.

It should be stressed that this is a rather general approach to what can be a highly specific problem for allocators.  By rolling far-dated puts each month, our strategy will have exhibit substantial convexity to changes in implied volatility, whereas a short-dated put would exhibit greater convexity to changes in the S&P 500 itself.  This means that our approach may not be suitable for protecting against slow, tepid market declines.

Fortunately, market declines and changes in volatility have historically exhibited significant negative correlation.  Therefore, for large and rapid declines, we can generally expect the value of our long-dated, deep out-of-the-money puts to appreciate significantly.

Given that our options will be highly sensitive to changes in implied volatility, we explore signals that are not only potentially related to losses in U.S. equities, but also appreciation of expected volatility.

IndicatorMeasureThesis
S&P 500 Returns63-Day ReturnNegative returns in the S&P 500 may forecast negative returns going forward.
S&P 500 ReturnsZ-Score of 63-Day Return (126-Day)Below average returns in the S&P 500 may forecast negative returns going forward.
S&P 500 Trend30×120 EWMANegative trend signals in the S&P 500 may forecast negative returns going forward.
1M IV63-Day ChangeIncreasing implied volatility may be a sign that investors believe risk is increasing.
1M RV63-Day ChangeIncreasing realized volatility may be a sign that volatility will increase in the future.
1M RV – IV63-Day ChangeIf realized volatility is increasing beyond implied volatility, it may be a sign that protection is underpriced.
1M – 3-Month RV63-Day ChangeIf short-term volatility is higher than medium-term volatility, it may be a sign that risk is increasing.
Skew (1M 25 Delta Put – Call)63-Day ChangeIf the skew of the volatility curve is increasing, it may be a sign that investor demand for protection has gone up.
Short Volatility Strategy63-Day ReturnIf the return of a short volatility strategy is negative, it may be a sign that risk is increasing.
High Yield Credit Spreads63-Day ChangeIf markets are demanding an increasing premium for credit risk, it may be a sign that economic risk is increasing.

 

Why would we expect tactical signals to work?  The core thesis is partially behavioral and partially structural.  On the behavioral side, we expect investors to first under- and then over-react to regime changes in the market.  Ideally tactical signals can cue us into these changes before the herd catches on, and then we can benefit as the herd reprices markets.

From a less irrational perspective, we expect investors to exhibit “knock-out” conditions whereby they become forced sellers.  For example, as prices fall and volatility picks up, collateral requirements may go up.  This can cause forced de-leveraging, further driving down prices and further driving up collateral requirements.  This type of positive feedback loop can create liquidity and credit spirals in markets.  Therefore, by buying protection at the early signs of a potential market dislocation, we can potentially protect ourselves from the non-economically driven behavior of other market participants.

Note that we focus on fairly short measurement periods.  This is for two reasons.  First, risk can reprice rapidly, so we want to make sure.  Secondly, put options allow us to explicitly measure, per day, how much we’ll pay in premium for the non-linear payoff we are purchasing.  This massively asymmetric payoff profile means that we may be able to afford more false positives, unlike trend following where our capital may be meaningfully eroded by whipsaw or jump risk.

Below we plot the returns of applying each signal.  When a signal indicates heightened risk (e.g. increasing volatility or declining prices), we purchase the put strategy index.  We tranche positions over a 20-trading-day period, meaning that if a signal stays constant, we’ll increase our position by 5% a day.  If a signal turns on and then immediately off, we’ll carry at least a 5% position for 20 trading days.

Source: DiscountOptionsData.com; Tiingo.com; St. Louis Federal Reserve.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

We can see that all of the approaches significantly cut down on the premium paid for protection.  The “worst” performing strategy – the 63-day return z-score – had a loss of -1.0% annualized compared to the -2.3% for the constant put strategy.

Of course, just sitting in cash the entire time would have reduced the cost.  The question we should ask is: how much did we forego in protection?

Below we plot the performance of these approaches over several of the larger market loss scenarios over the last 15 years.

Source: DiscountOptionsData.com; Tiingo.com; St. Louis Federal Reserve.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

We can see that the volatility-based models (e.g. changes in 1M IV, RV, RV – IV, and Skew) tend to do a fairly consistent job their up-capture, whereas performance-based measures on the S&P 500 (e.g. 63-day returns or 30×120 EWMA) are much less consistent.  This is particularly apparent in the recent COVID-19 crisis, where return-based signals were too delayed.  Interestingly, this lower upside capture was not met with decreased cost: the return-based signals were some of the worst performing models.  Only the high yield credit spread model seemed to offer a balanced trade-off.

Interestingly, signals derived from a short-volatility strategy were negative in 2008.  In this strategy we are short an at-the-money call and put.  Calling this strategy short-volatility may be a bit of a misnomer, as it will profit when realized returns stay range-bound, which is different than explicitly generating a return from declining volatility.  Nevertheless, we can see that the return profile of this approach, plotted below, looks very much like “picking up pennies in front of a steam roller.”  Unfortunately, the steam roller seems to manifest rather quickly, so the 63-day return signal may be too slow in this case.

Source: DiscountOptionsData.com; Tiingo.com; St. Louis Federal Reserve.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

Conclusion

Given their high certainty and degree of protection offered, put options can be prohibitively expensive (particularly after a significant market decline, when demand for protection often goes up).  For investors for whom a certain level of loss is truly disruptive to operations or creates a knock-out condition, protection is not an option.  For others, though, the selective use of put options may provide an interesting, diversifying payoff profile.

In this commentary, we briefly explored the application of different tactical signals to a far-dated, deep out-of-the-money put strategy.  Not surprisingly, we found that all of the approaches helped reduce the annualized cost of the put strategy.  However, not all of the signals provided meaningful upside capture.  Given that there are few actual periods where the put strategy offers positive returns, missing these gains defeats the whole purpose of the exercise.

We found that volatility-based signals worked best.  This may be due to a combination of two facts: (1) the put strategy has meaningful sensitivity to changes in implied volatility, and (2) the put strategy has an asymmetric payoff profile, reducing the cost of false positives.

These results should taken with a large grain of salt, however, as the number of meaningful payoff periods is very low.  Future research should explore how these signals work when applied to different equity indices, ETFs, or even individual stocks.

Option-Based Trend Following

This post is available as a PDF download here.

Summary

  • The convex payoff profile of trend following strategies naturally lends itself to comparative analysis with option strategies.
  • To isolate the two extremes of paying for whipsaw – either up front or in arrears – we replicate an option strategy that buys 1-month at-the-money calls and puts based on the trend signal.
  • We find that while option premiums steadily eat away at the balance of the options portfolio, the avoidance of large whipsaw events gives the strategy a boost at key times over the past 15 years, especially recently.
  • We examine how this whipsaw cost fits into the historical context of the options strategy and explore some simple ways to shift between the option-based trend following and the standard model.
  • The extent that whipsaw can be mitigated while still maintaining the potential to earn diversified returns is likely limited, but the optimal blend of trend following and options can be a beneficial guideline for investors to weather both sudden and prolonged drawdowns.

The non-linear payoff of trend following strategies has many similarities to options strategies, and by way of analogy, we can often gain insight into which market environments will favor trend following and why.

In our previous research piece, Straddles and Trend Following, we looked at purchasing straddles – that is, a call option and a put option – with a strike price tied to the anchor price of the trend following model. For example, if the trend following model invested in equities when the return over the past 12 months was positive, for a security that was at $100 12-months ago and is at $120 today, we would purchase a call and a put option with a strike price of $100. In this case, the call would be 20% in-the-money (ITM) and the put would be out-of-the-money (OTM).

In essence, this strategy acted like an insurance policy where the payout was tied to a reversion in the trend signal, and the premium paid when the trend signal was strong was small.

This concept of insurance is an important discussion topic in trend following strategies. The risk we must manage in these types of strategies, either directly through insurance or some other indirect means like diversification, is whipsaw.

In this commentary, we will construct an options strategy that is similar to a trend following strategy. The option strategy will pay a premium up-front to avoid whipsaw. By comparing this strategy to trend following that bears the full risk of whipsaw, we can set a better practical bound for how much investors should expect to pay or earn for bearing this risk.

Methodology and Data

For this analysis, we will use the S&P 500 index for equity returns, the 1-year LIBOR rate as the risk-free rate, and options data on the S&P 500 (SPX options).

To bridge the gap between practice and abstraction, we will utilize a volatility surface calibrated to real option data to price options. We will constrain our SPX options to $5 increments and interpolate total implied variance to get prices for options that were either illiquid or not included in the data set.

For the most part, we will stick to options that expire on the third Friday of each month and will mention when we deviate from that assumption.

The long/short trend equity strategy looks at total returns of equities over 12 months. If this return is positive, the strategy invests in equities for the following month. If the return is negative, the strategy shorts equities for the following month and earns the risk-free rate on the cash. The strategy is rebalanced monthly on the options expiration dates.

For the option-based trend strategy, on each rebalance date, we will purchase a 1-month call if the trend signal is positive or a put if the trend signal is negative. We will purchase all options at-the-money (ATM) and hold them to expiration. The strategy is fully cash-collateralized. Any premium is paid on the options roll date, interest is earned on the remaining account balance, and the option payout is realized on the next roll date.

Why are we now using ATM options when previous research used ITM and OTM options, potentially deeply ITM or OTM?

Here we are looking to isolate the cost of whipsaw in the premium paid for the option while earning a payout that is close to that of the underlying in the event that our trend signal is correct. If we utilized OTM options, then our premium would be lower but we would realize smaller gains if the underlying followed the trend. ITM options would have downside exposure before the protection kicked in.

We are also not using straddles since we do not want to pay extra premium for the chance to profit off a whipsaw. The underlying assumption here is that there is value in the trend following signal. Either strategy is able to capitalize on that (i.e. it’s the control variable); the strategies primarily differ in their treatment of whipsaw costs.

The High Cost of ATM Options

The built-in whipsaw protection in the options does not come cheap. The chart below shows the –L/S trend following strategy–, the –option-based trend strategy–, and the ratio of the two (dotted).Source: DiscountOptionsData.com.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

During normal market environments and even in prolonged equity-market drawdown periods like 2008, trend following outperformed the option-based strategy. Earning the full return on the underlying equity is generally beneficial.

However, something that is “generally beneficial” can be erased very quickly. In March 2020, the trend following strategy reverted back to the level of the option-based strategy. If you had only looked at cumulative returns over those 15 years, you would not be able to tell much difference between the two.

The following chart highlights these tail effects.

Source: DiscountOptionsData.com.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

In most months, the option-based strategy forfeits its ~1.5% premium for the ATM option. The 75th percentile cutoff is 2.2% and the 90th percentile cutoff is 2.9%. These premiums have occasionally spiked to 6-7%.

While these premiums are not always forfeited without some offsetting gain, they are always paid relative to the trend following strategy.

A 3% whipsaw event in trend should definitely not be a surprise based on the typical up-front cost of the option strategy.

Source: DiscountOptionsData.com.  Calculations by Newfound Research.

But What About a 30% Whipsaw?

Now that’s a good question.

Up until March 2020, for the 15 years prior, the largest whipsaws relative to the options strategy were 12-13%. This is the epitome of tail risk, and it can be disheartening to think that now that we have seen 30% underperformance, we should probably expect more at some point in the (hopefully very distant) future.

However, a richer sample set can shed some light on this very poor performance.

Let’s relax our assumption that we roll the options and rebalance the trend strategies on the third Friday of the month and instead allow rebalances and rolls on any day in the month. Since we are dealing with one-month options, this is not beyond implementation since there are typically options listed that expire on Monday, Wednesday, and Friday.

The chart below shows all of these option strategies and how large of an effect that roll / rebalance timing luck can have.

Source: DiscountOptionsData.com.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

With timing luck in both the options strategies and trend following, there can be large effects when the luck cuts opposite ways.

The worst returns between rebalances of trend following relative to each options strategy highlight how bad the realized path in March 2020 truly was.

Source: DiscountOptionsData.com.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

In many of the trend following and option strategies pairs, the worst underperformance of trend following over any monthlong period was around 10%.

Returning to the premise that the options strategies are analogous to trend following, we see the same effects of timing luck that we have explored in previous research: effects that make comparing variants of the same strategy or similar strategies more nuanced. Whether an option strategy is used for research, benchmarking, or active investing, the implications of this timing luck should be taken into account.

But even without taking a multi-model approach at this point to the options strategy, can we move toward a deeper understanding of when it may be an effective way to offset some of the risk of whipsaw?

I’d Gladly Pay You Tuesday for a Whipsaw Risk Today

With the two extremes of paying for whipsaw up front with options and being fully exposed to whipsaw through trend following, perhaps there is a way to tailor this whipsaw risk profile. If the risk of whipsaw is elevated but the cost of paying for the insurance is cheap, then the options strategy may be favorable. On the other hand, if option premiums are high, trend following may more efficiently capture the market returns.

The price of the options (or their implied volatilities) is a natural place to start investigating this topic since it encapsulates the premium for whipsaw insurance. The problem is that it may not be a reliable signal if there is no barrier to efficiency in the options market, either behavioral or structural.

Comparing the ATM option implied volatilities with the trend signal (12-month trailing returns), we see a negative correlation, which indicates that the options-based strategy will have a higher hurdle rate of return in strongly downtrending market environments.

Source: DiscountOptionsData.com.  Calculations by Newfound Research. 

But this is only one piece of the puzzle.

Do these implied volatilities relate to the forward 1-month returns for the S&P 500?

Based on the above scatterplot: not really. However, since we are merely sticking implied volatility in the middle of the trend following signal and the forward return, and we believe that trend following works over the long run, then we must believe there is some relationship between implied volatility and forward returns.

While this monthly trend following signal is directionally correct over the next month 60% of the time, historically, that says nothing about the magnitude of the returns based on the signal.

Without looking too much into the data to avoid overfitting a model, we will set a simple cutoff of 20% implied volatility. If options cost more than that, we will utilize trend following. If they cost less, we will invest in the options strategy.

We will also compare it to a 50/50 blend of the two.

Source: DiscountOptionsData.com.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

The switching strategy (gray line) worked well until around 2013 when the option prices were cheap, but the risk of whipsaw was not realized. It did make it through 2015, 2016 and 4Q 2018 better than trend following.

When viewed in a broader context of a portfolio, since these are alternative strategies, it does not take a huge allocation to make a difference. These strategies manage equity risk, so we can pair them with an allocation to the S&P 500 (SPY) and see how the aggregate statistics are affected over the period from 2005 to April 2020.

The chart below plots the efficient frontiers of allocations to 100% SPY at the point of convergence on the right of the graph) to 40% SPY on the left of the graph with the remainder allocated to the risk- management strategy.

The Sharpe ratio is maximized at a 35% allocation to the switching strategy, a 25% allocation to the option-based strategy, and 10% for the trend following strategy.

Source: DiscountOptionsData.com.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

Conclusion

In this research note, we explored the link between trend following and options strategies using 1-month ATM put and call options, depending on the sign of the trend.

The cost of ATM options Is generally 1.5% of the portfolio value, but the fact that this cost can spike upwards of 9% should justify larger whipsaws in trend following strategies. Very large whipsaws, like in March 2020, not only show that the cost can be seemingly unbounded but also that there is significant exposure to timing luck based upon the option roll dates.

Then, we moved on to investigating a simple way to allocate between the two strategies based upon the cost of the options, When the options were cheap, we used that strategy, and when they were expensive, we invested in the trend following strategy. A modest allocation is enough to make a different in the realized efficient frontier.

Deciding to pay the up-front payment of the whipsaw insurance premium, bear the full risk a whipsaw, or land somewhere in between is largely up to investor preferences. It is risky to have a large downside potential, but the added benefit of no premiums can be enough to offset the risk.

An implied volatility threshold was a rather crude signal for assessing the risk of whipsaw and the price of insuring against it. Further research into one or multiple signals and a robust process for aggregating them into an investment decision is needed to make more definitive statements on when trend following is better than options or vice versa. The extent that whipsaw can be mitigated while still maintaining the potential to earn diversified returns is likely limited, but the optimal blend of trend following and options can be a beneficial guideline for investors to weather both sudden and prolonged drawdowns.

Tail Hedging

This post is available as a PDF download here.

Summary

  • The March 2020 equity market sell-off has caused many investors to re-investigate the potential benefits of tail risk hedging programs.
  • Academic support for these programs is quite limited, and many research papers conclude that the cost of implementation for naïve put strategies out-weighs the potential payoff benefits.
  • However, many of these studies only consider strategies that hold options to expiration. This means that investors can only profit from damage assessed.  By rolling put options prior to expiration, investors can profit from damage
  • In this research note we demonstrate that holding to expiration is not a required feature of a successful tail hedging program.
  • Furthermore, we demonstrate that once that requirement is lifted, the most valuable component of a tail risk hedging program may not actually be the direct link to damage assessed, but rather the ability to profit in a convex manner from the market’s re-pricing of risk.

“To hedge, or not to hedge, that is the question.”

Nothing brings tail risk management back to the forefront of investors’ minds like a market crisis.  Despite the broad interest, the jury is still out as to the effectiveness of these approaches.

Yet if an investor is subject to a knock-out barrier – i.e. a point of loss that creates permanent impairment – then insuring against that loss is critical.  This is often the case for retirees or university endowments, as withdrawal rates increase non-linearly with portfolio drawdowns.  In this case, the question is not whether to hedge, but rather about the most cost-effective means of hedging.

Some academics and practitioners have argued that put-based portfolio protection is prohibitively expensive, failing to keep pace with a simple beta-equivalent equity portfolio.  They also highlight that naïve put strategies – such as holding 10% out-of-the-money (“OTM”) puts to expiration – are inherently path dependent.

Yet empirical evidence may fail us entirely in this debate.  After all, if the true probability and magnitude of tail events is unknowable (as markets have fat tails whose actual distribution is hidden from us), then prior empirical evidence may not adequately inform us about latent risks.  After all, by their nature, tail events are rare.  Therefore, drawing any informed conclusions from tail event data will be shrouded in a large degree of statistical uncertainty.

Let us start by saying that the goal of this research note is not to prove whether tail risk hedging is or is not cost effective.  Rather, our goal is to demonstrate some of the complexities and nuances that make the conversation difficult.

And this piece will only scratch the surface.  We’ll be focusing specifically on buying put options on the S&P 500.  We will not discuss pro-active monetization strategies (i.e. conversion of our hedge into cash), trade conversion (e.g. converting puts into put spreads), basis risk trades (e.g. buying calls on U.S. Treasuries instead of puts on equities), or exchanging non-linear for linear hedges (e.g. puts for short equity futures).

Given that we are ignoring all these components – all of which are important considerations in any actively managed tail hedging strategy – it does call into question the completeness of this note.  While we hope to tackle these topics in later pieces, we highlight their absence specifically to point out that tail risk hedging is a highly nuanced topic.

So, what do we hope to achieve?

We aim to demonstrate that the path dependency risk of tail hedging strategies may be overstated and that the true value of deep tail hedges emerges not from the actual insurance of loss but the rapid repricing of risk.

A Quantitative Aside

Options data is notoriously dirty, and therefore the results of back testing options strategies can be highly suspect.  In this note, rather than price our returns based upon historical options data (which may be stale or have prohibitively wide bid/ask spreads), we fit a volatility surface to that data and price our options based upon that surface.

Specifically, each trading day we fit a quadratic curve to log-moneyness and implied total variance for each quoted maturity.  This not only allows us to reduce the impact of dirty data, but it allows us to price any strike and maturity combination.

While we limit ourselves only to using listed maturity dates, we do stray from listed strikes.  For example, in quoting a 10% out-of-the-money put, rather than using the listed put option that would be closest to that strike, we just assume the option for that strike exists.

This approach means, definitively, that results herein were not actually achievable by any investor.  However, since we will be making comparisons across different option strategy implementations, we do not believe this is a meaningful impact to our results.

To reduce the impacts of rebalance timing luck, all strategies are implemented with overlapping portfolios.  For example, for a strategy that buys 3-month put options and holds them to maturity would be implemented with three overlapping sub-portfolios that each roll on discrete 3-month periods but do so on different months.

Finally, the indices depicted herein are designed such that they match notional coverage of the S&P 500 (e.g. 1 put per share of S&P 500) when implemented as a 100% notional overlay and rebalanced monthly upon option expiration.

The Path Dependency of Holding to Expiration

One of the arguments often made against tail hedging is the large degree of path dependency the strategy can exhibit.  For example, consider an investor who buys 10% OTM put options each quarter.  If the market falls less than 10% each quarter, the options will provide no protection.  Therefore, when holding to expiration, we need drawdowns to precisely coincide with our holding period to achieve maximum protection.

But is there something inherently special about holding to expiration?  For popular indices and ETFs, there are liquid options markets available, allowing us to buy and sell at any time.  What occurs if we roll our options a month or two before expiration?

Below we plot the results of doing precisely this.  In the first strategy, we purchase 10% OTM puts and hold them to expiration.  In the second strategy, we purchase the same 10% OTM puts, but roll them a month before expiration.

Source: DiscountOptionsData.com.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

We see nearly identical long-term returns and, more importantly, the returns during the 2008 crisis and the recent March turmoil are indistinguishable.  And we outright skipped holding each option for 1/3rd of its life!

Our results seem to suggest that the strategies are less path dependent than originally argued.

An alternative explanation, however, may be that during these crises our options end up being so deep in the money that it does not matter whether we roll them early or not.  One way to evaluate this hypothesis is to look at the rolling delta profile – how sensitive our option strategy is to changes in the underlying index – over time.

Source: DiscountOptionsData.com.  Calculations by Newfound Research. 

We can see is that during calm market environments, the two strategies exhibit nearly identical delta profiles.  However, in 2008, August 2011, Q4 2018, and March 2020 the delta of the strategy that holds to expiration is substantially more negative.  For example, in October 2008, the strategy that holds to expiration had a delta of -2.75 whereas the strategy that rolls had a delta of -1.77.  This means that for each 1% the S&P 500 declines, we estimate that the strategies would gain +2.75% and +1.77% respectively (ignoring other sensitivities for the moment).

Yet, despite this added sensitivity, the strategy that holds to expiration does not seem to offer meaningfully improved returns during these crisis periods.

Source: DiscountOptionsData.com.  Calculations by Newfound Research. 

Part of the answer to this conundrum is theta, which measures the rate at which options lose their value over time.  We can see that during these crises the theta of the strategy that holds to expiration spikes significantly, as with little time left the value of the option will be rapidly pulled towards the final payoff and variables like volatility will no longer have any impact.

What is clear is that delta is only part of the equation.  In fact, for tail hedges, it may not even be the most important piece.

Convexity in Volatility

To provide a bit more insight, we can try to contrive an example whereby we know that ending in the money should not have been a primary driver of returns.

Specifically, we will construct two strategies that buy 3-month put options and roll each month.  In the first strategy, the put option will just be 10% OTM and in the second strategy it will be 30% OTM.  As we expect the option in the second strategy to be significantly cheaper, we set an explicit budget of 60 basis points of our capital each month.1

Below we plot the results of these strategies.

Source: DiscountOptionsData.com.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

In March 2020, the 10% OTM put strategy returned 13.4% in and the 30% OTM put strategy returned 39.3%.  From prior trough (February 19th) to peak (March 23rd), the strategies returned 18.4% and 46.5% respectively.

This is a stark difference considering that the 10% OTM put was definitively in-the-money as of March 20th (when it was rolled) and the 30% OTM strategy was on the cusp.  Consider the actual trades placed:

  • 10% OTM Strategy: Buy a 3-month 10% OTM put on February 21st and sell a 2-month 23.3% ITM put on March 20th. When bought, the option had an implied volatility of 20.9% and a price of $45.452; when sold it had an implied volatility of 39.5% and a price of $1428.21 for a 3042% return.
  • 30% OTM Strategy: Buy a 3-month 30% OTM put on February 21st and sell a 2-month 1.4% ITM put on March 20th. When bought, the option had an implied volatility of 35.0% and a price of $5.42; when sold it had an implied volatility of 53.8% and a price of $425.85 for a 7757% return.

It is also worth noting that since we are spending a fixed budget, we can buy 8.38 contracts of the 30% OTM put for every contract of the 10% OTM put.

So why did the 30% OTM put appreciate so much more?  Below we plot the position scaled sensitivities (i.e. dividing by the cost per contract) to changes in the S&P 500 (“delta”), changes in implied volatility (“vega”), and their respective derivatives (“gamma” and “volga”).

Source: DiscountOptionsData.com.  Calculations by Newfound Research. 

We can see that as of February 21st, the sensitivities are nearly identical for delta, gamma, and vega.  But note the difference in volga.

What is volga?  Volga tells us how much the option’s sensitivity to implied volatility (“vega”) changes as implied volatility itself changes.  If we think of vega as a kind of velocity, volga would be acceleration.

A positive vega tells us that the option will gain value as implied volatility goes up.  A positive volga tells us that the option will gain value at an accelerating rate as implied volatility goes up.  Ultimately, this means the price of the option is convex with respect to changes in implied volatility.

So as implied volatilities climbed during the March turmoil, not only did the option gain value due to its positive vega, but it did so at an accelerating rate thanks to its positive volga.

Arguably this is one of the key features we are buying when we buy a deep OTM put.3  We do not need the option to end in the money to provide a meaningful tail hedge; rather, the value is derived from large moves in implied volatility as the market re-prices risk.

Indeed, if we perform the same analysis for September and October 2008, we see an almost identical situation.

Source: DiscountOptionsData.com.  Calculations by Newfound Research. 

Conclusion

In this research note, we aimed to address one of the critiques against tail risk hedging: namely that it is highly path dependent.  For naively implemented strategies that hold options to expiration, this may be the case.  However, we have demonstrated in this piece that holding to expiration is not a necessary condition of a tail hedging program.

In a contrived example, we explore the return profile of a strategy that rolls 10% OTM put options and a strategy that rolls 30% OTM put options.  We find that the latter offered significantly better returns in March 2020 despite the fact the options sold were barely in the money.

We argue that the primary driver of value in the 30% OTM put is the price convexity it offers with respect to implied volatility.  While the 10% OTM put has positive sensitivity to changes in implied volatility, that sensitivity does not change meaningfully as implied volatility changes.  On the other hand, the 30% OTM put has both positive vega and volga, which means that vega will increase with implied volatility.  This convexity makes the option particularly sensitive to large re-pricings of market risk.

It is common to think of put options as insurance contracts.  However, with insurance contracts we receive a payout based upon damage assessed.  The key difference with options is that we have the ability to monetize them based upon potential damage perceived.  When we remove the expectation of holding options into expiration (and therefore only monetizing damage assessed), we potentially unlock the ability to profit from more than just changes in underlying price.

 


 

Defensive Equity with Machine Learning

This post is available as a PDF download here.

Summary

  • Defensive equity strategies are comprised of stocks that lose less than the market during bear markets while keeping up with the market during a bull market.
  • Coarse sorts on metrics such as volatility, beta, value, and momentum lead to diversified portfolios but have mixed results in terms of their defensive characteristics, especially through different crisis periods that may favor one metric over another.
  • Using non-linear machine learning techniques is a desirable way to identify certain combinations of factors that lead to better defensive equity strategies over multiple periods.
  • By applying techniques such as random forests and gradient boosting to two sample defensive equity metrics, we find that machine learning does not add significant value over a low volatility sort, given the features included in the model.
  • While this by no means rules out the benefits of machine learning techniques, it shows how a blanket application of it is not a panacea for investing during crisis periods.

There is no shortage of hypotheses as to what characteristics define a stock that will outperform in a bear market.  Some argue that value stocks should perform well, given their relative valuation buffer (the “less far to fall” argument).  Some argue for a focus on balance sheet strength while others argue that cash-flow is the ultimate life blood of a company and should be prioritized.  There are even arguments for industry preferences based upon economic cyclicality.

Each recession and crisis is unique, however, and therefore the characteristics of stocks that fare best will likely change.  For example, the dot-com run-up caused a large number of real-economy businesses to be sorted into the “cheap” bucket of the value factor.  These companies also tended to have higher quality earnings and lower beta / volatility than the dot-com stocks.

Common sense would indicate that unconstrained value may be a natural counter-hedge towards large, speculative bubbles, but we need only look towards 2008 – a credit and liquidity event – to see that value is not a panacea for every type of crisis.

It is for this reason that some investors prefer to take their cues from market-informed metrics such as beta, volatility, momentum, or trading volume.

Regardless of approach, there are some philosophical limitations we should consider when it comes to expectations with defensive equity portfolios.  First, if we were able to identify an approach that could avoid market losses, then we would expect that strategy to also have negative alpha.1 If this were not the case, we could construct an arbitrage.

Therefore, in designing a defensive equity portfolio, our aim should be to provide ample downside protection against market losses while minimizing the relative upside participation cost of doing so.

Traditional linear sorts – such as buying the lowest volatility stocks – are coarse by design.  They aim to robustly capture a general truth and hedge missed subtleties through diversification.  For example, while some stocks deserve to be cheap and some stocks are expensive for good reason, naïve value sorts will do little to distinguish them from those that are unjustifiably cheap or rich.

For a defensive equity portfolio, however, this coarseness may not only reduce effectiveness, but it may also increase the implicit cost.  Therefore, in this note we implement non-linear techniques in an effort to more precisely identify combinations of characteristics that may create a more effective defensive equity strategy.

The Strategy Objective

To start, we must begin by defining precisely what we mean by a “defensive equity strategy.”  What are the characteristics that would make us label one security as defensive and another as not?  Or, potentially better, is there a characteristic that allows us to rank securities on a gradient of defensiveness?

This is not a trivial decision, as our entire exercise will attempt to maximize the probability of correctly identifying securities with this characteristic.

As our goal is to find those securities which provide the most protection during equity market routs but bleed the least during equity market rallies, we chose a metric that scored how closely a stock’s return reflected the payoff of a call option on the S&P 500 over the next 63 trading days (approximately 3 months).

In other words, if the S&P 500 is positive over the next 63 trading days, the score of a security is equal to the squared difference between its return and the S&P 500’s return.  If the market’s return is negative, the score of a security is simply its squared return.

To determine whether this metric reflects the type of profile we want, we can create a long/short portfolio.  Each month we rank securities by their scores and select the quintile with the lowest scores.  Securities are then weighted by their market capitalization.  Securities are held for three months and the portfolio is implemented with three tranches.  The short leg of the portfolio is the market rather than the highest quintile, as we are explicitly trying to identify defense against the market.

To create a scalable solution, we restrict our investable universe to those in the top 1,000 securities by market capitalization.

We plot the performance below.

Source: Sharadar Fundamentals.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

We can see that the strategy is relatively flat during bull markets (1998-2000, 2003-2007, 2011-2015, 2016-2018), but rallies during bear markets and sudden market shocks (2000-2003, 2008, 2011, 2015/2016, Q4 2018, and 2020).

Interestingly, despite having no sector constraints and not explicitly targeting tracking error at the portfolio level, the resulting portfolio ends up well diversified across sectors, though it does appear to make significant short-term jumps in sector weights.  We can also see an increasing tilt towards Technology over the last 3 years in the portfolio.  In recent months, positions in Financials and Industrials have been almost outright eliminated.

Source: Sharadar Fundamentals.  Calculations by Newfound Research. 

Of course, this metric is explicitly forward looking.  We’re using a crystal ball to peer into the future and identify those stocks that track the best on the way up and protect the best on the way down.  Our goal, then, is to use a variety of company and security characteristics to accurately forecast this score.

We will include a variety of characteristics and features, including:

  • Size: Market Capitalization.
  • Valuation: Book-to-Price, Earnings-to-Price, Free Cash Flow-to-Price, Revenue-to-EV, and EBITDA-to-EV.
  • Momentum: 12-1 Month Return and 1-Month Return.
  • Risk: Beta, Volatility, Idiosyncratic Volatility, and Ulcer Index.
  • Quality: Accruals, ROA, ROE, CFOA, GPOA, Net Margin, Asset Turnover, Leverage, and Payout Ratio.
  • Growth: Internal Growth Rate, EPS Growth, Revenue Growth.

These 24 features are all cross-sectionally ranked at each point in time.  We also include dummy variables for each security to represent sector inclusion as well as whether the company has positive Net Income and whether the company has positive Operating Cash Flow.

Note that we are not including any market regime characteristics, such information about market returns, volatility, interest rates, credit spreads, sentiment, or monetary or fiscal policy.  Had we included such features, our resulting model may end up as a factor switching approach, changing which characteristics it selects based upon the market environment.  This may be an interesting model in its own right, but our goal herein is simply to design a static, non-linear factor sort.

Random Forests

Our first approach will be to apply a random forest algorithm, which is an ensemble learning method.  The approach uses a training data set to build a number of individual decision trees whose results are then re-combined to create the ultimate decision.  By training each tree on a subset of data and considering only a subset of features for each node, we can create trees that may individually have high variance, but as an aggregate forest reduce variance without necessarily increasing bias.

As an example, this means that one tree may be built using a mixture of low volatility and quality features, while another may be built using valuation and momentum features.  Each tree is able to model a non-linear relationship, but by restricting tree depth and building trees using random subsets of data and features, we can prevent overfitting.

There are a number of hyperparameters that can be set to govern the model fit.  For example, we can set the maximum depth of the individual trees as well as the number of trees we want to fit.  Fitting hyperparameters is an art unto itself, and rather than go down the rabbit hole of tuning hyperparameters via cross-validation, we did our best to select reasonable hyper parameters.  We elected to train the model on 50% of our data (March 1998 to March 2009), with a total of 100 trees each with a maximum depth of 2.

The results of the exercise are plotted below.

Source: Sharadar Fundamentals.  Calculations by Newfound Research. 

The performance does appear to provide defensive properties both in- and out-of-sample, with meaningful returns generated in 2000-2002, 2008, Q3 and Q4 of 2011, June 2015 through June 2016, and Q4 2008.

We can see that the allocations also express a number of static sector concentrations (e.g. Consumer Defensive) as well as some cyclical changes (e.g. Finances pre- and post-2007).

We can also gain insight into how the portfolio composition changes by looking at the weighted characteristic scores of the long leg of the portfolio over time.

Source: Sharadar Fundamentals.  Calculations by Newfound Research. 

It is important to remember that characteristics are cross-sectionally ranked across stocks.  For some characteristics, higher is often considered better (e.g. a higher earnings-to-price cheaper is considered cheaper), whereas for other factors lower is better (e.g. lower volatility is considered to have less risk).

We can see that some characteristics are static tilts: higher market capitalization, positive operating cash flow, positive net income, and lower risk characteristics.  Other characteristics are more dynamic.  By 12/2008, the portfolio has tilted heavily towards high momentum stocks.  A year later, the portfolio has tilted heavily towards low momentum stocks.

What is somewhat difficult to disentangle is whether these static and dynamic effects are due to the non-linear model we have developed, or whether it’s simply that applying static tilts results in the dynamic tilts.  For example, if we only applied a low volatility tilt, is it possible that the momentum tilts would emerge naturally?

Unfortunately, the answer appears to be the latter.  If we plot a long/short portfolio that goes long the bottom quintile of stocks ranked on realized 1-year volatility and short the broad market, we see a very familiar equity curve.

Source: Sharadar Fundamentals.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

It would appear that the random forest model effectively identified the benefits of low volatility securities.  And while out-of-sample performance does appear to provide more ample defense during 2011, 2015-2016, and 2018 than the low volatility tilt, it also has significantly greater performance drag.

Gradient Boosting

One potential improvement we might consider is to apply a gradient boosting model.  Rather than simply building our decision trees independently in parallel, we can build them sequentially such that each tree is built on a modified version of the original data set (e.g. increasing the weights of those data points that were harder to classify and decreasing the weights on those that were easier).

Rather than just generalize to a low-volatility proxy, gradient boosting may allow our decision tree process to pick up upon greater subtleties and conditional relationships in the data.  For comparison purposes, we’ll assume the same maximum tree depth and number of trees as the random forest method.

In initially evaluating the importance of features, it does appear that low volatility remains a critical factor, but other characteristics – such as momentum, free cash flow yield, and payout ratio – are close seconds.  This may be a hint that gradient boosting was able to identify more subtle relationships.

Unfortunately, in evaluating the sector characteristics over time, we see a very similar pattern.  Though we can notice that sectors like Technology have received a meaningfully higher allocation with this methodology versus the random forest approach.

Source: Sharadar Fundamentals.  Calculations by Newfound Research. 

If we compare long/short portfolios, we find little meaningful difference to our past results.  Our model simply seems to identify a (historically less effective) low volatility model.

Source: Sharadar Fundamentals.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

Re-Defining Defensiveness

When we set out on this problem, we made a key decision to define a stock’s defensiveness by how closely it is able to replicate the payoff of a call option on the S&P 500.  What if we had elected another definition, though?  For example, we could define defensive stocks as those that minimize the depth and frequency of drawdowns using a measure like the Ulcer Index.

Below we replicate the above tests but use forward 12-month Ulcer Index as our target score (or, more precisely, a security’s forward 12-month cross-sectional Ulcer Index rank).

We again begin by constructing an index that has perfect foresight, buying a market-capitalization weighted portfolio of securities that rank in the lowest quintile of forward 12-month ulcer index.  We see a very different payoff profile than before, with strong performance exhibited in both bull and bear markets.

By focusing on forward 12-month scores rather than 3-month scores, we also see a far steadier sector allocation profile over time.  Interestingly, we still see meaningful sector tilts over time, with sectors like Technology, Financials, and Consumer Defensives coming in and out of favor over time.

Source: Sharadar Fundamentals.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

We again use a gradient boosted random forest model to try to model our target scores.  We find that five of the top six most important features are price return related, either measuring return or risk.

Despite the increased emphasis on momentum, the resulting long/short index still echoes a naïve low-volatility sort.  This is likely because negative momentum and high volatility have become reasonably correlated proxies for one another in recent years.

While returns appear improved from prior attempts, the out-of-sample performance (March 2009 and onward) is almost identical to that of the low-volatility long/short.

Source: Sharadar Fundamentals.  Calculations by Newfound Research.  Returns are hypothetical and backtested.  Returns are gross of all fees including, but not limited to, management fees, transaction fees, and taxes.  Returns assume the reinvestment of all distributions.

Conclusion

In this research note we sought to apply machine learning techniques to factor portfolio construction.  Our goal was to exploit the ability of machine learning models to model non-linear relationships, hoping to come up with a more nuanced definition of a defensive equity portfolio.

In our first test, we defined a security’s defensiveness by how closely it was able to replicate the payoff of a call option on the S&P 500 over rolling 63-day (approximately 3-month) periods.  If the market was up, we wanted to pick stocks that closely matched the market’s performance; if the market was down, we wanted to pick stocks that minimized drawdown.

After pre-engineering a set of features to capture both company and stock dynamics, we first turned to a random forest model.  We chose this model as the decision tree structure would allow us to model conditional feature dynamics.  By focusing on generating a large number of shallow trees we aimed to avoid overfitting while still reducing overall model variance.

Training the model on data from 1999-2009, we found that the results strongly favored companies exhibiting positive operating cash flow, positive earnings, and low realized risk characteristics (e.g. volatility and beta).  Unfortunately, the model did not appear to provide any meaningful advantage versus a simple linear sort on volatility.

We then turned to applying gradient boosting to our random forest.  This approach builds trees in sequence such that each tree seeks to improve upon the last.  We hoped that such an approach would allow the random forest to build more nuance than simply scoring on realized volatility.

Unfortunately, the results remained largely the same.

Finally, we decided to change our definition of defensiveness by focusing on the depth and frequency of drawdowns with the Ulcer Index.  Again, after re-applying the gradient boosted random forest model, we found little difference in realized results versus a simple sort on volatility (especially out-of-sample).

One answer for these similar results may be that our objective function is highly correlated to volatility measures.  For example, if stocks follow a geometric Brownian motion process, those with higher levels of volatility should have deeper drawdowns.  And if the best predictor of future realized volatility is past realized volatility, then it is no huge surprise that the models ultimately fell back towards a naïve volatility sort.

Interestingly, value, quality, and growth characteristics seemed largely ignored.  We see two potential reasons for this.

The first possibility is that they were simply subsumed by low volatility with respect to our objective.  If this were the case, however, we would see little feature importance placed upon them, but would still expect their weighted average characteristic scores within our portfolios to be higher (or lower).  While this is true for select features (e.g. payout ratio), the importance of others appears largely cyclical (e.g. earnings-to-price).  In fact, during the fall out of the dot-com bubble, weighted average value scores remained between 40 and 70.

The second reason is that the fundamental drivers behind each market sell-off are different.  Factors tied to company metrics (e.g. valuation, quality, or growth), therefore, may be ill-suited to navigate different types of sell offs.  For example, value was the natural antithesis to the speculative dot-com bubble.  However, during the recent COVID-19 crisis, it has been the already richly priced technology stocks that have fared the best.  Factors based upon security characteristics (e.g. volatility, returns, or volume) may be better suited to dynamically adjust to market changes.

While our results were rather lackluster, we should acknowledge that we have really only scratched the surface of machine learning techniques.  Furthermore, our results are intrinsically linked to how we’ve defined our problem and the features we engineered.  A more thoughtful target score or a different set of features may lead to substantially different results.

 


 

Page 1 of 21

Powered by WordPress & Theme by Anders Norén