This post is available as a PDF download here.
Summary
- To generate returns that are different than the market, we must adopt a positioning that is different than the market.
- With the increasing adoption of systematic factor portfolios, we explore whether an anti-factor stance can generate contrarian-based profits.
- Specifically, we explore the idea of factor orphans: stocks that are not included in any factor portfolio at a given time.
- To identify these stocks, we replicate four popular factor indices: the S&P 500 Enhanced Value index, the S&P 500 Momentum index, the S&P 500 Low Volatility index, and the S&P 500 Quality index.
- On average, there are over 200 stocks in the S&P 500 that are orphaned at any given time.
- Generating an equal-weight portfolio of these stocks does not exhibit meaningfully different performance than a naïve equal-weight S&P 500 portfolio.
Contrarian investing is nothing new. Holding a variant perception to the market is often cited as a critical component to generating differentiated performance. The question in the details is, however, “contrarian to what?”
In the last decade, we’ve witnessed a dramatic rise in the popularity of systematically managed active strategies. These so-called “smart beta” portfolios seek to harvest documented risk premia and market anomalies and implement them with ruthless discipline.
But when massively adopted, do these strategies become the commonly-held view and therefore more efficiently priced into the market? Would this mean that the variant perception would actually be buying those securities totally ignored by these strategies?
This is by no means a new idea. Morningstar has long maintained its Unloved strategy that purchases the three equity categories that have witnessed the largest outflows at the end of the year. A few years ago, Vincent Deluard constructed a “DUMB” beta portfolio that included all the stocks shunned by popular factor ETFs. In the short out-of-sample period the performance of the strategy was tested, it largely kept pace with an equal-factor portfolio. More recently, a Bank of America research note claimed that a basket of most-hated securities – as defined by companies neglected by mutual funds and shorted by hedge funds hedge funds – had tripled the S&P 500’s return over the past year.
The approach certainly has an appealing narrative: as the crowd zigs to adopt smart beta, we zag. But has it worked?
To test this concept, we wanted to identify what we call “factor orphans”: those securities not held by any factor portfolio. Once identified, we can build a portfolio holding these stocks and track its performance over time.
As a quant, this idea strikes us as a little crazy. A stock not held in a value, momentum, low volatility, or quality index is likely one that is expensive, highly volatile, with poor fundamentals and declining performance. Precisely the type of stock factor investing would tell us not to own.
But perhaps the fact that these securities are orphaned means that there are no more sellers: the major cross-section of market strategies have already abandoned the stock. Thus, stepping in to buy them may allow us to offload them later when they are picked back up by these systematic approaches.
Perhaps this idea is crazy enough it just might work…
To test this idea, we first sought to replicate four common factor benchmarks: the S&P 500 Enhanced Value index, the S&P 500 Momentum index, the S&P 500 Low Volatility index and the S&P 500 Quality index. Once replicated, we can use the underlying baskets as being representative of the holdings for factor portfolios is general.
Results of our replication efforts are plotted below. We can see that our models fit the shape of most of the indices closely, with very close fits for the Momentum and Low Volatility portfolios.
Source: Sharadar. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
The Quality replication represents the largest deviation from the underlying index, but still approximates the shape of the total return profile rather closely. This gives us confidence that the portfolio we constructed is a quality portfolio (which should come as no surprise, as securities were selected based upon common quality metrics), but the failure to more closely replicate this index may represent a thorn in our ability to identify truly orphaned stocks.
At the end of each month, we identify the set of all securities held by any of the four portfolios. The securities in the S&P 500 (at that point in time) but not in the factor basket are the orphaned stocks. Somewhat surprisingly, we find that approximately 200 names are orphaned at any given time, with the number reaching as high as 300 during periods when underlying factors converge.
Also interesting is that the actual overlap in holdings in the factor portfolios is quite low, rarely exceeding 30%. This is likely due to the rather concentrated nature of the indices selected, which hold only 100 stocks at a given time.
Source: Sharadar. Calculations by Newfound Research.
Once our orphaned stocks are identified, we construct a portfolio that holds them in equal weight. We rebalance our portfolio monthly to sell those stocks that have been acquired by a factor portfolio and roll into those securities that have been abandoned.
We plot the results of our exercise below as well as an equally weighted S&P 500 benchmark.
Source: Sharadar. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
While the total return is modestly less (but certainly not statistically significantly so), what is most striking is how little deviation there is in the orphaned stock portfolio versus the equal-weight benchmark.
However, as we have demonstrated in the past, the construction choices in a portfolio can have a significant impact upon the realized results. As we look at the factor portfolios themselves, we must acknowledge that they represent relative tilts to the benchmark, and that the absence of one security might actually represent a significantly smaller relative underweight to the benchmark than the absence of another. Or the absence of one security may actually represent a smaller relative underweight than another that is actually included.
Therefore, as an alternative test we construct an equal-weight factor portfolio and subtract the S&P 500 market-capitalization weights. The result is the implied over- and under-weights of the combined factor portfolios. We then rank securities to select the 100 most under-weight securities each month and hold them in equal weight.
Source: Sharadar. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
Of course, we didn’t actually have to perform this exercise had we stepped back to think for a moment. We generally know that these (backtested) factors have out-performed the benchmark. Therefore, selecting stocks that they are underweight means we’re taking the opposite side of the factor trade, which we know has not worked.
Which does draw an important distinction between most underweight and orphaned. It would appear that factor orphans do not necessarily create the strong anti-factor tilt the way that the most underweight portfolio does.
For the sake of completion, we can also evaluate the portfolios containing securities held in just one of the factor portfolios, two of the factor portfolios, three of the factor portfolios, or all of the factor portfolios at a given time.
Below we plot the count of securities in such portfolios over time. We can see that it is very uncommon to identify securities that are simultaneously held by all the factors, or even three of the factors, at once.
Source: Sharadar. Calculations by Newfound Research.
Source: Sharadar. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
We can see that the portfolio built from stocks held in just one factor (“In One”) closely mimics the portfolio built from stocks held in no factor (“In Zero”), which in turn mimics the S&P 500 Equal Weight portfolio. This is likely because the portfolios include so many securities that they effectively bring you back to the index.
On the other end of the spectrum, we see the considerable risks of concentration manifest in the portfolios built from stocks held in three or four of the factors. The portfolio comprised of stocks held in all four factors simultaneously (“In Four”) not only goes long stretches of holding nothing at all, but is also subject to large bouts of volatility due to the extreme concentration.
We also see this for the portfolio that holds stocks held by three of the factors simultaneously (“In Three”). While this portfolio has modestly more diversification – and even appears to out-perform the equal-weight benchmark – the concentration risk finally materializes in 2018-2019, causing a dramatic drawdown.
The portfolio holding stocks held in just two of the factors (“In Two”), though, appears to offer some out-performance opportunity. Perhaps by forcing just two factors to agree, we strike a balance between confirmation among signals and portfolio diversification.
Unfortunately, our enthusiasm quickly wanes when we realize that this portfolio closely matches the results achieved just by naively equally-weighting exposure among the four factor portfolios themselves, which is far more easily implemented.
Source: Sharadar. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
Conclusion
To achieve differentiated results, we must take a differentiated stance from the market. As systematic factor portfolios are more broadly adopted, we should consider asking ourselves if taking an anti-factor stance might lead to contrarian-based profits.
In this study, we explore the idea of factor orphans: stocks not held by any factor portfolio at a given time. Our hypothesis is that these orphaned securities may be systematically over-sold, leading to an opportunity for future out-performance if they are re-acquired by the factor portfolios at a later date.
We begin by replicating four factor indices: the S&P 500 Enhanced Value index, the S&P 500 Momentum index, the S&P 500 Low Volatility index, and the S&P 500 Quality index. Replicating these processes allows us to identify historical portfolio holdings, which in turn allows us to identify stocks not held by the factors.
We are able to closely replicate the S&P 500 Momentum and Low Volatility portfolios, create meaningful overlap with the S&P 500 Enhanced Value method, and generally capture the S&P 500 Quality index. The failure to more closely replicate the S&P 500 Quality index may have a meaningful impact on the results herein, though we believe our methodology still captures the generic return of a quality strategy.
We find that, on average, there are over 200 factor orphans at a given time. Constructing an equal-weight portfolio of these orphans, however, only seems to lead us back to an S&P 500 Equal Weight benchmark. While there does not appear to be an edge in this strategy, it is interesting that there does not appear to be a negative edge either.
Recognizing that long-only factor portfolios represent active bets expressed as over- and underweights relative to the S&P 500, we also construct a portfolio of the most underweight stocks. Not surprisingly, as this portfolio actively captures a negative factor tilt, the strategy meaningfully underperforms the S&P 500 Equal Weight benchmark. Though the relative underperformance meaningfully dissipates in recent years.
Finally, we develop portfolios to capture stocks held in just one, two, three, or all four of the factors simultaneously. We find the portfolios comprised stocks held in either three or four of the factors at once exhibit significant concentration risk. As with the orphan portfolio, the portfolio of stocks held by just one of the factors closely tracks the S&P 500 Equal Weight benchmark, suggesting that it might be over-diversified.
The portfolio holding stocks held by just two factors at a time appears to be the Goldilocks portfolio, with enough concentration to be differentiated from the benchmark but not so much as to create significant concentration risk.
Unfortunately, this portfolio also almost perfectly replicates a naïve equal-weight portfolio among the four factors, suggesting that the approach is likely a wasted effort.
In conclusion, we find no evidence that factor orphans have historically offered a meaningful excess return opportunity. Nor, however, do they appear to have been a drag on portfolio returns either. We should acknowledge, however, that the adoption of factor portfolios accelerated rapidly after the Great Financial Crisis, and that backtests may not capture current market dynamics. More recent event studies of orphaned stocks being added to factor portfolios may provide more insight into the current environment.
Global Growth-Trend Timing
By Steven Braun
On November 4, 2019
In Portfolio Construction, Trend, Weekly Commentary
This post is available as a PDF download here.
Summary
We apologize in advance, as this commentary will be fairly graph- and table-heavy.
We have written fairly extensively on the topic of factor-timing in the past, and much of the success has been proven to be both hard to implement and recreate out of sample.
One of the inherent pains of trend following is the existence of whipsaws, or more precisely, the misidentification of perceived market trends, which turn out to be more noise than signal. An article from Philosophical Economics proposed using several economic indicators to tune down the noise that might affect price-driven signals such as trend following. Generally, this strategy imposed an overlay that turned trend following “on” when the change in the economic indicators were negative year-over-year signaling a higher likelihood of recession, and conversely, adopted a buy-and-hold stance when the economic indicators were not flashing warning lights.
This strategy presents a certain appeal as leading economic indicators may, as their name implies, lead the market for some time until capital preservation is warranted. Switching to a trend-following approach may allow a strategy to continue to participate in market appreciation while it lasts. On the other hand, using economic confirmation as a filter may help a strategy avoid the whipsaw costs generated from noisy market dips while positive economic conditions persist.
In an effort to test such a strategy out-of-sample, we took the approach global, hoping to capture a broader cross-section of economic and market environments.
First, we will consider trend following with no timing using the economic indicators.1
Below we plot the equity curves for Australia, Germany, Italy, Japan, Singapore, the United Kingdom, and the United States, alongside a strategy that is long the market when the market is above the trailing twelve-month average (“12 Month average”) and steps to cash when the price is below it. The ratio between the two is also included to show the relative cumulative performance between the trend strategy and the respective market. An increasing ratio means that the trend following strategy is adding value over buy-and-hold.
Source: MSCI, Global Financial Data. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
Through the graphs above, it becomes clear that much of the trend premium is realized by avoiding the large, prolonged bear markets that tend to occur during economic distress. In between these periods, however, the trend strategy lags the market. It makes sense, then, that a potential improvement to this strategy would be to implement an augmentation that could better distinguish between real price break-outs and those that lead to a whipsaw in the portfolio.
Growth-Trend Timing
For each country, we look at a number of economic indicators, including: corporate earnings growth, employment, housing starts, industrial production, and retail sales growth.2 The strategy then followed the same rules as described above: if the economic indicator in question displays a negative percentage change over the previous twelve-month period, a position is taken in a trend following strategy utilizing a twelve-month moving average signal. Otherwise, a buy-and-hold position is established.
To ensure that we are not benefitting from look-ahead bias, a lag of three months was imposed on each of the economic indicators, as it would be unrealistic to assume that the economic levels would be known at the end of each month.
Unfortunately, some of the economic data points could not be found for the entire period in which prices are available, though the analysis can still prove beneficial by indicating what economic regimes trend following is benefitted by growth-trend timing, or the potential identification where one indicator may work when another does not.3
In the charts below, we plot the growth-trend timing (referred to as GTT for the remainder of this commentary) for each country utilizing the available signals. The charts represent the relative cumulative performance over the respective country’s market return. For example, when the lines remain flat, the GTT approach has adopted buy-and-hold exposure and therefore matches the respective market’s returns. Any changes in the ratios are due to the GTT strategy investing in the trend following strategy.
Source: MSCI, Global Financial Data, St. Louis Fed, Bloomberg. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
What we see from the above figures is a mixed bag of results.
The overlay of economic indicators was by far successful in the mitigation of whipsaw losses, as each country reaped the benefits of being primarily long the market during bull markets. As the 12-month moving average strategy tended to slowly give up a portion of the gains realized from severe market environments, the majority of the GTT strategies remained relatively stagnant until the next major correction.
There are some instances, however, where the indicator was late to the economic party. It is worth remembering that the market is, in theory, a forward-looking measure, and therefore sudden economic shocks may not be captured in economic data as quickly as it is in market returns. This created cases where the strategy either missed the chance to be out of the market during a correction or was sitting on the sidelines during the subsequent recoveries. Notably, the employment signal in Australia, Italy, Singapore, and the United Kingdom tended to be a poor leading indicator as the strategy tended to be invested longer in the bear markets than the trend strategy.
A Candidate for Ensembling
The implicit assumption in the analysis above is that the included indicators behave in similar ways. For example, by using a twelve-month lookback period for the indicators, we are assuming that each indicator will begin to trend in roughly the same way.
That may not be a particularly fair assumption. Whereas housing starts and retail sales are generally considered leading indicators, employment (unemployment) rates are normally categorized as lagging indicators. For this reason, it may be more beneficial to use a shorter lookback period so as to pick up on potential problems in the economy as they begin to present themselves. Further, some signals tend to be more erratic than others, suggesting that a meaningful lookback period for one indicator may not be meaningful for another. With no perfect reason to prefer one lookback over another, we might consider different lookback periods so as to diversify any specification risk that may exist within the strategy.
With the benefit of hindsight, we know that not all recessions occur for the same reasons, so being reliant on one signal that has worked in the past may not be as beneficial in the future. With this in mind, we should consider that all indicators hold some information as to the state of the economy since one indicator may be signaling the all-clear while another may be flashing warning lights.
For the same reason medical professionals take multiple readings to gain insight into the state of the body, we should also consider any available signals to ascertain the health of the economy.
To ensemble this strategy, we will vary the lookbacks from six to eighteen months, while holding the lag at three months, as well as combine the available economic signals for each country. For the sake of brevity, we will hold the trend-following strategy the same with a twelve-month moving average.
Remember, if the economic signal is negative, it does not mean that we are immediately out of the market: a negative economic signal simply moves the strategy into a trend-following approach. With 5 economic indicators and 13 lookback periods, we have 65 possible strategies for each country. As an example, if 40 of these 65 models were positive and 25 were negative, we would hold 62% in the market and 38% in the trend following strategy.
The resulting performance statistics can be seen in the table below.
Source: MSCI, Global Financial Data, St. Louis Fed, Bloomberg. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
From the table above, we see that there are, again, mixed results. One country that particularly stands out is Italy in that the sign on its return flipped to negative and the drawdown was actually deeper with GTT than with a simple buy-and-hold strategy.
Source: MSCI, Global Financial Data, St. Louis Fed, Bloomberg. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
Digging deeper, it appears that the GTT strategy for Italy was actually whipsawed by more than just trend-following. Housing start data for Italy was not readily available until December 2008, so Italy may have been at a relative disadvantage when compared against the other countries. Since the reliable data we could find begins at the end of 2008 and the majority of the whipsaw losses occur post-Great Financial Crisis, we can run the analysis again, but with housing start data being added in upon its availability.
Source: MSCI, Global Financial Data, St. Louis Fed, Bloomberg. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
Adding housing starts in as an indicator did not meaningfully alter the results over the period. One hypothesis is that the indicators included could not fully encapsulate the complex state of Italy’s economy over the period. Italy has weathered three technical recessions over the past decade, so this could be a regime where the market is looking to sources outside the country for indications of distress or where the economic indicator is not reflective of the pressures driving the market.
Source: MSCI, St. Louis Fed. Calculations by Newfound Research. Past performance is not an indicator of future results. Performance is backtested and hypothetical. Performance figures are gross of all fees, including, but not limited to, manager fees, transaction costs, and taxes. Performance assumes the reinvestment of all distributions.
Above, we can see several divergences between the market movement and changes in real GDP. Specifically, in the past decade, we see that the market reacted to information that didn’t materialize in the country’s real GDP. More likely, the market was reacting to regional financial distress driven by debt concerns.
The MSCI Italy index is currently composed of 24 constituents with multinational business operations. Additionally, the index maintains large concentrations in financials, utilities, and energy: 33%, 25%, and 14%, respectively.4 Because of this sector concentration, utilizing the economic indicators may overly focus on the economic health of Italy while ignoring external factors such as energy prices or broader financial distress that could be swaying the market needle.
A parallel explanation could be that the Eurozone is entangled enough that signals could be interfering with each other between countries. Further research could seek to disaggregate signals between the Eurozone and the member-countries, attempting to differentiate between zone, regional, and country signals to ascertain further meaning.
Additionally, economic indicators are influenced by both the private and public sector so this could represent a disconnect between public company health and private company health.
Conclusion
In this commentary, we sought to answer the question, “can we improve trend-following by drawing information from a country’s economy”. It intuitively makes sense that an investor would generally opt for remaining in the market unless there are systemic issues that may lead to market distress. A strategy that successfully differentiates between market choppiness and periods of potential recession would drastically mitigate any losses incurred from whipsaw, thereby capturing a majority of the equity premium as well as the trend premium.
We find that growth-trend timing has been relatively successful in countries such as the United States, Germany, and Japan. However, the country that is being analyzed should be considered in light of their specific circumstances.
Peeking under the hood of Italy, it becomes clear that market movements may be influenced by more than a country’s implicit economic health. In such a case, we should pause and ask ourselves whether a macroeconomic indicator is truly reflective of that country’s economy or if there are other market forces pulling the strings.