The Research Library of Newfound Research

Month: March 2018

Protect & Participate: Managing Drawdowns with Trend Following

This post is available as PDF download here.

Summary

  • Trend following is an investment strategy that buys assets exhibiting strong absolute performance and sells assets exhibiting negative absolute performance.
  • Despite its simplistic description, trend following has exhibited considerable empirical robustness as a strategy, having been found to work in equity indices, bonds, commodities, and currencies.
  • A particularly interesting feature about trend following is its potential ability to avoid significant losses. Evidence suggests that trend following approaches can be used as alternative risk management techniques.
  • However, if investors expect to fully participate with asset growth while receiving significant protection, they are likely to be disappointed.
  • Relative to other risk management techniques, even very simple trend following strategies have exhibited very attractive return profiles.

What is Trend Following?

At its core, trend following – also called “absolute” or “time-series” momentum – is a very basic investment thesis: investments exhibiting positive returns tend to keep exhibiting positive returns and those exhibiting negative returns tend to keep exhibiting negative returns.

While the approach may sound woefully simplistic, the empirical and academic evidence that supports it extends back nearly two centuries.  Lempérière, Deremble, Seager, Potters, and Bouchard (2014), for example, test trend following approaches on commodities, currencies, stock indices, and bonds going back to 1800 and find that “the existence of trends [is] one of the most statistically significant anomalies in financial markets.”[1]

While LDSPB (2014) may have one of the longest backtests to date, a variety of other authors have demonstrated the existence of trends, and the success of trend following, in a variety of environments and markets.  We won’t list them here, but for those interested, a more thorough history can be found in our own paper Two Centuries of Momentum.

The driving theory behind trend following is that investor (mis-)behavior causes the emergence of trends.  When new information enters the market, investors underreact due to an anchoring bias that causes them to overweight prior information.  As price begins to drift towards fair value, herding takes over and causes investors to overreact.  This under and subsequent over-reaction is what causes a trend to emerge.

While somewhat contradictory to the notion that investors should not “chase performance” or “time markets,” evidence suggests that when systematically applied, trend following approaches can create a potentially significant return premium and potentially help investors avoid significant losses.

The Basic Trend Following Setup

In our experience, the two most popular methods of implementing a trend following signal are (1) a simple moving average cross-over system and (2) a measure of trailing total return.

In a simple moving average system cross-over system, when price is above the simple moving average, the system stays invested.  When price falls below, the strategy divests (usually into a risk-free asset, like U.S. Treasury Bills).  This sort of “in-or-out” system is often called “long/flat.”  For example, below we show a 12-month simple moving average and highlight when the system would buy and sell based upon when price crosses over.

The second form of trend following is more commonly referred to as “time-series momentum.”  In this approach, prior realized returns are calculated and the signal is generated depending upon whether returns were positive or negative.  For example, a popular academic approach is to use a “12-1” model, which takes the prior 12-month returns and subtracts the most recent month’s return (to avoid short-term mean reversion effects).  If this value is positive, the system invests and if the value is negative, it divests.

By looking at the example graphs, we can see that while these systems are similar, they are not exactly equal.  Nor are they the only way trend following approaches are implemented by practitioners.  What is important here is not the specific methodology, but that these methodologies attempt to capture the same underlying dynamics.

Empirical Evidence: Trend Following in a Crisis

To explore how a simple 12-1 time-series momentum system has worked in the past, we will apply the process to a broad U.S. equity index.  At the end of each month, we will calculate the trend following signal.  If the signal is positive, we will remain invested in the index (i.e. we are “long”).  If the signal is negative, we will divest into U.S. Treasury Bills (i.e. we are “flat”).

To explore the potential risk management capabilities of trend following, we will define a “crisis” as any period over which the broad U.S. equity market suffers a drawdown exceeding 25% from a recent market high.  We will then measure the maximum peak-to-trough drawdown of U.S. equities over the period and compare it to the maximum peak-to-trough drawdown of the 12-1 time series momentum strategy.

Since the early 1900s, we identify eight such scenarios.

Source: Kenneth French Data Library.  Calculations by Newfound Research.  Past performance is not indicative of future returns.  All performance is hypothetical and backtested.  Performance assumes the reinvestment of all distributions.  Returns are gross of all fees, including management fees, transaction costs, and taxes.

A few important takeaways:

  • Trend following is not a risk panacea. Even with trend following applied, drawdowns in excess of 15% occurred in each of these cases.  This is the cost of market participation, which will address a bit later.
  • Trend following did not limit losses in all cases. The market sell-off in October 1987 was so rapid that there was not sufficient time for trends to emerge and the system to be able to exit.  When trend following ends up protecting from quick sell-offs, it is more likely a function of luck than skill.
  • In many cases, trend following did help cut losses significantly. In the bear markets of the 1970s and 2000s, trend following helped reduce realized losses by over 50%.

Of course, the experience of these losses is very different than the summary numbers.  Below we plot the actual returns of equities versus a trend following overlay for several of the scenarios.

 

Source: Kenneth French Data Library.  Calculations by Newfound Research.  Past performance is not indicative of future returns.  All performance is hypothetical and backtested.  Performance assumes the reinvestment of all distributions.  Returns are gross of all fees, including management fees, transaction costs, and taxes.

We can see that the in many cases, when the trend following system got out, the market subsequently rallied, meaning that a trend follower would have a larger drawdown.  For example, in the Great Depression after the trend following system divested into U.S. Treasury Bills, the equity market rallied significantly.  This left the trend follower with a realized loss of -32% while a buy-and-hold investor would only be down -19%.

It is only with the benefit of hindsight that we can see that markets continued to fall and the patient trend follower was rewarded.

Ex-Ante Expectations About Participation

Of course, protecting capital is only half of the equation.  If we only cared about capital preservation, we could invest in short-term inflation-protected Treasuries and, barring a default by the U.S. government, sleep very well at night.

Before we demonstrate any empirical evidence about trend following’s ability to participate in growth, we want to use one of our favorite exercises – a coin flip game – to help establish reasonable expectations.

Imagine that we approach you with the offer to play a game.  We are going to flip a coin and you are going to try to guess how it lands.  If the coin lands on heads and you guess heads, the game is a push.  If it lands on tails and you guess tails, we give you $1.  If you guess wrong, you give us $1.

Does this sound like a game you would want to play?  Our guess is “no.”

Yet when we talk to many investors about their expectations for trend following strategies, this is the game they have created by choosing the U.S. equity market as a benchmark.

Consider the four scenarios that can happen:

  • The market goes up and trend following participates.
  • The market goes down and trend following goes down.
  • The market goes up and trend following is in cash.
  • The market goes down and trend following is in cash.

In the first scenario, even though trend following got the call right, we created a mental “push.”  In the middle two scenarios, trend following was incorrect and either participates on the downside or fails to participate on the upside (i.e. we “lose”).  It is only in the last scenario that trend following adds value.

In other words, by choosing U.S. equities as our benchmark for a long/flat trend following strategy, the strategy can only add value when the market is going down.  If we believe that the market will go up over the long run, that leaves very few scenarios for trend following to add value and plenty of scenarios for it to be a detractor.

Which is, unsurprisingly, exactly what you see if you plot the growth of a buy-and-hold investor versus a time-series momentum strategy: success in periods of significant market drawdown and relative underperformance in other periods.

Source: Kenneth French Data Library.  Calculations by Newfound Research.  Past performance is not indicative of future returns.  All performance is hypothetical and backtested.  Performance assumes the reinvestment of all distributions.  Returns are gross of all fees, including management fees, transaction costs, and taxes.

We can see, for example, that the trend following strategy lost its entire lead to the buy-and-hold investor from 1942 to 1962.  That is a frustratingly long period of underperformance for any investor to weather.

Determining the appropriate benchmark, however, is often a matter of preference.  We believe the appropriate way to address the problem is by asking whether trend following materially outperforms U.S. equities on a risk-adjusted basis.

To answer this question, we calculate the strategy’s full-period sensitivity to the U.S. equity index (i.e. its “beta”) and then re-create a new index that is comprised of a mixture U.S. equities and U.S. Treasury Bills that shares the same beta.  In this case, that index is 50% U.S. equities and 50% U.S. Treasury Bills.

Source: Kenneth French Data Library.  Calculations by Newfound Research.  Past performance is not indicative of future returns.  All performance is hypothetical and backtested.  Performance assumes the reinvestment of all distributions.  Returns are gross of all fees, including management fees, transaction costs, and taxes.

We can see that compared to a risk-adjusted benchmark, trend following exhibits a significant return premium without necessarily materializing significant excess downside risk.

Our take away from this is simple: investors who expect long/flat trend following strategies to keep up with equities are sure to be disappointed eventually.  However, if we use a benchmark that allows both “in” and “flat” decisions to add value (e.g. a 50% U.S. equity index + 50% U.S. Treasury Bill portfolio), trend following has historically added significant value.

One interpretation may be that trend following may be best suited as a “risk pivot” within the portfolio, rather than as an outright replacement for U.S. equity.  For example, if an investor has a 60% equity and 40% bond portfolio, rather than replacing equity with a trend strategy, the investor could replace a mix of both stocks and bonds.  By taking 10% from stocks and 10% from bonds to give to the trend allocation, the portfolio now has the ability to pivot between a 70/30 and a 50/50.  You can read more about this idea in our whitepaper Achieving Risk Ignition.

Another potential interpretation of this data is that long/flat trend following is a risk management technique and should be compared in light of alternative means of managing risk.

Pre-2008 versus Post-2008 Experience

Unfortunately, many investors have had their expectations for long/flat trend following strategies set by the period leading up to the 2008 financial crisis as well as the crisis itself, only to find themselves disappointed by subsequent performance.

Several years of whipsaws (including 2011, 2015 and 2016) leading to relative underperformance have caused many to ask, “is trend following broken?”

When we evaluate the data, however, we see that it is not the post-2008 period that is unique, but rather the pre-2008 period.

In fact, the pre-2008 period is unique in how calm a market environment it was, with drawdowns rarely eclipsing 10%.  While the post-2008 period has had its calm years (e.g. 2013 and 2017), it has also been punctuated by periods of volatility.  We can see the difference by plotting the drawdowns over the two periods.

Source: Kenneth French Data Library.  Calculations by Newfound Research. 

The unfortunate reality is that the calm period of pre-2008 and the strong performance of trend following in 2008 gave investors the false confidence that trend following had the ability to nearly fully participate on the upside and protect almost entirely on the downside.

Unfortunately, this simply is not true.  As we have said many times in the past, “risk cannot be destroyed, only transformed.”  While trend following tends to do well in environments where trends persist, it does poorly in those periods that exhibit sharp and sudden price reversals.

However, if we compare our trend following system against the more appropriate long-term risk-adjusted benchmark, we still see a significant return premium earned.

Source: Kenneth French Data Library.  Calculations by Newfound Research.  Past performance is not indicative of future returns.  All performance is hypothetical and backtested.  Performance assumes the reinvestment of all distributions.  Returns are gross of all fees, including management fees, transaction costs, and taxes.

One question we may ask ourselves is, “if we are using trend following to manage risk, how did other risk management techniques perform over the same period?”

Annualized Return
(2009 – 2017)
Annualized Volatility
(2009 – 2017)
Maximum Drawdown
(2007 – 2009)
S&P 50014.4%12.0%-52.3%
12-1 TS Momentum11.7%12.3%-10.9%
80/2012.3%9.4%-42.5%
60/4010.1%6.9%-32.0%
CBOE S&P 500 5% Put Protection Index10.2%10.1%-36.6%
Salient Trend Index (Managed Futures)1.2%10.3%-14.3%
Salient Risk Parity Index6.6%8.7%-30.8%
HFRX Global Hedge Fund Index1.5%4.0%-23.4%

Source: Kenneth French Data Library, CSI, Salient, HFRI, CBOE.  Calculations by Newfound Research.  Past performance is not indicative of future returns.  Performance assumes the reinvestment of all distributions.  Returns are gross of all fees, including management fees, transaction costs, and taxes.  60/40 and 80/20 portfolios are mixtures of the SPDR S&P 500 ETF (“SPY”) and iShares Core U.S. Bond ETF (“AGG”) in 60%/40% and 80%/20% proportional allocations, rebalanced annually.

We can see that while trend following has failed to keep up with U.S. equities in the post-crisis period (again, we would expect this), it has kept up much better than other potential risk management alternatives while providing significantly more protection during the crisis period.

Another important takeaway is that during the post crisis period, the trend following strategy had the highest volatility of any of the strategies measured.  In other words, while we might be able to rely on trend following for crisis risk management (i.e. avoiding the large left tail of returns), it is not necessarily going to reduce volatility during a bull market.

Conclusion

As an investment strategy, trend following has a long history of academic and empirical support.  Evidence suggests that trend following can be an effective means of avoiding large negative returns that coincide with traditional bear markets.

However, trend following is not a panacea.  In line with our philosophy that “risk cannot be destroyed, only transformed,” the risk management benefit often seen in trend following strategies comes with higher risks in other environments (i.e. “whipsaw”).

Investors who have relied upon the realized participation of trend following strategies during the pre-crisis period (2003-2007), as well as the protection afforded during the 2008 crisis itself, may have unrealistic expectations for forward performance.  Simply put: long/flat trend following strategies are very likely to underperform the underlying asset during strong bull markets.  In this case, replacing traditional equity exposure with a long/flat trend following strategy will likely lead to long-term underperformance.

However, when compared against other means of risk management, trend following has historically exhibited considerable downside protection for the upside participation it has realized.  Compared to a risk-adjusted benchmark, a long/flat U.S. equity trend following strategy exhibits an annualized excess return of 2.89%.

For investors looking to diversify how they manage risk, we believe the trend following represents a high transparent, and historically effective, alternative.

 


 

[1] https://arxiv.org/pdf/1404.3274.pdf

Two Centuries of Momentum

This post is available as a PDF download here.

A momentum-based investing approach can be confusing to investors who are often told that “chasing performance” is a massive mistake and “timing the market” is impossible.

Yet as a systematized strategy, momentum sits upon nearly a quarter century of positive academic evidence and a century of successful empirical results.

Our firm, Newfound Research, was founded in August 2008 to offer research derived from our volatility-adjusted momentum models.  Today, we provide tactically risk-managed investment portfolios using those same models.

Momentum, and particularly time-series momentum, has been in our DNA since day one.

In this Foundational Series piece, we want to explore momentum’s rich history and the academic evidence demonstrating its robustness across asset classes, geographies, and market cycles.

1. What is momentum?

Momentum is a system of investing that buys and sells based upon recent returns.  Momentum investors buy outperforming securities and avoid – or sell short – underperforming ones.

The notion is closely tied to physics.  In physics, momentum is the product of the mass and velocity of an object.  For example, a heavy truck moving at a high speed has large momentum.  To stop the truck, we must apply either a large or a prolonged force against it.

Momentum investors apply a similar notion.  They assume outperforming securities will continue to outperform in absence of significant headwinds.

 

2. The Two Faces & Many Names of Momentum

2.1 Relative Momentum

The phenomenon of relative momentum is also called cross-sectional momentum and relative strength.

Relative momentum investors compare securities against each other’s performance.  They favor buying outperforming securities and avoiding – or short-selling – underperforming securities.

Long-only relative momentum investors rotate between a subset of holdings within their investable universe. For example, a simple long-only relative strength system example is “best N of.”  At rebalance, this system sells its current holdings and buys the top N performing securities of a basket. In doing so, the strategy seeks to align the portfolio with the best performing securities in hopes they continue to outperform.

2.2 Absolute Momentum

Absolute momentum is also referred to as time-series momentum or trend following.

Absolute momentum investors compare a security against its own historical performance.  The system buys positive returning securities and avoids, or sells short, negative returning securities.

The primary difference is that relative momentum makes no distinction about return direction. If all securities are losing value, relative momentum will seek to invest in those assets that are going down least. Absolute momentum will seek to avoid negative returning assets.

 

3. A Brief History of Momentum

3.1 Early Practitioners

Momentum is one of Wall Street’s oldest investment strategies.

In 1838, James Grant published The Great Metroplis, Volume 2. Within, he spoke of David Ricardo, an English political economist who was active in the London markets in the late 1700s and early 1800s. Ricardo amassed a large fortune trading both bonds and stocks.

According to Grant, Ricardo’s success was attributed to three golden rules:

As I have mentioned the name of Mr. Ricardo, I may observe that he amassed his immense fortune by a scrupulous attention to what he called his own three golden rules, the observance of which he used to press on his private friends. These were, “Never refuse an option* when you can get it,”—”Cut short your losses,”—”Let your profits run on.” By cutting short one’s losses, Mr. Ricardo meant that when a member had made a purchase of stock, and prices were falling, he ought to resell immediately. And by letting one’s profits run on he meant, that when a member possessed stock, and prices were raising, he ought not to sell until prices had reached their highest, and were beginning again to fall. These are, indeed, golden rules, and may be applied with advantage to innumerable other transactions than those connected with the Stock Exchange.

The rules “cut short your losses” and “let your profits run on” are foundational philosophies of momentum.

Following in Ricardo’s footsteps are some of Wall Street’s greatest legends who implemented momentum and trend-following techniques.

Charles H. Dow (1851 – 1902) was the founder and first editor of the Wall Street Journal as well as the co-founder of Dow Jones and Company. In his Wall Street Journal column, he published his market trend analysis, which eventually developed into a body of research called Dow theory. Dow theory primarily focuses on the identification of trends as being the key signal for investing.

Jesse Livermore (1877 – 1940) was a stock market speculator in the early 1900s who famously made – and subsequently lost – two massive fortunes during the market panic of 1907 and crash of 1929.  He is attributed (by Edwin Lefèvre, in Reminiscences of a Stock Operator) to saying,

[T]he big money was not in the individual fluctuations but in the main movements … sizing up the entire market and its trend.

Livermore claimed that his lack of adherence to his own rules was the main reason he lost his wealth.

In the same era of Livermore, Richard Wyckoff (1873 – 1934) noted that stocks tended to trend together. Thus he focused on entering long positions only when the broad market was trending up.  When the market was in decline, he focused on shorting.  He also emphasized the placement of stop-losses to help control risk.

He was personally so successful with his techniques, he eventually owned nine and a half acres in the Hamptons.

Starting in the 1930s, George Chestnutt successfully ran the American Investors Fund for nearly 30 years using relative strength techniques. He also published market letters with stock and industry group rankings based on his methods.  He wrote,

[I]t is better to buy the leaders and leave the laggards alone. In the market, as in many other phases of life, ‘the strong get stronger, and the weak get weaker.’

In the late 1940s and early 1950s, Richard Donchian developed a rules based technical system that became the foundation for his firm Futures, Inc.  Futures, Inc. was one of the first publicly held commodity funds.  The investment philosophy was based upon Donchian’s belief that commodity prices moved in long, sweeping bull and bear markets.  Using moving averages, Donchian built one of the first systematic trend-following methods, earning him the title of the father of trend-following.

In the late 1950s, Nicholas Darvas (1920 – 1977), trained economist and touring dancer, invented “BOX theory.”  He modeled stock prices as a series of boxes.  If a stock price remained in a box, he waited.  As a stock price broke out of a box to new highs, he bought and placed a tight stop loss.  He is quoted as saying, 

I keep out in a bear market and leave such exceptional stocks to those who don’t mind risking their money against the market trend.

Also during the 1950s and 1960s was Jack Dreyfus, who Barron’s named the second most significant money manager of the last century. From 1953 to 1964, his Dreyfus Fund returned 604% compared to 346% for the Dow index. Studies performed by William O’Neil showed that Dreyfus tended to buy stocks making new 52-week highs. It wouldn’t be until 2004 that academic studies would confirm this method of investing.

Richard Driehaus took the momentum torch during the 1980s. In his interview in Jack Schwager’s The New Market Wizards, he said he believed that money was made buying high and selling higher.

That means buying stocks that have already had good moves and have high relative strength – that is, stocks in demand by other investors. I would much rather invest in a stock that’s increasing in price and take the risk that it may begin to decline than invest in a stock that’s already in a decline and try to guess when it will turn around.

3.2 Earliest Academic Studies

In 1933, Alfred Cowles III and Herbert Jones released a research paper titled Some A Posteriori Probabilities in Stock Market Action. Within it they specifically focused on “inertia” at the “microscopic” – or stock – level.

They focused on counting the ratio of sequences – times when positive returns were followed by positive returns, or negative returns were followed by negative returns – to reversals – times when positive returns were followed by negative returns, and vice versa.

Their results:

It was found that, for every series with intervals between observations of from 20 minutes up to and including 3 years, the sequences out-numbered the reversals. For example, in the case of the monthly series from 1835 to 1935, a total of 1200 observations, there were 748 sequences and 450 reversals. That is, the probability appeared to be .625 that, if the market had risen in a given month, it would rise in the succeeding month, or, if it had fallen, that it would continue to decline for another month. The standard deviation for such a long series constructed by random penny tossing would be 17.3; therefore the deviation of 149 from the expected value of 599 is in excess of eight times the standard deviation. The probability of obtaining such a result in a penny-tossing series is infinitesimal.

Despite the success of their research on the statistical significance of sequences, the next academic study on momentum was not released for 30 years.

In 1967, Robert Levy published Relative Strength as a Criterion for Investment Selection. Levy found that there was “good correlation between past performance groups and future … performance groups” over 26-week periods. He states:

[…] the [26-week] average ranks and ratios clearly support the concept of continuation of relative strength. The stocks which historically were among the 10 per cent strongest (lowest ranked) appreciated in price by an average of 9.6 per cent over a 26-week future period. On the other hand, the stocks which historically were among the 10 per cent weakest (highest ranked) appreciated in price an average of only 2.9 per cent over a 26-week future period.

Unfortunately, the scope of the study was limited. The period used in the analysis was only from 1960 to 1965. Thus, of the 26-week periods tested, only 8 were independent. In Levy’s words, “the results were extensively intercorrelated; and the use of standard statistical measures becomes suspect.” Therefore, Levy omitted these statistics.

Despite its promise, momentum research went dark for the next 25 years.

4. The Dark Days of Momentum Research

Despite the success of practitioners and promising results of early studies, momentum would go largely ignored by academics until the 1990s.

Exactly why is unknown, but we have a theory: fundamental investing, modern portfolio theory, and the efficient market hypothesis.

4.1 The Rise of Fundamental Investing

In 1934, Benjamin Graham and David Dodd published Security Analysis. Later, in 1949, they published The Intelligent Investor. In these tomes, they outline their methods for successful investing.

For Graham and Dodd, a purchase of stock was a purchase of partial ownership of a business. Therefore, it was important that investors evaluate the financial state of the underlying business they were buying.

They also defined a strong delineation between investing and speculating. To quote,

An investment operation is one which, upon thorough analysis, promises safety of principal and an adequate return. Operations not meeting these requirements are speculative.

Speculative was a pejorative term. Even the title of The Intelligent Investor implied that any investors not performing security analysis were not intelligent.

The intelligent investor began her process by computing a firm’s intrinsic value. In other words, “what is the business truly worth?” This value was either objectively right or wrong based on the investor’s analysis. Whether the market agreed or not was irrelevant.

Once an intrinsic value was determined, Graham and Dodd advocated investors buy with a margin of safety. This meant waiting for the market to offer stock prices at a deep discount to intrinsic value.

These methods of analysis became the foundation of value investing.

To disciples of Graham and Dodd, momentum is speculative nonsense. To quote Warren Buffett in The Superinvestors of Graham-and-Doddsville:

I always find it extraordinary that so many studies are made of price and volume behavior, the stuff of chartists. Can you imagine buying an entire business simply because the price of the business had been marked up substantially last week and the week before?

4.2 Modern Portfolio Theory and the Efficient Market Hypothesis

In his 1952 article “Portfolio Selection,” Harry Markowitz outlined the foundations of Modern Portfolio Theory (MPT). The biggest breakthrough of MPT was that it provided a mathematical formulation for diversification.

While the concept of diversification has existed since pre-Biblical eras, it had never before been quantified. With MPT, practitioners could now derive portfolios that optimally balanced risk and reward. For example, by combining assets together, Markowitz created the efficient frontier: those combinations for which there is the lowest risk for a given level of expected return.

By introducing a risk-free asset, the expected return of any portfolio constructed can be linearly changed by varying the allocation to the risk-free asset. In a graph like the one on the left, this can be visualized by constructing a line that passes through the risk-free asset and the risky portfolio (called a Capital Allocation Line or CAL). The CAL that is tangent to the efficient frontier is called the capital market line (CML). The point of tangency along the efficient frontier is the portfolio with the highest Sharpe ratio (excess expected return divided by volatility).

According to MPT, in which all investors seek to maximize their Sharpe ratio, an investor should only hold a mixture of this portfolio and the risk free asset. Increasing the allocation to the risk-free asset decreases risk while introducing leverage increases risk.

The fact that any investor should only hold one portfolio has a very important implication: given all the assets available in the market, all investors should hold, in equal relative proportion, the same portfolio of global asset classes. Additionally, if all investors are holding the same mix of assets, in market equilibrium, the prices of asset classes – and therefore their expected returns – must adjust such that the allocation ratios of the assets in the tangency portfolio will match the ratio in which risky assets are supplied to the market.

Holding anything but a combination of the tangency portfolio and the risk-free asset is considered sub-optimal.

From this foundation, concepts for the Capital Asset Pricing Model (CAPM) are derived. CAPM was introduced independently by Jack Treynor, William Sharpe, John Lintner, and Jan Mossin from 1961-1966.

CAPM defines a “single-factor model” for pricing securities. The expected return of a security is defined in relation to a risk-free rate, the security’s “systematic” risk (sensitivity to the tangency portfolio), and the expected market return. All other potentially influencing factors are considered to be superfluous.

While its origins trace back to the 1800s, the efficient market hypothesis (EMH) was officially developed by Eugene Fama in his 1962 Ph.D. thesis.

EMH states that stock prices reflect all known and relevant information and always trade at fair value. If stocks could not trade above or below fair value, investors would never be able to buy them at discounts or sell them at premiums. Therefore, “beating the market” on a risk-adjusted basis is impossible.

Technically, MPT and EMH are independent theories. MPT tells us we want to behave optimally, and gives us a framework to do so. EMH tells us that even optimal behavior will not generate any return in excess of returns predicted by asset pricing models like CAPM.

Markowitz, Fama, and Sharpe all went on to win Nobel prizes for their work.

4.3 Growing Skepticism Towards Technical Analysis

Technical analysis is a category of investing methods that use past market data – primarily price and volume – to make forward forecasts.

As a category, technical analysis is quite broad. Some technicians look for defined patterns in price charts. Others look for lines of support or resistance. A variety of indicators may be calculated and used. Some technicians follow specific techniques – like Dow theory or Elliot Wave theory.

Unfortunately, the broad nature of technical analysis makes it difficult to evaluate academically. Methods vary widely and different technical analysts can make contradictory predictions using the same data.

Thus, during the rise of EMH through the 1960s and 1970s, technical analysis was largely dismissed by academics.

Since momentum relies only on past prices, and many practitioners used tools like moving averages to identify trends, it was categorized as a form of technical analysis.  As academics dismissed the field, momentum went overlooked.

4.4 But Value Research Went On

Despite CAPM, EMH, and growing skepticism towards technical analysis, academic research for fundamental investing continued. Focus was especially strong on value investing.

For example, in 1977, S. Basu authored a comprehensive study on value investing, titled Investment Performance of Common Stocks in Relation to their Price-Earnings Ratios: A Test of the Efficient Market Hypothesis. Within, Basu finds that the return relationship strictly increases for stocks sorted on their price-earnings ratio. Put more simply, cheap stocks outperform expensive ones.

Unfortunately, in many of these studies, the opposite of value was labeled growth or glamor. This became synonymous with high flying, over-priced stocks. Of course, not value is not the same as growth. And not value is certainly not the same as momentum. It is entirely possible that a stock can be in the middle of a positive trend, yet still be undervalued.  Nevertheless, it is easy to see how relatively outperforming and over-priced may be conflated.

It is possible that the success of value research in demonstrating the success of buying cheap stocks dampened the enthusiasm for momentum research.

5. The Return of Momentum

Fortunately, decades of value-based evidence against market efficiency finally piled up.

In February 1993, Eugene Fama and Kenneth French released Common Risk Factors in the Returns on Stocks and Bonds. Fama and French extended the single-factor model of CAPM into a three-factor model. Beyond the “market factor,” factors for “value” and “size” were added, acknowledging these distinct drivers of return.

Momentum was still nowhere to be found.

But a mere month later, Narasimhan Jegadeesh and Sheridan Titman published their seminal work on momentum, titled Returns to Buying Winners and Selling Losers: Implication for Stock Market Efficiency. Within they demonstrated:

Strategies which buy stocks that have performed well in the past and sell stocks that have performed poorly in the past generate significant positive returns over 3- to 12-month holding periods.

The results of the paper could not be explained by systematic risk or delayed reactions to other common factors, echoing the results of Cowles and Jones some 60 years prior.

In 1996, Fama and French authored Multifactor Explanations of Asset Pricing Anomalies. Armed with their new three-factor model, they explored whether recently discovered market phenomena – including Jegadeesh and Titman’s momentum – could be rationally explained away.

While most anomalies disappeared under scrutiny, the momentum results remained robust. In fact, in the paper Fama and French admitted that,

“[momentum is the] main embarrassment of the three-factor model.”

6. The Overwhelming Evidence for Momentum

With its rediscovery and robustness against prevailing rational pricing models, momentum research exploded over the next two decades. It was applied across asset classes, geographies, and time periods. In chronological order:

Asness, Liew, and Stevens (1997) shows that momentum investing is a profitable strategy for country indices.

Carhart (1997) finds that portfolios of mutual funds, constructed by sorting on trailing one-year returns, decrease in monthly excess return nearly monotonically, inline with momentum expectations.

Rouwenhorst (1998) demonstrates that stocks in international equity markets exhibit medium-term return continuations. The study covered stocks from Austria, Belgium, Denmark, France, Germany, Italy, the Netherlands, Norway, Spain, Sweden, Switzerland, and the United Kingdom.

LeBaron (1999) finds that a simple momentum model creates unusually large profits in foreign exchange series.

Moskowitz and Grinblatt (1999) finds evidence for a strong and persistent industry momentum effect.

Rouwenhorst (1999), in a study of 1700 firms across 20 countries, demonstrates that emerging market stocks exhibit momentum.

Liew and Vassalou (2000) shows that momentum returns are significantly positive in foreign developed countries but there is little evidence to explain them by economic developments.

Griffin, Ji, and Martin (2003) demonstrates momentum’s robustness, finding it to be large and statistically reliable in periods of both negative and positive economic growth. The study finds no evidence for macroeconomic or risk-based explanations to momentum returns.

Erb and Harvey (2006) shows evidence of success for momentum investing in commodity futures.

Gorton, Hayashi, and Rouwenhorst (2008) extends momentum research on commodities, confirming its existence in futures but also identifying its existence in spot prices.

Jostova, Niklova Philopov, and Stahel (2012) shows that momentum profits are significant for non-investment grade corporate bonds.

Luu and Yu (2012) identifies that for liquid fixed-income assets, such as government bonds, momentum strategies may provide a good risk-return trade-off and a hedge for credit exposure.

7. Academic Explanations for Momentum

While academia has accepted momentum as a distinct driver of return premia in many asset classes around the world, the root cause is still debated.

So far, the theory for rational markets has failed to account for momentum’s significant and robust returns.  It is not correlated with macroeconomic variables and does not seem to reflect exposure to other known risk factors.

But there are several hypotheses that might explain how irrational behavior may lead to momentum.

7.1 The Behavioral Thesis

The most commonly accepted argument for why momentum exists and persists comes from behavioral finance. Behavioral finance is a field that seeks to link psychological theory with economics and finance to explain irrational decisions.

Some of the popular behavioral finance explanations for momentum include:

Herding: Also known as the “bandwagon effect,” herding is the tendency for individuals to mimic the actions of a larger group.

Anchoring Bias: The tendency to rely too heavily on the first piece of information received.

Confirmation Bias: The tendency to ignore information contradictory to prior beliefs.

Disposition Effect: Investors tend to sell winners too early and hold on to losers too long. This occurs because investors like to realize their gains but not their losses, hoping to “make back” what has been lost.

Together, these biases cause investors to either under- or over-react to information, causing pricing inefficiencies and irrational behavior.

7.1.1 Cumulative Advantage & Momentum Beyond Markets

There is strong evidence for momentum being a behavioral and social phenomenon beyond stock markets.

Matthew Salganik, Peter Dodds, and Duncan Watts ran a 14,000 participant, web-based study designed to establish independence of taste and preference in music.

Participants were asked to explore, listen to, and rate music.  One group of participants would be able to see how many times a song was downloaded and how other participants rated it; the other group would not be able to see downloads or ratings.  The group that could see the number of downloads (“social influence”) was then sub-divided into 8 distinct, random groups where members of each sub-group could only see the download and ratings statistics of their sub-group peers.

The hypothesis of the study was that “good music” should garner the same amount of market share regardless of the existence of social influence: hits should be hits.  Secondly, the same hits should be hits across all independent social influence groups.

What the study found was dramatically different.  Each social-influence group had its own hit songs, and those songs commanded a much larger market share of downloads than songs did in the socially-independent group.

Introducing social-influence did two things: it made hits bigger and it made hits more unpredictable.  The authors called this effect “cumulative advantage.”  The consequences are profound.  To quote an article in the New York Times by Watts,

It’s a simple result to state, but it has a surprisingly deep consequence. Because the long-run success of a song depends so sensitively on the decisions of a few early-arriving individuals, whose choices are subsequently amplified and eventually locked in by the cumulative-advantage process, and because the particular individuals who play this important role are chosen randomly and may make different decisions from one moment to the next, the resulting unpredictability is inherent to the nature of the market. It cannot be eliminated either by accumulating more information — about people or songs — or by developing fancier prediction algorithms, any more than you can repeatedly roll sixes no matter how carefully you try to throw the die.

7.2 The Limits to Arbitrage Thesis

EMH assumes that any mis-pricing in public markets will be immediately arbitraged away by rational market participants. The limits to arbitrage theory recognizes that there are often restrictions – both regulatory and capital based – that may limit rational traders from fully arbitraging away these price inefficiencies.

In support of this thesis is Chabot, Ghysels, and Jagannathan (2009), which finds that when arbitrage capital is in short supply, momentum cycles last longer.

Similarly, those investors bringing good news to the market may lack the capital to take full advantage of that information. So if there has been good news in the past, there may be good news not yet incorporated into the price.

7.3 The Rational Inattention Thesis

Humans possess a finite capacity to process the large amounts of information they are confronted with. Time is a scarce resource for decision makers.

The rational inattention theory argues that some information may be evaluated less carefully, or even outright ignored. Or, alternatively, it may be optimal for investors to obtain news with limited frequency or limited accuracy. This can cause investors to over- or under-invest and could cause the persistence of trends.

Chen and Yu (2014) found that portfolios constructed from stocks “more likely to grab attention” based on visual patterns induces investor over-reaction. They provide evidence that momentum continuation is induced by visually-based psychological biases.

8. Advances in Cross-Sectional Research

Much like there are many ways to identify value, there are many ways to identify momentum. Recent research has identified methods that may improve upon traditional total return momentum.

52-Week Highs: Hwang and George (2004) shows that nearness to a 52-week high price dominates and improves upon the forecasting power of past returns (i.e. momentum). Perhaps most interestingly, future returns forecast using a 52-week high do not mean-revert in the long run, like traditional momentum.

Liu, Liu, and Ma (2010) tests the 52-week high strategy in 20 international markets and finds that it is profitable in 18 and significant in 10.

Residual Momentum: Using a universe of domestic equities, covering the period of January 1926 to December 2009, Blitz, Huij, and Martens (2009) decomposes stock returns using the Fama-French three-factor model. Returns unexplained by the market, value, and size factors are considered to be residual. The study finds that momentum strategies built from residual returns exhibit risk-adjusted profits that are twice as large as those associated with total return momentum.

Idiosyncratic Momentum: Similar to Blitz, Huij, and Martens, Chaves (2012) uses the CAPM model to correct stocks for market returns and identify idiosyncratic returns. Idiosyncratic momentum is found to work better than momentum in a sample of 21 developed countries. Perhaps most importantly, idiosyncratic momentum is successful in Japan, where most traditional momentum strategies have failed.

9. Using Momentum to Manage Risk

While most research in the late 1990s and early 2000s focused on relative momentum, research after 2008 has been heavily focused on time-series momentum for its risk-mitigating and diversification properties.

Some of the earliest, most popular research was done by Faber (2006), in which a simple price-minus-moving-average approach was used to drive a portfolio of U.S. equities, foreign developed equities, commodities, U.S. REITs, and U.S. government bonds. The resulting portfolio demonstrates “equity-like returns with bond-like volatility.”

Hurst, Ooi, and Pedersen (2010) identifies that trend-following, or time-series momentum, is a significant component of returns for managed futures strategies. In doing so, the research demonstrates the consistency of trend-following approaches in generating returns in both bull and bear markets.

Going beyond managed futures specifically, Moskowitz, Ooi, Hua, and Pedersen (2011) documents significant time-series momentum in equity index, currency, commodity, and bond futures covering 58 liquid instruments over a 25-year period.

Perhaps some of the most conclusive evidence comes from Hurst, Ooi, Pedersen (2012), which explores time-series momentum going back to 1903 and through 2011.

The study constructs a portfolio of an equal-weight combination of 1-month, 3-month, and 12-month time-series momentum strategies for 59 markets across 4 major asset classes, including commodities, equity indices, and currency pairs. The approach is consistently profitable across decades. The research also shows that incorporating a time-series momentum approach into a traditional 60/40 stock/bond portfolio increases returns, reduces volatility, and reduces maximum drawdown.

Finally, Lempérière, Deremble, Seager, Potters, and Bouchard (2014) extends the tests even further, using both futures and spot prices to go back to 1800 for commodity and stock indices. It finds that excess returns driven by trend-following is both significant and stable across time and asset classes.

10. Evidence & Advances in Time-Series Momentum

While the evidence for time-series momentum was significantly advanced by the papers and teams cited above, there were other, more focused contributions throughout the years that helped establish it in more specific asset classes.

Wilcox and Crittenden (2005) demonstrates that buying stocks when they make new 52-week highs and selling after a prescribed stop-loss is broken materially outperforms the S&P 500 even after accounting for trading slippage.

ap Gwilym, Clare, Seaton, and Thomas (2009) explores whether trend-following can be used as an allocation tool for international equity markets. Similar to Faber (2006), it utilizes a 10-month price-minus-moving-average model. Such an approach delivers a similar compound annual growth rate to buy and hold, but with significantly lower volatility, increasing the Sharpe ratio from 0.41 to 0.75.

Szakmary, Shen, and Sharma (2010) explores trend-following strategies on commodity futures markets covering 48 years and 28 markets. After deducting reasonable transaction costs, it finds that both a dual moving-average-double-crossover strategy and a channel strategy yield significant profit over the full sample period.

Antonacci (2012) explores a global tactical asset allocation approach utilizing both relative and absolute momentum techniques in an approach called “dual momentum.” Dual momentum increases annualized return, reduces volatility, and reduces maximum drawdown for equities, high yield & credit bonds, equity & mortgage REITs, and gold & treasury bonds.

Dudler, Gmuer, and Malamud (2015) demonstrates that risk-adjusted time series momentum – returns normalized by volatility – outperforms time series momentum on a universe of 64 liquid futures contracts for almost all combinations of holdings and look-back periods.

Levine and Pedersen (2015) uses smoothed past prices and smoothed current prices in their calculation of time-series momentum to reduce random noise in data that might occur from focusing on a single past or current price.

Clare, Seaton, Smith and Thomas (2014) finds that trend following “is observed to be a very effective strategy over the study period delivering superior risk-adjusted returns across a range of size categories in both developed and emerging markets.

11. Unifying Momentum & Technical Analysis

Despite their similarities, trend-following moving average rules are often still considered to be technical trading rules versus the quantitative approach of time-series momentum. Perhaps the biggest difference is that the trend-following camp tended to focus on prices while the momentum camp focused on returns.

Momentum - Bruder Dao Richard and RoncalliHowever, research over the last half-decade actually shows that they are highly related strategies.

Bruder, Dao, Richard, and Roncalli (2011) unites moving-average-double-crossover strategies and time-series momentum by showing that cross-overs were really just an alternative weighting scheme for returns in time-series momentum. To quote,

The weighting of each return … forms a triangle, and the biggest weighting is given at the horizon of the smallest moving average. Therefore, depending on the horizon n2 of the shortest moving average, the indicator can be focused toward the current trend (if n2 is small) or toward past trends (if n2 is as large as n1/2 for instance).

We can see, above, this effect in play.  When n2 << n1 (e.g. n2=10, n1=100), returns are heavily back-weighted in the calculation.  As n2 approaches half of n1, we can see that returns are most heavily weighted at the middle point.

Marshall, Nguyen and Visaltanachoti (2012) proves that time-series momentum is related to moving-average-change-in-direction. In fact, time-series momentum signals will not occur until the moving average changes direction.  Therefore, signals from a price-minus-moving-average strategy are likely to occur before a change in signal from time-series momentum.

Levine and Pedersen (2015) shows that time-series momentum and moving average cross-overs are highly related. It also find that time-series momentum and moving-average cross-over strategies perform similarly across 58 liquid futures and forward contracts.

Beekhuizen and Hallerbach (2015) also links moving averages with returns, but further explores trend rules with skip periods and the popular MACD rule. Using the implied link of moving averages and returns, it shows that the MACD is as much trend following as it is mean-reversion.

Zakamulin (2015) explores price-minus-moving-average, moving-average-double-crossover, and moving-average-change-of-direction technical trading rules and finds that they can be interpreted as the computation of a weighted moving average of momentum rules with different lookback periods.

These studies are important because they help validate the approach of price-based systems. Being mathematically linked, technical approaches like moving averages can now be tied to the same theoretical basis as the growing body of work in time-series momentum.

12. Conclusion

As an investment strategy, momentum has a deep and rich history.

Its foundational principles can be traced back nearly two centuries and the 1900s were filled with its successful practitioners.

But momentum went long misunderstood and ignored by academics.

In 1993, Jegadeesh and Titman published “Returns to Buying Winners and Selling Losers: Implications for Stock Market Efficiency.”  Prevailing academic theories were unable to account for cross-sectional momentum in rational pricing models and the premier market anomaly was born.

While momentum’s philosophy of “buy high, sell higher” may seem counterintuitive, prevailing explanations identify its systemized process as taking advantage of the irrational behavior exhibited by investors.

Over the two decades following momentum’s (re)introduction, academics and practitioners identified the phenomenon as being robust in different asset classes and geographies around the globe.

After the financial crisis of 2008, a focus on using time-series momentum emerged as a means to manage risk.  Much like cross-sectional momentum, time-series momentum was found to be robust, offering significant risk-management opportunities.

While new studies on momentum are consistently published, the current evidence is clear: momentum is the premier market anomaly.

 


 

Antonacci, G., (2012). “Risk Premia Harvesting Through Dual Momentum”, Portfolio Management Associates.

Asness, Clifford S., John M. Liew, and Ross L. Stevens. “Parallels Between the Cross-Sectional Predictability of Stock and Country Returns.” Journal of Portfolio Management, 23, no. 3 (Spring 1997), pp. 79–87.

Asness, Cliff, Tobias Moskowitz, and Lasse H. Pedersen (2013), “Value and Momentum Everywhere,” The Journal of Finance 68(3), 929-985.

Beekhuizen, Paul and Hallerbach, Winfried G., Uncovering Trend Rules (May 11, 2015). Available at SSRN: http://ssrn.com/abstract=2604942

Bhojraj, Sanjeev and Swaminathan, Bhaskaran, Macromomentum: Returns Predictability in Currencies and International Equity Indices (October 2002). Available at SSRN: http://ssrn.com/abstract=356940

Blitz, David and Huij, Joop and Martens, Martin, Residual Momentum (August 1, 2009). Available at SSRN: http://ssrn.com/abstract=2319861

Bruder, Benjamin and Dao, Tung-Lam and Richard, Jean-Charles and Roncalli, Thierry, Trend Filtering Methods for Momentum Strategies (December 1, 2011). Available at SSRN: http://ssrn.com/abstract=2289097

Carhart, Mark M. 1997. “On Persistence in Mutual Fund Performance.” Journal of Finance, vol. 52, no. 1 (March):57–82.

Chaves, Denis. 2012. “Eureka! A Momentum Strategy that Also Works in Japan.” Research Affiliates Working Paper (January 9).

Chen, Li-Wen and Yu, Hsin-Yi, Investor Attention, Visual Price Pattern, and Momentum Investing (August 12, 2014). 27th Australasian Finance and Banking Conference 2014 Paper. Available at SSRN: http://ssrn.com/abstract=2292895

Chui, Andy C.W. and Titman, Sheridan and Wei, K. C. John, Individualism and Momentum Around the World (February 23, 2009). Journal of Finance, Forthcoming. Available at SSRN: http://ssrn.com/abstract=1150076

Clare, Andrew and Seaton, James and Smith, Peter N. and Thomas, Steve, Size Matters: Tail Risk, Momentum and Trend Following in International Equity Portfolios (November 6, 2014). Available at SSRN: http://ssrn.com/abstract=2520075

Dudler, Martin and Gmuer, Bruno and Malamud, Semyon, Risk Adjusted Time Series Momentum (June 22, 2014). Swiss Finance Institute Research Paper No. 14-71. Available at SSRN: http://ssrn.com/abstract=2457647

Erb, Claude, and Campbell Harvey, 2006, The strategic and tactical value of commodity futures, Financial Analysts Journal 62, 69–97.

Faber, Mebane T., A Quantitative Approach to Tactical Asset Allocation (February 1, 2013). The Journal of Wealth Management, Spring 2007 . Available at SSRN: http://ssrn.com/abstract=962461

Gorton, Gary B., Fumio Hayashi, and K. Geert Rouwenhorst, 2008, The fundamentals of commodity futures returns, Working paper, University of Pennsylvania.

Griffin, John M. and Martin, J. Spencer and Ji, Susan, Momentum Investing and Business Cycle Risk: Evidence from Pole to Pole (March 18, 2002). AFA 2003 Washington, DC Meetings; EFA 2002 Berlin Meetings Presented Paper. Available at SSRN: http://ssrn.com/abstract=291225

ap Gwilym O., Clare, A., Seaton, J., Thomas, S., (2009). “Price and Momentum as Robust Tactical Approaches to Global Equity Investing”, Journal of Investing, 19, 80-92.

Hurst, B., Ooi, Y.H., Pedersen L., (2010). “Understanding Managed Futures”, AQR Working Paper.

Hurst, B., Ooi, Y. H., Pedersen, L. H., A century of Evidence on Trend- Following Investing, AQR Working paper, 2012.

Hwang, Chuan-Yang and George, Thomas J., The 52-Week High and Momentum Investing. Journal of Finance, Vol. 59, No. 5, pp. 2145-2176. Available at SSRN: http://ssrn.com/abstract=1104491

Jostova, Gergana and Nikolova, Stanislava and Philipov, Alexander and Stahel, Christof W., Momentum in Corporate Bond Returns (December 12, 2012). Available at SSRN: http://ssrn.com/abstract=1651853

Lebaron, B., 1999, Technical trading rule profitability and foreign exchange intervention, Journal of International Economics, 49, 125–143.

Lempérière,Y., Deremble, C., Seager, P., Potters, M., and Bouchard, J.P. (2014) Two centuries of trend following. arXiv:1404.3274 [q-fin.PM]

Liew, Jim Kyung-Soo and Vassalou, Maria, Can Book-to-Market, Size, and Momentum Be Risk Factors That Predict Economic Growth? (April 1999). Available at SSRN: http://ssrn.com/abstract=159293

Liu, Ming and Liu, Qianqiu and Ma, Tongshu, The 52-Week High Momentum Strategy in International Stock Markets (June 12, 2010). Journal of International Money and Finance, Vol. 30, No. 1, 2011, 180-204. Available at SSRN: http://ssrn.com/abstract=1364566

Luu, B.V., and Yu, P. “Momentum in Government-Bond Markets.” The Journal of Fixed Income, 22 (2012), pp. 72-79.

Marshall, Ben R. and Nguyen, Nhut H. and Visaltanachoti, Nuttawat, Time-Series Momentum versus Moving Average Trading Rules (December 22, 2014). Available at SSRN: http://ssrn.com/abstract=2225551

Moskowitz, Tobias J., and Mark Grinblatt. 1999. “Do Industries Explain Momentum?” Journal of Finance, vol. 54, no. 4 (August):1249–1290.

Moskowitz, Tobias J. and Ooi, Yao Hua and Pedersen, Lasse Heje, Time Series Momentum (September 1, 2011). Chicago Booth Research Paper No. 12-21; Fama-Miller Working Paper. Available at SSRN: http://ssrn.com/abstract=2089463

Rouwenhorst, K. Geert. “International Momentum Strategies.” Journal of Finance, 53, no. 1 (February 1998), pp. 267–284.

Rouwenhorst 1999. “Local Return Factors and Turnover in Emerging Stock Markets.” Journal of Finance, vol. 54, no. 4 (August):1439–1464.

Szakmary, A., Shen, Q., and Sharma, S., (2010). “Trend-Following Trading Strategies in Commodity Futures: A Re-Examination”, Journal of Banking and Finance, 34, 409-426.

Wilcox, C. and Crittenden, E. (2005). “Does Trend-Following Work on Stocks?”, BlackStar Funds.

Zakamulin, Valeriy, Market Timing with Moving Averages: Anatomy and Performance of Trading Rules (May 13, 2015). Available at SSRN: http://ssrn.com/abstract=2585056

You Are Not a Monte-Carlo Simulation

This commentary is available as a PDF download here.

Summary­

  • Even when an investment has a positive expected average growth rate, the experience of most individuals may be catastrophic.
  • By focusing on the compound average growth rate, we can see the median realizations – which account for risk – are often more crucial decision points than ensemble averages, which are the focal point of Monte Carlo analysis.
  • These arguments also provide a simple explanation for investor behavior that avoids the need for utility theory concepts that have been used for the past 200+ years.
  • Since we can neither average our results with other investors nor average our results with potential copies of ourselves in infinite states of the world, the best we can do is try to average over time.
  • Because we all live in a multi-period world where we have a single investment portfolio that compounds over time, managing risk can help us maximize our long-term growth rate even if it seems foolish in hindsight.

Pretend we come to you offering a new investment strategy.  Each week, you earn 0.65% (such that over a year you earn 40%), but there is a 1-in-200 chance that you lose -95%.  Would you invest?

If we simulate out a single trial, we can see that within a year, we may lose most of our money.

Of course, just because things went wrong in our singular example does not mean that this is necessarily a bad investment.  In fact, if we evaluate the prospects of this investment by looking at the average experience, we end up with something far more attractive (the “Ensemble,” which is essentially a Monte-Carlo simulation of the strategy).

The math here is simple: 99.5% of the time we make 1.0065x our money and 0.5% of the time, we end up with 0.05x our money.  On average, then, we end up with 1.0017x, or 1.092x annualized.  While the average experience is not the 40% annualized we sought, the 9.2% return after a year is still nothing to scoff at.

Of course, the average is not actually achievable.  There are not infinite variations of this investment strategy for you to allocate your capital across, nor, we suspect, do you have access to infinite versions of you living in parallel universes who can pool their risk.

Rather, you are forced to diversify your risk over time.  Here we end up with a different picture.

Another series of unfortunate events?

Not so fast.  You see, when we move to diversifying over time, we need to look at a time-weighted average.  It is not the arithmetic mean we are after, but rather the geometric mean which will account for the effects of compounding.  Calculating the geometric mean – 1.006599.5% x 0.050.5% – leaves us with a value of 0.9915, i.e. our wealth is expected to decay over time.

Wait.

How is it possible that on average the strategy is a winner if each and every path is expected to decay over time?

The simple answer: A few fortunate outliers make up for all decaying paths.

The slightly more complex answer: In this investment, our wealth can never go below $0 but we can theoretically make an infinite amount of money.  Thus, over time, the average is dragged up.

The Misleading Mean

In many cases, the average experience can be entirely misleading for the experience you can expect.  In the world of bell-curves and normal distributions, we typically expect experiences to be clustered around the average.  For example, there are more people close to the average height than there are far away.

However, when other distributions apply, the average can be unlikely.  Wealth distribution is a perfect example of this.  In 2013 in the United States, the top 10% of families held 76% of the wealth while the bottom 50% held 1%.  Using 2017 figures, if we divided net worth among the U.S. population – i.e. the “average” household wealth – it would come out to around $760,000 per family.  The bottom 50%, however, have a net worth closer to $11,000 per family.

In other words, if you pick a random person off the street, their experience is likely much closer to $11,000 than $760,000.  It’s the wealthy outliers that are pulling the average up.

A more applicable metric, in this case, might be the median, which will say, “50% of experiences are below this level and 50% are above.”

The Role of Risk

As it turns out, the median is important for those of us diversifying over time as well.  If we consider our hypothetical investment strategy above, our intuition is that the median result is probably not great.  Eventually, it feels like, everyone goes practically bankrupt.  If we plot the median result, we see almost exactly that.

(As a side note, if you’re wondering why the median result exhibits a sawtooth pattern rather than the smoother results of the mean, the answer is the median is the actual result that sits at the 50th percentile.  Knowing that the probability of losing 95% of our wealth is 1-in-200, it takes time for enough individuals to experience a poor result for the median to drop.)

In fact, if we model investment wealth as a Geometric Brownian Motion (a commonly used stochastic process for modeling stock prices), then over the long run an investor’s compound growth rate approaches the median, not the mean.[1]  The important difference between the two is that while volatility does not affect the expected level of wealth, it does drive the mean and median further apart.  In fact, the median growth rate is the mean growth rate minus half the volatility squared (which you might recognize as being the common approximation for – drum roll please – the geometric growth rate).

In other words: volatility matters.

Most investors we speak with have an intuitive grasp of this concept.  They know that when you lose 10% of your wealth, you need to gain 11.11% back to get to break even.

And when you lose 50%, and you need to earn 100% to get back to break even.  Under compound results, feeling twice the pain from losses than the pleasure from gain makes complete sense.  There are no individual and independent trials: results have consequences.

This is why taking less risk can actually lead to greater growth in wealth in the long run.  If we take too little risk, we will will not participate, but too much risk can lead to ruin.  For example, below we plot final wealth results after a 50% drop in market value and a 100% recovery depending on your capture ratio.

As an example of reading this graph, if we start with $1 and experience a 50% loss and a 100% gain, but are only 50% exposed to each of those movements (i.e. we lose 25% and then gain 50%), we end up with $1.125.  At the far right of the graph, we can see that at 2x exposure, the first 50% move completely wipes out our capital.

Common Sense Utility Theory

What economists have found, however, is that even if we offer our investment as a one-off event – where the expected return is definitively positive – most would still forego it.  To resolve this conundrum, economists have proposed utility theory.

The argument is that investors do not actually try to maximize their expected change in wealth, but rather try to maximize the expected utility of that change.  The earliest formalization of this concept was in a paper written by Daniel Bernoulli in 1738, where he proposed a mathematical function that would correct the expected return to account for risk aversion.

Bernoulli’s originally proposed function was log-utility.  And under log-utility, our investment strategy offering is no longer appealing: log(1.0065) x 99.5% + log(0.05) x 0.50% is a negative value.  What’s interesting about log utility is that, due to the property of logarithms, it ends up creating the identical decision axiom as had we used our compound growth rate model.

log(1.0065) x 99.5% + log(0.05) x 0.50% = log(1.006599.5%) + log(0.050.5%) = log(1.006599.5% x 0.050.5%)

So while utility theory is supposed to correct for behavioral foibles like “risk aversion,” what it really does is take a single-period bet and turn it into a multi-period, compound bet.

Under the context of multi-period, compounding results, “risk aversion” is not so foolish.  If we have our arm mauled off by a lion on the African veldt, we cannot simply “average” our experience with others in the tribe and end up with 97% of an arm.  We cannot “average” our experience across the infinite universes of other potential outcomes where we were not necessarily mauled.  Rather, our state is permanently altered for life.

Similarly, if we lose 50% of our money, we cannot just “average” our results with other investors.  Nor can we average our results with all the potential infinite alternate universes where we did not lose 50%.  The best we can do is try to average over time, which means that our compound growth rate matters.  And, as we demonstrated above, so does risk.

Conclusion

Ex-post, managing risk can often feel foolish.  Almost exactly 9 years after the bottom of the 2008-2009 bear market, the S&P 500 has returned more than 380%.  Asset class, geographic, and process diversification largely proved foolish relative to simple buy-and-hold.

Ex-ante, however, few would forgo risk management.  Ask yourself this: would you sell everything today to buy only U.S. large-cap stocks?  If not, then there is little to regret about not having done it in the past.  While the narratives we spin often make realized results seem obvious in hindsight, the reality is that our collective crystal balls were just as cloudy back then as they are today.

Few lament that their house did not burn down when they buy fire insurance.  We buy insurance “in case,” not because we want the risk to materialize.

We all live in a multi-period world where we have a single investment portfolio that compounds over time.  In such a world, risk matters tremendously.  A single, large loss can take us permanently off plan.  Even small losses can put us off course when compounded in a streak of bad luck.  While a focus on risk aversion may seem foolish in hindsight when risk does not materialize, going forward we know that managing risk can help us maximize our long-term growth rate.

 


 

[1] Derivations for this result can be found in our commentary Growth Optimal Portfolios

Thinking in Long/Short Portfolios

This post is available as a PDF download here.

Summary­

  • Few investors hold explicit shorts in their portfolio, but all active investors hold them
  • We (re-)introduce the simple framework of thinking about an active portfolio as a combination of a passive benchmark plus a long/short portfolio.
  • This decomposition provides greater clarity into the often confusing role of terms like active bets, active share, and active risk.
  • We see that while active share defines the quantity of our active exposure, the active bets themselves define the quality.

Ask the average investor if they employ shorting in their portfolios and “no” is likely the answer.

Examine the average portfolio, however, and shorts abound.  Perhaps not explicitly, but certainly implicitly.  But what in the world is an implicit short?

As investors, if we held no particular views about the market, our default position would be a market-capitalization weighted portfolio.  Any deviation from market-capitalization weighted, then, expresses some sort of view (intentional or not).

For example, if we hold a portfolio of 40 blue-chip stocks instead of a total equity market index, we have expressed a view.  That view is in part determined by what we hold, but equally important is what we do not.

In fact, we can capture this view – our active bets ­– by looking at the difference between what we hold in our portfolio and the market-capitalization weighted index.  And we quite literally mean the difference.  If we take the weights of our portfolio and subtract the weights of the index, we will be left with a dollar-neutral long/short portfolio.  The long side will express those positions that we are overweight relative to the index, and the short side will express those positions we are underweight.

Below is a simple example of this idea.

PortfolioBenchmarkImplied Long/Short
Stock A25%50%-25%
Stock B75%50%25%

 

“Dollar-neutral” simply means that the long and short legs will be of notional equal size (e.g. in the above example they are both 25%).

While our portfolio may appear to be long only, in reality it expresses a view that is captured by a long/short portfolio.  As it turns out, our portfolio has an implicit short.

This framework is important because it allows us to go beyond evaluating what we hold and instead evaluate both the bets we are taking and the scale of those bets.  Generically speaking, we can say:

Portfolio = Benchmark + b x Long/Short

Here, the legs of the Long/Short portfolio are assumed to have 100% notional exposure.  Using the example above, this would mean that the long/short is 100% long Stock B, 100% short Stock A, and b is equal to 25%.

This step is important because it allows us to disentangle quantity from quality.  A portfolio that is very overweight AAPL and a portfolio that is slightly overweight AAPL are expressing the same bet: it is simply the magnitude of that bet that is different.

So while the Long/Short portfolio captures our active bets, b measures our active share.  In the context of this framework, it is easy to see that all active share determines is how exposed our portfolio is to our active bets.

We often hear a good deal of confusion about active share.  Is more active share a good thing?  A bad thing?  Should we pay up for active share?  Is active share correlated with alpha?  This framework helps illuminate the answers.

Let’s slightly re-write our equation to more explicitly highlight the difference between our portfolio and the benchmark.

Portfolio – Benchmark = b x Long/Short

This means that the difference in returns between the portfolio and the benchmark will be entirely due to the return generated by the Long/Short portfolio of our active bets and how exposed we are to the active bets.

RPortfolio – RBenchmark = b x RLong/Short

Our expected excess return is then quite easy to think about: it is quite simply the expected return of our active bets (the Long/Short portfolio) scaled by how exposed we are to them (i.e. our active share):

E[RPortfolio – RBenchmark] = b x E[RLong/Short]

Active risk (also known as “tracking error”) also becomes quite easy to conceptualize.  Active risk is simply the standard deviation of differences in returns between our Portfolio and the Benchmark.  Or, as our framework shows us, it is just the volatility of our active bets scaled by how exposed we are to them.

s[RPortfolio – RBenchmark] = b x s[RLong/Short]

We can see that in all of these cases, both our active bets as well as our active share play a critical role.  A higher active share means that the fee we are paying provides us more access to the active bets.  It does not mean, however, that those active bets are necessarily any good.  More is not always better.

Active share simply defines the quantity.  The active bets, expressed in the long/short portfolio, will determine the quality.  That quality is often captured by the Information Ratio, which is the expected excess return of our portfolio versus the benchmark divided by how much tracking error we have to take to generate that return.

IR = E[RPortfolio – RBenchmark] / s[RPortfolio – RBenchmark]

Re-writing these terms, we have:

IR = E[RLong/Short] / s[RLong/Short]

Note that the active share component cancels out.  The information ratio provides us a pure measure of the quality of our active bets and ignores how much exposure our portfolio actually has to those bets.

Both quantity and quality are ultimately important in determining whether the portfolio will be able to overcome the hurdle rate set by the portfolio’s fee.

b x E[RLong/Short] > FeePortfolio – FeeBenchmark

The lower our active share, the higher our expectation for our active bets needs to be to overcome the fee spread.  For example, if the spread in fee between our portfolio and the benchmark is 1% and our active share is just 25%, then we have to believe that our active bets can generate a return in excess of 4% to justify paying the fee spread.  If, however, our active share is 75%, then the return needed falls to 1.33%.

Through this equation we can also understand the implications of fee pressure.  If the cost of the active portfolio and the cost of the benchmark are equivalent, there is zero hurdle rate to overcome.  We would choose active so long as we expect a positive return from our active bets.[1]

However, through its organizational structure and growth, Vanguard has been able to continually lower the fee of the passive benchmark over the last several decades.  All else held equal, this means that the hurdle rate for active managers goes up.

Thus as the cost of passive goes down, active managers must lower their fee in a commensurate manner or boost the quality of their active bets.

Conclusion

For long-only “smart beta” and factor portfolios, we often see a focus on what the portfolio holds.  While this is important, it is only a piece of the overall picture.  Just as important in determining performance relative to a benchmark is what the portfolio does not hold.

In this piece, we explicitly calculate active bets as the difference between the active portfolio and its benchmark.  This framework helps illuminate that our active return will be a function both of the quality of our active bets as well as the quantity of our exposure to them.

Finally, we can see that if our aim is to outperform the benchmark, we must first overcome the fee we are paying.  The ability to overcome that fee will be a function of both quality and quantity.  By scaling the fee by the portfolio’s active share, we can identify the hurdle rate that our active bets must overcome.

[1] More technically, theory tells us we would need a positive marginal expected utility from the investment in the context of our overall portfolio.

Powered by WordPress & Theme by Anders Norén