trading performance

Trading Performance: Strategy Metrics, Risk-Adjusted Metrics, And Backtest

Trading performance and strategy metrics are important parameters to evaluate the quality of your trading strategy + both for your live trading strategy or if you are backtesting. Just looking at the end result, the CAGR or the annual returns, might be very misleading. If you’re a short-term trader we are pretty confident in saying that most traders would abandon a strategy if the drawdowns are too big no matter how high returns. This is why you need to look at strategy and system performance metrics. You need to measure your trading performance.

This article looks at several trading strategies and system performance metrics: the equity curve, max drawdown, win ratio, Sharpe Ratio, profit factor, CAR/MDD, the RAR/MDD, and the Ulcer Index. There are many more metrics, but we believe these should offer a very good background on where to look when you develop trading strategies and systems.

Trading Performance Metrics

Table of contents:

Trading Performance and Strategy Metrics don’t need to be complex

Trading Performance and Strategy Metrics are not complex. As a matter of fact, you get a very long way by using common sense and knowing your own limits. As we have explained many times, street smarts beat book smarts in trading!

Nevertheless, it might pay off to know a little about the different performance metrics that most trading software displays after you run a backtest.

Below is how a backtest system report looks in Amibroker. Most trading software reports the same performance metrics. We briefly touch upon the backtest report in our Amibroker course.

indicators metrics & strategies
Above is the trading and system performance metrics used by Amibroker.

As you can see, there is an abundance of metrics and numbers. Luckily, you don’t need to know them all. The most important in the report is, in our opinion, the max drawdown. Why? Because risk is mainly the risk of losing money, either permanent or temporary:

What is risk?


We define risk as downside volatility. It’s certainly not a perfect proxy for risk, but it makes sense because it is highly correlated to behavioral mistakes.

What are behavioral mistakes in trading? That is, for example, selling into a panic only to discover you nailed the bottom.

To better illustrate the return and risk, let’s look at the chart below:

Strategy Performance Metrics
Brummer & Partner’s return. Source: Website.

The red line is the performance of a Swedish hedge fund, while the grey line is the MSCI World Index. Which path would you prefer to traverse –  the red or the grey one?

Related reading: Trading System

Beginning traders and investors look at the annual returns and might pick the grey line because it has the highest returns. However, that is a little hindsight or survivorship bias. It’s easy to pick the best option when you know the result. But clearly, the red line has a much smoother traverse than the grey line. That’s why most professional traders and asset managers prefer the smoothest ride, ie. the red line.

Before we look in details, let’s briefly look at all the key performance metrics in trading (we’ll later in the article cover what we consider the best):

What are some important types of metrics in trading performance?

Risk-Adjusted Metrics

Some important metrics in trading performance are:

#1: the equity curve

First of all, you don’t need much mathematical knowledge to evaluate the performance of your trading strategy. A look at the equity curve is in most cases more than enough to judge if you have a viable strategy or not. The reason is simple:

The equity curve is the visual or graphical representation of your equity over the backtested time frame. An equity curve that is sloping gradually upward is, of course, preferable to a curve that is very volatile or even random.

Here are two examples (the blue lines on the bottom is the drawdowns):

how to measure trading performance
Versus this one:
They have different annual returns and end results, but clearly, the second curve has a very bumpy ride. The reason for that is the drawdown:
 

Trading strategy and system performance metric #2: max drawdown

We have written many articles about max drawdowns.

Put short, a drawdown is the difference between the latest peak in your equity/assets and the losses suffered after that peak to a through, both realized and unrealized profits. We rank this as a very important system performance metric.

The chart below shows both the equity curve and the drawdown expressed in percent. For example, in early 2020 the strategy made a new top in the equity curve, but the Covid-19 reduced your equity from 116 000 to under 98 000. The difference between those two numbers is the drawdown. You can express the drawdown in both absolute numbers and in percentages. We prefer the latter.

Most backtesting reports show you the drawdown as in the chart above.

Why is drawdown important?

Drawdown is very important in trading because it makes a huge impact on your behavior and subsequently your returns. Even if you have a strategy that returns 50% annually you might abandon it along the way if you suffer a temporary setback in the form of a drawdown. In the midst of a drawdown, you don’t know if the strategy is busted or if it’s just temporary. In practice, all strategies stop working sooner or later.

What is an acceptable drawdown? Only you can tell, but obviously, you want it as low as possible. But this is a thin line. A small drawdown makes the backtest susceptible to curve fitting or randomness or is just something waiting to “blow up” (the calm before the storm).

On the other hand, drawdowns might be the reason why strategies can last for a long time. Drawdowns shake out the weak hands.

Our experience indicates that any drawdown bigger than 20-25% makes most traders shaky and uncertain. This results in abandonment, fiddling, or curve fitting. Thus, 25% can serve as a heuristic for max drawdown.

#3: Win ratio

One of the first risk or performance metrics you should look at is the win ratio. The win ratio is rarely mentioned as a proper performance metric, but this is probably because most writers don’t trade themselves.

Why is the win ratio an important performance metric?

Let’s first explain what the win ratio is: The win ratio is the winning trades divided by the losing trades.

A high win ratio is important because it reduces behavioral mistakes and the risk of ruin:

  1. A low win ratio increases the probability of having many consecutive losers. Are you willing to pull the trigger after 8 losing trades in a row? Most traders would not.
  2. A low win ratio increases the risk of big drawdowns.
  3. A low win ratio even increases the risk of ruin compared to the same size with a higher win ratio.

As you get more knowledge and experience, we are confident you’ll appreciate a high win ratio, even though your average winners might be smaller than your average losers.

#4: Sharpe Ratio

Perhaps the most used trading strategy performance metric is the Sharpe Ratio. The Sharpe Ratio was invented by William Sharpe, hence the name.

Sharpe studied performance metrics as long back as the 1960s and this is probably the first trading performance metric that was quantified. We can also mention that William Sharpe was awarded the Nobel Prize in economics in 1990, not only because of the Sharpe Ratio but for his overall contribution to finance and economics.

The Sharpe Ratio looks at the relationship between excess return to the risk-free rate per unit of risk. Practically all hedge funds use this metric to evaluate performance.  

A good Sharpe Ratio is preferably above 0.75, but be careful if it’s above 1.5.

#5: the profit factor

Another widely used trading strategy and system performance metric is the profit factor. The profit factor looks at the relationship between gross profits and gross losses. For example, if your strategy has 1 000 in profits and 500 in losses, the profit factor is 2.

What is a good profit factor?

We like to use 1.75 as a threshold: anything above is reasonably good. We are not happy with extreme numbers above 4 either as it most likely signals a curve fitted test or a test that is has been lucky with the market cycles.

However, if your backtest spans a long time period you’ll rarely see any profit factors above 3 if you have a decent number of trading observations.

Keep in mind, however, that the profit factor, in reality, tells you little about the quality of the equity curve. Because of this, we add two more metrics to measure trading strategies performance:

CAR/MDD and RAR/MDD

These two system performance metrics should correlate better with the quality of the equity curve.

#6: CAR/MDD

CAR is an abbreviation for compound annual return (in percent / the same as CAGR) divided by the maximum drawdown.

For example, if the CAGR is 15% and the drawdown is 15%, the CAR/MDD ratio equals 1. You would want the ratio to be as high as possible, but realistically, this one is very hard to get above 1.

#7: RAR/MDD

RAR is an abbreviation for risk-adjusted return and MDD is maximum system drawdown.

First, we need to calculate the risk-adjusted return. This is the geometric return (annual) return in percent divided by the exposure in %. Exposure is the same as time spent in the market. If your strategy has 50 trades per year and makes 10% annual returns, this is more impressive than a strategy that returns the same but is invested 100% of the time. Time spent in the markets matter!

If the annual return is 15% and the exposure is 50%, then the risk-adjusted return is 15/0.5 = 30%

Now that we have defined the risk/adjusted return, we can calculate the RAR/MDD ratio assuming the max drawdown is 15%:

30/15 = 2

We would say that a metric better than 2 is very good.

(Keep in mind that the geometrical return is not the same as the arithmetic return. Please read our primer on the subject that explains why arithmetic and geometric averages differ in trading.)

We have written a separate article about risk-adjusted return:

#8: the Ulcer Index

The Ulcer Index is a relatively new indicator and was developed in the 1980s. It was primarily meant for mutual funds.

The Ulcer Index is a cousin to the standard deviation. Like many other risk parameters it looks at volatility, but only volatility on the downside, ie. max drawdown over the defined lookback period.

The reasoning is straightforward: we are only at risk if we lose money, we are not interested in risk on the upside. Most trading platforms or technical software has included the Ulcer index in their backtesting reports.

The Ulcer Index is calculated in threes steps. In this example, we use a 21-day lookback period.

First, we calculate the percentage drawdown during the period (21 bars):

((close – 21 bars max close) / 21 bars max close)

Second, we need to calculate the squared average:

((21 bars sum of percent drawdown squared) / 21 bars)

Third, we calculate the Ulcer Index:

The square root of the squared average.

Related reading: Evaluating Trading Strategies

Other trading strategy and system performance metrics:

The list we have provided in this article is not exhaustive, of course. There are plenty of others, like the K-ratio, Jensen Ratio, and the Treynor Ratio, for example. No matter what performance metrics you are using please make sure you understand what you are measuring. There is no right or wrong in trading and speculation, and hence you need to understand what you are measuring.

System metrics and the number of strategies – how to mitigate risk

The greatest enemy of a good plan is the dream of a perfect plan. Stick to the good plan.

The quote above was made by the late John Bogle, founder of the Vanguard group. But as a trader, not a long-term investor, you can change the quote to this:

The greatest enemy of a good trading strategy is the dream of a perfect strategy. Stick to the good trading strategy.

What do we mean by this? Many traders try to improve their trading strategies so that they end up curve fitting or fiddling with it. They dream of a strategy that has the perfect equity curve with hardly any drawdowns at all.

But even a good or worse strategy can offer good risk mitigation.

There are a number of ways to mitigate risk in trading. Frequent readers of our website will recognize our risk mitigation techniques:

Diversification

You need to diversify among different strategies. If one trading strategy is far from optimal, it might supplement other strategies if you allocate a small amount of capital to it. You need to look at the sum of the components you put into your trading arsenal.

One very important aspect in that regard is correlation in trading. To minimize correlation between the strategies you need to trade different asset classes, different time frames, and different market directions (long and short). Short selling is difficult, for example, but it can serve as an incredibly useful tool as risk mitigation in a portfolio, as explained by Mark Spitznagel.

Please read our two previous articles about this subject:

Trading strategy and system performance metrics – conclusions

We have provided you with eight different trading strategy and system performance metrics, but we strongly recommend using common sense when you evaluate your backtests.

Ask yourself this: Realistically, are you able to trade the strategy and suffer the drawdowns? Even if you say no, the strategy can add valuable diversification:

You can’t discard a strategy just based on its numbers – you need to look at the contribution to a portfolio of strategies. This makes it, of course, a bit more tricky to backtest.

But the reality in trading is that the sum of many perfect strategies might perform worse than a portfolio that might have much worse performance metrics in trading. It all depends on the correlation and interaction between the strategies!

What are trading performance and strategy metrics?

Trading performance and strategy metrics are parameters used to evaluate the effectiveness and risk of trading strategies. They include metrics such as drawdown, Sharpe Ratio, profit factor, CAR/MDD, RAR/MDD, and Ulcer Index, among others.

Why are trading performance and strategy metrics essential for evaluating trading strategies?

These metrics provide quantitative measures that help traders understand the profitability, risk-adjusted returns, and downside volatility of their strategies. By analyzing these metrics, traders can make informed decisions about strategy selection and optimization.

What are some common trading performance metrics?

Common trading performance metrics include drawdown, Sharpe Ratio, profit factor, CAR/MDD, RAR/MDD, Ulcer Index, win ratio, and others. These metrics offer insights into different aspects of strategy performance, such as profitability, risk, and consistency.

Why is drawdown considered a significant metric in trading?

Drawdown measures the decline in a trading account’s value from its peak to its lowest point, indicating potential losses during market downturns. It’s crucial because it impacts trader behavior and overall returns, influencing decisions to continue or abandon a strategy.

How does the win ratio contribute to assessing trading strategy performance?

The win ratio, calculated as the ratio of winning trades to losing trades, helps assess a trading strategy’s effectiveness in generating profits. A higher win ratio indicates a strategy with fewer consecutive losing trades and potentially lower drawdowns, contributing to overall performance stability.

What is the Sharpe Ratio, and why is it widely used in trading?

The Sharpe Ratio measures the risk-adjusted return of a trading strategy by considering the excess return relative to the risk-free rate per unit of risk. It’s widely used because it helps evaluate the efficiency of a strategy in generating returns while considering the associated risk.

How is the profit factor calculated, and what does it indicate about a strategy?

The profit factor assesses the relationship between gross profits and gross losses in a trading strategy. A higher profit factor indicates a more profitable strategy, with a balance between profits and losses. Extreme values may suggest overfitting or luck in the backtesting process.

What are CAR/MDD and RAR/MDD ratios, and why are they important in evaluating strategy performance?

CAR/MDD (Compound Annual Return to Maximum Drawdown) and RAR/MDD (Risk-Adjusted Return to Maximum Drawdown) ratios assess the relationship between annual returns and maximum drawdown, providing insights into risk-adjusted returns. Higher ratios indicate better performance relative to the endured risk.

How does the Ulcer Index contribute to evaluating trading strategy performance?

The Ulcer Index measures downside volatility by focusing on drawdowns over a defined lookback period. It quantifies the depth and duration of drawdowns, helping traders understand the risk associated with their strategies.

Why is it important for traders to go beyond annual returns when evaluating strategies?

Relying solely on annual returns can be misleading, as it doesn’t capture important aspects like downside risk and volatility. Traders need to delve into various performance metrics to gain a comprehensive understanding of strategy effectiveness and risk.

Why is it important to look beyond annual returns in trading?

Simply relying on annual returns can be misleading. Traders, especially short-term ones, might abandon a strategy if the drawdowns are too significant, regardless of high returns. This is why it’s crucial to delve into various performance metrics to get a more accurate picture.

What is max drawdown, and why is it a crucial metric?

The equity curve visually represents the performance of a trading strategy over time. A smooth, upward-sloping curve is preferable, indicating consistent growth, while a volatile or erratic curve may suggest higher risk. Max drawdown is the difference between the latest peak in equity and the subsequent losses. It’s an important metric as it reflects the risk of losing money, influencing trader behavior and overall returns.

How is the profit factor calculated, and what does it indicate about a strategy?

The profit factor assesses the relationship between gross profits and gross losses. A good profit factor, preferably above 1.75, suggests a healthy balance between profits and losses, avoiding extreme numbers that might indicate overfitting.

What is the importance of backtesting and trading metrics?


Backtesting and trading metrics are essential for evaluating the performance of trading strategies. They provide quantitative measures that help traders understand the profitability and risk of their strategies.

How is the equity curve used to evaluate trading strategy performance?


The equity curve provides a visual representation of a trading strategy’s performance over time. A smooth, upward-sloping curve indicates consistent growth, while a volatile or erratic curve may suggest higher risk. Traders use the equity curve to assess the quality and stability of a strategy.

Why is the max drawdown considered a critical metric in trading?


Max drawdown measures the largest decline in a trading account’s value from its peak to its lowest point. It’s critical because it reflects the potential losses traders may incur during market downturns, influencing their behavior and overall returns.

What role does the win ratio play in evaluating trading strategy performance?


The win ratio, calculated as the ratio of winning trades to losing trades, helps assess a trading strategy’s effectiveness in generating profits. A higher win ratio indicates a strategy with fewer consecutive losing trades and potentially lower drawdowns, contributing to overall performance stability.

How do metrics like CAR/MDD and RAR/MDD contribute to assessing risk-adjusted returns?


CAR/MDD (Compound Annual Return to Maximum Drawdown) and RAR/MDD (Risk-Adjusted Return to Maximum Drawdown) ratios provide insights into the relationship between annual returns and maximum drawdown, offering a measure of risk-adjusted returns. Higher ratios indicate better performance relative to the endured risk.

What is the importance of risk-adjusted metrics?

The importance of backtesting and trading metrics is essential for evaluating the performance of trading strategies.

They provide traders with quantitative measures that help them understand the profitability and risk of their strategies. By analyzing metrics such as the Sharpe ratio, profit factor, and annual growth rate, traders can better judge how robust or good the trading strategy is.

  1. Drawdown: Drawdown measures the decline in the value of a trading account from its peak to its lowest point. It helps traders understand the potential losses they might incur during periods of market downturns.
  2. Sharpe Ratio: The Sharpe ratio is the most widely used performance metric in backtesting and trading. It measures the risk-adjusted return of a trading strategy by taking into account the volatility of returns. A higher Sharpe ratio indicates a better risk-adjusted return.
  3. Profit Factor: The profit factor measures the ratio of the total profit to the total loss generated by a trading strategy. A profit factor greater than 1 indicates a profitable strategy.
  4. Calmar Ratio: The Calmar ratio measures the risk-adjusted return by comparing the annualized return to the maximum drawdown.
  5. Sortino Ratio: Similar to the Sharpe ratio, the Sortino ratio focuses on downside risk by considering only the standard deviation of negative returns, providing a more specific view of risk-adjusted returns.
  6. Ulcer Index: The Ulcer Index quantifies the depth and duration of drawdowns.
  7. MAR Ratio: The MAR (Minimum Acceptable Return) ratio compares the average annual return to the maximum drawdown. It assesses whether the returns are sufficient relative to the risk endured.
  8. Information Ratio: This metric evaluates the risk-adjusted returns by comparing the excess return of a strategy to its tracking error, indicating the ability to outperform a benchmark.
  9. Treynor Ratio: Similar to the Sharpe ratio, the Treynor ratio considers the risk-free rate in the denominator, making it useful for assessing the risk-adjusted performance of portfolios with different risk profiles.
  10. Sterling Ratio: The Sterling ratio measures risk-adjusted returns while accounting for skewness, providing a more complete view of the strategy’s performance. This is a rather unknown risk metric.
  11. Kurtosis: Kurtosis measures the distribution of returns, helping traders understand the strategy’s tail risk and whether it exhibits fat tails (outliers). Please also look at our take on negatively skewed trading strategies.
  12. VaR (Value at Risk): VaR quantifies the maximum loss a trader can expect within a specific confidence interval. It’s mostly used for potential worst-case scenarios.
  13. Conditional Value at Risk (CVaR): CVaR, also known as Expected Shortfall, provides additional information on the tail risk by calculating the average of the worst-case losses.
  14. Calibration Error: This metric assesses the accuracy of a trading model by measuring the difference between predicted and actual returns.
  15. Beta: Beta measures a strategy’s sensitivity to market movements. A beta of 1 indicates a strategy moves in line with the market, while values greater or less than 1 show higher or lower sensitivity, respectively.
  16. Alpha: Alpha evaluates a strategy’s excess return compared to its expected return based on its beta. A positive alpha indicates outperformance.
  17. Maximum Adverse Excursion (MAE): MAE measures the largest adverse movement from the peak return before reaching a new high. It helps identify potential losses in the worst-case scenario.
  18. Profit-to-Drawdown Ratio: This ratio assesses the strategy’s profitability relative to its drawdown. A higher ratio indicates a more efficient strategy.
  19. Win-Loss Ratio: This ratio calculates the percentage of winning trades relative to losing trades.
  20. R-Squared: R-squared measures the proportion of a strategy’s returns that can be explained by a benchmark index. It assesses how closely the strategy tracks the benchmark.
  21. Mean Absolute Deviation (MAD): MAD quantifies the average difference between actual and predicted returns.
  22. Time-Weighted Rate of Return: This metric calculates the average annualized return by considering the time value of investments.
  23. Profitability Index: The profitability index combines various performance metrics to provide an overall assessment of the strategy’s effectiveness and profitability.
  24. Equity Curve: A graphical representation of the performance of a trading strategy over time, showing the growth or decline of the trading account based on the strategy’s performance. This is the simplest and easiest way to look at the performance.

Similar Posts

>