SuperFast SuperTrend

Oleg Polakow
Published in DataDrivenInvestor
15 min read · Feb 13, 2022


While Python is slower than many compiled languages, it’s easy to use and extremely versatile. For many, especially in the data science domain, the practicality of the language outweighs speed considerations: it’s like a Swiss army knife for programmers and researchers alike.

Unfortunately for quants, Python becomes a real bottleneck when iterating over (a large amount of) data. For this reason, there is an entire ecosystem of scientific packages such as NumPy and Pandas, which are highly optimized for performance, with critical code paths often written in Cython or C. Those packages mostly work on arrays, giving us a common interface for processing data in an efficient manner.

This ability is highly appreciated when constructing indicators that can be translated into a set of vectorized operations, such as OBV. Even non-vectorized operations, such as the exponential weighted moving average (EMA) that powers numerous indicators like MACD, have been implemented in a compiled language and are offered as ready-to-use Python functions. But sometimes an indicator is difficult or even impossible to develop solely using standard array operations because it introduces a path dependency, where a decision today depends upon a decision made yesterday. One member of this family of indicators is SuperTrend.

In this article, you will learn how to design and implement a SuperTrend indicator and gradually optimize it for unprecedented performance using TA-Lib and Numba. We will also backtest the newly created indicator on a range of parameters using vectorbt PRO (with hints on how to run various parts using the community version as well).

Data

The first step is always getting the (right) data. In particular, we need a sufficient amount of data to benchmark different SuperTrend implementations. Let’s pull 2 years of hourly Bitcoin and Ethereum data from Binance using the vectorbt’s BinanceData class:
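
A fetch call along these lines should do the trick. This is only a sketch: the exact method and argument names (fetch, timeframe, and the date bounds used here) are assumptions that vary between vectorbt versions.

import vectorbtpro as vbt  # or `import vectorbt as vbt` for the community version

data = vbt.BinanceData.fetch(
    ["BTCUSDT", "ETHUSDT"],   # both symbols are fetched and aligned in one go
    start="2020-01-01 UTC",
    end="2022-01-01 UTC",
    timeframe="1h"            # hourly candles
)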

Hint: use download in the community version.

The fetching operation for both symbols took around 80 seconds to complete. Since Binance, like any other exchange, won't return all of the data at once, vectorbt first requested the maximum amount of data starting on January 1st, 2020 and then gradually collected the remaining data while respecting Binance's API rate limits. In total, this resulted in 36 requests per symbol. Finally, vectorbt aligned both symbols in case their indexes or columns differed and made the final index timezone-aware (in UTC).

To avoid repeatedly hitting the Binance servers each time we start a new Python session, we should save the downloaded data locally using either vectorbt's Data.to_csv or Data.to_hdf:
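
For example, assuming Data.to_hdf accepts the target file path as its first argument (the file name is arbitrary):

data.to_hdf("binance_data.h5")  # stores one key per symbol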

Hint: local data is not supported by the community version but you can still save and retrieve data manually using Pandas.

We can then access the saved data easily using HDFData:
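
Something along these lines, assuming the same file name as above (use the appropriate fetch/pull method of your vectorbt version):

data = vbt.HDFData.fetch("binance_data.h5")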

Once we have the data, let’s take a quick look at what’s inside. To get any of the stored DataFrames, use the Data.data dictionary with each DataFrame keyed by symbol:
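
For instance, the summary below can be produced with a standard Pandas call:

data.data["BTCUSDT"].info()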

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 17514 entries, 2019-12-31 23:00:00+00:00 to 2021-12-31 22:00:00+00:00
Data columns (total 10 columns):
 #   Column              Non-Null Count  Dtype
---  ------              --------------  -----
 0   Open                17514 non-null  float64
 1   High                17514 non-null  float64
 2   Low                 17514 non-null  float64
 3   Close               17514 non-null  float64
 4   Volume              17514 non-null  float64
 5   Close time          17514 non-null  datetime64[ns, UTC]
 6   Quote volume        17514 non-null  float64
 7   Number of trades    17514 non-null  int64
 8   Taker base volume   17514 non-null  float64
 9   Taker quote volume  17514 non-null  float64
dtypes: datetime64[ns, UTC](1), float64(8), int64(1)
memory usage: 2.0 MB

We can also get an overview of all the symbols captured:
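
The aggregated overview below comes from the stats method (as the output's name suggests, metrics are averaged over symbols where applicable):

print(data.stats())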

Start                   2019-12-31 23:00:00+00:00
End 2021-12-31 22:00:00+00:00
Period 17514
Total Symbols 2
Null Counts: BTCUSDT 0.0
Null Counts: ETHUSDT 0.0
Name: agg_func_mean, dtype: object

Each symbol has 17514 data points with no NaNs — good!

If you have ever worked with vectorbt, you know that it prefers data to be supplied with symbols as columns (one per backtest) rather than features as columns. Since SuperTrend depends upon the high, low, and close price, let's get those three features as separate DataFrames using Data.get:
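
A minimal sketch, assuming Data.get accepts a feature name:

high = data.get("High")
low = data.get("Low")
close = data.get("Close")
print(close)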

symbol                      BTCUSDT  ETHUSDT
Open time
2019-12-31 23:00:00+00:00 7195.23 129.16
2020-01-01 00:00:00+00:00 7177.02 128.87
2020-01-01 01:00:00+00:00 7216.27 130.64
2020-01-01 02:00:00+00:00 7242.85 130.85
2020-01-01 03:00:00+00:00 7225.01 130.20
... ... ...
2021-12-31 18:00:00+00:00 46686.41 3704.43
2021-12-31 19:00:00+00:00 45728.28 3626.27
2021-12-31 20:00:00+00:00 45879.24 3645.04
2021-12-31 21:00:00+00:00 46333.86 3688.41
2021-12-31 22:00:00+00:00 46303.99 3681.80
[17514 rows x 2 columns]

We’re all set to design our first SuperTrend indicator!

Design

SuperTrend is a trend-following indicator that uses Average True Range (ATR) and median price to define a set of upper and lower bands. The idea is rather simple: when the close price crosses above the upper band, the asset is considered to be entering an uptrend, hence a buy signal. When the close price crosses below the lower band, the asset is considered to have exited the uptrend, hence a sell signal.

Unlike the idea, the calculation procedure is anything but simple:

BASIC UPPERBAND = (HIGH + LOW) / 2 + Multiplier * ATR
BASIC LOWERBAND = (HIGH + LOW) / 2 - Multiplier * ATR

FINAL UPPERBAND = IF (Current BASIC UPPERBAND < Previous FINAL UPPERBAND)
                     OR (Previous Close > Previous FINAL UPPERBAND)
                  THEN Current BASIC UPPERBAND
                  ELSE Previous FINAL UPPERBAND

FINAL LOWERBAND = IF (Current BASIC LOWERBAND > Previous FINAL LOWERBAND)
                     OR (Previous Close < Previous FINAL LOWERBAND)
                  THEN Current BASIC LOWERBAND
                  ELSE Previous FINAL LOWERBAND

SUPERTREND = IF (Previous SUPERTREND == Previous FINAL UPPERBAND)
                AND (Current Close <= Current FINAL UPPERBAND)
             THEN Current FINAL UPPERBAND
             ELIF (Previous SUPERTREND == Previous FINAL UPPERBAND)
                AND (Current Close > Current FINAL UPPERBAND)
             THEN Current FINAL LOWERBAND
             ELIF (Previous SUPERTREND == Previous FINAL LOWERBAND)
                AND (Current Close >= Current FINAL LOWERBAND)
             THEN Current FINAL LOWERBAND
             ELIF (Previous SUPERTREND == Previous FINAL LOWERBAND)
                AND (Current Close < Current FINAL LOWERBAND)
             THEN Current FINAL UPPERBAND

Even though the basic bands can easily be computed using the standard tools, you'll certainly get a headache when attempting to do the same for the final bands. The consensus among most open-source solutions is to use a basic Python for-loop and write the array elements one at a time. But is this scalable? We're here to find out!

Pandas

Pandas is a fast, powerful, flexible, and easy-to-use open-source data analysis and manipulation tool. Since it's the go-to library for processing data in Python, let's write our first implementation using Pandas alone. It will take one column and one combination of parameters, and return four arrays: one for the SuperTrend (trend), one for the direction (dir_), one for the uptrend (long), and one for the downtrend (short). We'll also split the implementation into 5 parts, for readability and to be able to optimize any component at any time (see the sketch after the list below):

  1. Calculation of the median price — get_med_price
  2. Calculation of the ATR — get_atr
  3. Calculation of the basic bands — get_basic_bands
  4. Calculation of the final bands — get_final_bands
  5. Putting all puzzles together — supertrend
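
A reconstruction of what these five pieces might look like in plain Pandas. This is not necessarily the author's exact code: the direction flag dir_ simply encodes the band-switching rules from the pseudocode above.

import numpy as np
import pandas as pd

def get_med_price(high, low):
    # Median price: midpoint between high and low
    return (high + low) / 2

def get_atr(high, low, close, period):
    # True range is the largest of three candidate ranges
    tr0 = abs(high - low)
    tr1 = abs(high - close.shift())
    tr2 = abs(low - close.shift())
    tr = pd.concat((tr0, tr1, tr2), axis=1).max(axis=1)
    # Wilder's smoothing is an EMA with alpha = 1 / period
    return tr.ewm(alpha=1 / period, adjust=False, min_periods=period).mean()

def get_basic_bands(med_price, atr, multiplier):
    matr = multiplier * atr
    upper = med_price + matr
    lower = med_price - matr
    return upper, lower

def get_final_bands(close, upper, lower):
    # The path-dependent part: each decision depends on the previous one
    trend = pd.Series(np.full(close.shape, np.nan), index=close.index)
    dir_ = pd.Series(np.full(close.shape, 1), index=close.index)
    long = pd.Series(np.full(close.shape, np.nan), index=close.index)
    short = pd.Series(np.full(close.shape, np.nan), index=close.index)

    for i in range(1, close.shape[0]):
        if close.iloc[i] > upper.iloc[i - 1]:
            dir_.iloc[i] = 1       # crossed above the upper band -> uptrend
        elif close.iloc[i] < lower.iloc[i - 1]:
            dir_.iloc[i] = -1      # crossed below the lower band -> downtrend
        else:
            dir_.iloc[i] = dir_.iloc[i - 1]
            # Final bands may only tighten, never widen, while the trend persists
            if dir_.iloc[i] > 0 and lower.iloc[i] < lower.iloc[i - 1]:
                lower.iloc[i] = lower.iloc[i - 1]
            if dir_.iloc[i] < 0 and upper.iloc[i] > upper.iloc[i - 1]:
                upper.iloc[i] = upper.iloc[i - 1]
        if dir_.iloc[i] > 0:
            trend.iloc[i] = lower.iloc[i]
            long.iloc[i] = lower.iloc[i]
        else:
            trend.iloc[i] = upper.iloc[i]
            short.iloc[i] = upper.iloc[i]
    return trend, dir_, long, short

def supertrend(high, low, close, period=7, multiplier=3):
    med_price = get_med_price(high, low)
    atr = get_atr(high, low, close, period)
    upper, lower = get_basic_bands(med_price, atr, multiplier)
    return get_final_bands(close, upper, lower)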

Let’s run the supertrend function on the BTCUSDT symbol:
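
Something like this; the output names supert, superd, superl, and supers are only an assumption (the article later refers to the first one as the supert Series):

supert, superd, superl, supers = supertrend(
    high["BTCUSDT"], low["BTCUSDT"], close["BTCUSDT"]
)
print(supert)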

Open time
2019-12-31 23:00:00+00:00 NaN
2020-01-01 00:00:00+00:00 NaN
2020-01-01 01:00:00+00:00 NaN
... ...
2021-12-31 20:00:00+00:00 47608.346563
2021-12-31 21:00:00+00:00 47608.346563
2021-12-31 22:00:00+00:00 47608.346563
Length: 17514, dtype: float64

If you print out the head of the supert Series using supert.head(10), you'll notice that the first 6 data points are all NaN. This is because the ATR's rolling period is 7, so the first 6 computed windows contained incomplete data.

A graph is worth 1,000 words. Let’s plot the first month of data (January 2020):
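
For illustration only, a Matplotlib sketch of such a plot (the original tutorial uses vectorbt's Plotly-based plotting instead; the variable names follow the assumptions above):

import matplotlib.pyplot as plt

jan = slice("2020-01-01", "2020-02-01")
fig, ax = plt.subplots(figsize=(10, 5))
close.loc[jan, "BTCUSDT"].plot(ax=ax, label="Close")
superl.loc[jan].plot(ax=ax, label="Long", color="green")
supers.loc[jan].plot(ax=ax, label="Short", color="red")
ax.legend()
plt.show()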

SuperTrend over the first month of BTCUSDT

We’ve generated and visualized the SuperTrend values, but what about performance? Can we already make our overfitting machine with thousands of parameter combinations rolling? Not so fast. As you might have guessed, the supertrend function takes some time to compute:

2.15 s ± 19.3 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

Ouch! Doing 1,000 backtests would take us roughly 36 minutes.

Let’s hear what Pandas TA has to say about this:
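
Pandas TA ships its own supertrend function, so benchmarking it in a notebook boils down to something like:

import pandas_ta

%timeit pandas_ta.supertrend(high["BTCUSDT"], low["BTCUSDT"], close["BTCUSDT"], length=7, multiplier=3)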

That’s a 3x speedup, mostly due to the fact that Pandas TA uses ATR from TA-Lib.

Is it now acceptable? Of course not. Can we get better than this? Hell yeah!

NumPy + Numba = ❤️

Pandas shines whenever it comes to manipulating heterogeneous tabular data, but is this really applicable to indicators? You might have noticed that even though we used Pandas, none of the operations in our newly defined functions makes use of index or column labels. Moreover, most indicators take, manipulate, and return arrays of the same dimensions and shape, which makes indicator development a purely algebraic challenge that can be decomposed into multiple vectorized steps, solved on a per-element basis, or both. Given that Pandas just extends NumPy and the latter is considered a faster (though lower-level) package, let's adapt our logic to NumPy arrays instead.

Both functions get_med_price and get_basic_bands are based on basic arithmetic computations such as addition and multiplication, which are applicable to both Pandas and NumPy arrays and require no further changes. But what about get_atr and get_final_bands? The former can be re-implemented using NumPy and vectorbt's own arsenal of Numba-compiled functions:
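
Rather than relying on vectorbt's internal Numba helpers (whose exact names vary between versions), here is a self-contained Numba sketch of the same ATR logic, seeded with a simple average of the first window like TA-Lib does:

import numpy as np
from numba import njit

@njit
def get_atr_nb(high, low, close, period):
    # True range of each bar, measured against the previous close
    tr = np.empty(close.shape, dtype=np.float64)
    tr[0] = high[0] - low[0]
    for i in range(1, close.shape[0]):
        tr[i] = max(high[i] - low[i], abs(high[i] - close[i - 1]), abs(low[i] - close[i - 1]))
    # Wilder's smoothing: NaN until the first full window
    atr = np.full(close.shape, np.nan, dtype=np.float64)
    atr[period - 1] = np.mean(tr[:period])
    for i in range(period, close.shape[0]):
        atr[i] = (atr[i - 1] * (period - 1) + tr[i]) / period
    return atr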

The latter, on the other hand, is an iterative algorithm — it’s rather a poor fit for NumPy and an ideal fit for Numba, which can easily run for-loops at a machine code speed:
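
A sketch of the Numba-compiled counterpart of get_final_bands; as noted below, it mirrors the Pandas version almost line by line:

@njit
def get_final_bands_nb(close, upper, lower):
    trend = np.full(close.shape, np.nan)
    dir_ = np.full(close.shape, 1)
    long = np.full(close.shape, np.nan)
    short = np.full(close.shape, np.nan)

    for i in range(1, close.shape[0]):
        if close[i] > upper[i - 1]:
            dir_[i] = 1
        elif close[i] < lower[i - 1]:
            dir_[i] = -1
        else:
            dir_[i] = dir_[i - 1]
            if dir_[i] > 0 and lower[i] < lower[i - 1]:
                lower[i] = lower[i - 1]
            if dir_[i] < 0 and upper[i] > upper[i - 1]:
                upper[i] = upper[i - 1]
        if dir_[i] > 0:
            trend[i] = lower[i]
            long[i] = lower[i]
        else:
            trend[i] = upper[i]
            short[i] = upper[i]
    return trend, dir_, long, short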

If you look at the function above, you'll notice that 1) it's regular Python code that can run even without being decorated with @njit, and 2) it's almost identical to the Pandas implementation, the main difference being that each iloc[...] is replaced by [...]. We can write a simple Python function that operates on constants and NumPy arrays, and Numba will try to make it much faster, fully automatically. Isn't that impressive?

Let’s look at the result of this refactoring:
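
Wiring the pieces together and calling them on raw NumPy arrays might look as follows (the name faster_supertrend and the unpacked output names are again assumptions):

def faster_supertrend(high, low, close, period=7, multiplier=3):
    med_price = get_med_price(high, low)
    atr = get_atr_nb(high, low, close, period)
    upper, lower = get_basic_bands(med_price, atr, multiplier)
    return get_final_bands_nb(close, upper, lower)

supert, superd, superl, supers = faster_supertrend(
    high["BTCUSDT"].values,
    low["BTCUSDT"].values,
    close["BTCUSDT"].values
)
print(supert)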

array([nan, nan, nan, ..., 47608.3465635, 47608.3465635, 47608.3465635])

As expected, those are arrays similar to the ones returned by the supertrend function, just without any labels. To attach labels, we can simply do:
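
For example:

pd.Series(supert, index=close.index)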

Open time
2019-12-31 23:00:00+00:00 NaN
2020-01-01 00:00:00+00:00 NaN
2020-01-01 01:00:00+00:00 NaN
2020-01-01 02:00:00+00:00 NaN
2020-01-01 03:00:00+00:00 NaN
...
2021-12-31 18:00:00+00:00 48219.074906
2021-12-31 19:00:00+00:00 47858.398491
2021-12-31 20:00:00+00:00 47608.346563
2021-12-31 21:00:00+00:00 47608.346563
2021-12-31 22:00:00+00:00 47608.346563
Length: 17514, dtype: float64

Wondering how much our code has gained in performance? Wonder no more:

1.11 ms ± 7.05 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

That’s a 780x speedup over an average Pandas TA run 😈

NumPy + Numba + TA-Lib = ⚡️

If you think that this result cannot be topped, then apparently you haven't worked with TA-Lib. Even though there is no SuperTrend indicator available in TA-Lib, we can still use its highly optimized indicator functions for intermediate calculations. In particular, instead of reinventing the wheel and implementing the median price and ATR functionality from scratch, we can use the MEDPRICE and ATR TA-Lib functions respectively. They have two major advantages over our custom implementation (see the sketch after this list):

  1. Single pass through data
  2. No compilation overhead from Numba
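
The resulting function might look as follows; this is a sketch of what the article calls faster_supertrend_talib, reusing the helpers defined above:

import talib

def faster_supertrend_talib(high, low, close, period=7, multiplier=3):
    med_price = talib.MEDPRICE(high, low)                # (high + low) / 2 in a single pass
    atr = talib.ATR(high, low, close, timeperiod=period) # Wilder's ATR, also in a single pass
    upper, lower = get_basic_bands(med_price, atr, multiplier)
    return get_final_bands_nb(close, upper, lower)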

Another 4x improvement: by the time another trader has processed a single column of data, we will have processed around 3,000 columns. Agreed, the speed of our indicator is slowly getting ridiculously high 😄

Indicator factory

Let’s stop here and ask ourselves: why do we even need such a crazy performance?

That's where parameter optimization comes into play. The two parameters that we have, period and multiplier, are currently set to the default values commonly used in technical analysis. But what makes those values universal, and how do we know there aren't better values for the markets we're participating in? Imagine having a pipeline that can backtest hundreds or even thousands of parameter combinations and reveal the configurations that perform better on average across market regimes.

IndicatorFactory is vectorbt's own powerhouse that can make any indicator function parametrizable. To get a better idea of what this means, let's supercharge the faster_supertrend_talib function:
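
A sketch of such a factory call; the output names are the same assumption as before, and the defaults for period and multiplier are passed through with_apply_func:

SuperTrend = vbt.IF(
    class_name="SuperTrend",
    short_name="st",
    input_names=["high", "low", "close"],
    param_names=["period", "multiplier"],
    output_names=["supert", "superd", "superl", "supers"]
).with_apply_func(
    faster_supertrend_talib,
    takes_1d=True,   # our function works on one-dimensional arrays
    period=7,
    multiplier=3
)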

Hint: takes_1d is not supported by the community version — you need to make faster_supertrend_talib accept two-dimensional arrays.

The indicator factory is a class that can generate so-called indicator classes. You can imagine it as a conveyor belt that takes a specification of your indicator function and produces a stand-alone Python class for running that function in a very flexible way. In our example, when we called vbt.IF(...), it internally created an indicator class SuperTrend, and once we supplied faster_supertrend_talib to IndicatorFactory.with_apply_func, it attached a method SuperTrend.run for running the indicator. Let's try it out!
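
For example, using the two-column DataFrames from the Data section (the supert attribute corresponds to the output name assumed above):

st = SuperTrend.run(high, low, close)
print(st.supert)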

symbol                          BTCUSDT      ETHUSDT
Open time
2019-12-31 23:00:00+00:00 NaN NaN
2020-01-01 00:00:00+00:00 NaN NaN
2020-01-01 01:00:00+00:00 NaN NaN
2020-01-01 02:00:00+00:00 NaN NaN
2020-01-01 03:00:00+00:00 NaN NaN
... ... ...
2021-12-31 18:00:00+00:00 48219.074906 3701.151241
2021-12-31 19:00:00+00:00 47858.398491 3792.049621
2021-12-31 20:00:00+00:00 47608.346563 3770.258246
2021-12-31 21:00:00+00:00 47608.346563 3770.258246
2021-12-31 22:00:00+00:00 47608.346563 3770.258246
[17514 rows x 2 columns]

Notice how our SuperTrend indicator magically accepted two-dimensional Pandas arrays, even though the function itself can only work on one-dimensional NumPy arrays. Not only did it compute the SuperTrend on each column, it also converted the resulting arrays back into the Pandas format for pure convenience. So, how does all of this impact the performance?

1.83 ms ± 140 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

Not that much! With all the pre- and postprocessing taking place, the indicator needs roughly one millisecond to process one column (that is, 17k data points).

Expressions

If you think that calling vbt.IF(...) and providing input_names, param_names, and other information manually is too much work, well, vectorbt has something for you. Our faster_supertrend_talib is effectively a black box to the indicator factory, which is why the factory cannot introspect it and derive the required information programmatically. But it easily could if we converted faster_supertrend_talib into an expression!

Expressions are regular strings that can be evaluated into Python code. By giving such a string to IndicatorFactory.from_expr, the factory will be able to see what’s inside, parse the specification, and generate a full-blown indicator class.

Here’s an expression for faster_supertrend_talib:
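
A rough sketch only: the exact annotation syntax and the way helper functions and defaults are passed as context may differ between vectorbt versions.

expr = """
SuperTrend[st]:
medprice = @talib_medprice(high, low)
atr = @talib_atr(high, low, close, @p_period)
upper, lower = get_basic_bands(medprice, atr, @p_multiplier)
supert, superd, superl, supers = get_final_bands_nb(close, upper, lower)
supert, superd, superl, supers
"""

SuperTrend = vbt.IF.from_expr(
    expr,
    takes_1d=True,
    get_basic_bands=get_basic_bands,
    get_final_bands_nb=get_final_bands_nb,
    period=7,
    multiplier=3
)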

Using annotations with @, we tell the factory how to treat specific variables. For instance, any variable with the prefix @talib gets replaced by the respective TA-Lib function, upgraded with broadcasting and multidimensionality. You can also see that parameters were annotated with @p, while inputs and outputs weren't annotated at all: the factory knows that high is the high price, while the last line apparently returns 4 output objects.

symbol                          BTCUSDT      ETHUSDT
Open time
2019-12-31 23:00:00+00:00 NaN NaN
2020-01-01 00:00:00+00:00 NaN NaN
2020-01-01 01:00:00+00:00 NaN NaN
2020-01-01 02:00:00+00:00 NaN NaN
2020-01-01 03:00:00+00:00 NaN NaN
... ... ...
2021-12-31 18:00:00+00:00 48219.074906 3701.151241
2021-12-31 19:00:00+00:00 47858.398491 3792.049621
2021-12-31 20:00:00+00:00 47608.346563 3770.258246
2021-12-31 21:00:00+00:00 47608.346563 3770.258246
2021-12-31 22:00:00+00:00 47608.346563 3770.258246

2.36 ms ± 81.4 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

By the way, this is exactly how WorldQuant’s Alphas are implemented in vectorbt. Never stop loving Python for the magic it enables ✨

Note: expressions are not supported by the community version.

Plotting

Remember how we previously plotted SuperTrend? We had to manually select the date range from each output array and add it to the plot by passing the figure around. Let’s subclass SuperTrend and define a method plot that does all of this for us:
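
A simplified Matplotlib sketch of such a subclass; the original tutorial builds a Plotly figure with vectorbt's own plotting methods, and the attribute names follow the output names assumed earlier:

import matplotlib.pyplot as plt

class SuperTrend(SuperTrend):  # extend the generated indicator class
    def plot(self, column, ax=None):
        # Inputs and outputs are attached to the instance under their declared names
        if ax is None:
            _, ax = plt.subplots(figsize=(10, 5))
        self.close[column].rename("Close").plot(ax=ax)
        self.superl[column].rename("Long").plot(ax=ax, color="green")
        self.supers[column].rename("Short").plot(ax=ax, color="red")
        ax.legend()
        return ax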

But how are we supposed to select the date range to plot? Pretty easy: the indicator factory made SuperTrend indexable just like any regular Pandas object! Let's plot the same date range and symbol but slightly change the color palette:
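
Indexing the indicator and plotting one symbol might then look like this (assuming default parameters are hidden from the column index, so a plain symbol name can be used; the palette tweak is omitted here):

st = SuperTrend.run(high, low, close)
st_jan = st.loc["2020-01-01":"2020-01-31"]   # indexing works like on any Pandas object
st_jan.plot(column="BTCUSDT")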

SuperTrend over the first month of BTCUSDT using IndicatorFactory

Beautiful!

Backtesting

Backtesting is usually the simplest step in vectorbt: convert the indicator values into two signal arrays, entries and exits, and supply them to Portfolio.from_signals. To make the test better reflect reality, let's make a couple of adjustments. Since we're calculating the SuperTrend values based on the current close price and vectorbt executes orders right away, we'll shift the execution of the signals by one tick forward:
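
A plain-Pandas sketch of that conversion: an entry wherever a long value is present, an exit wherever a short value is present, both shifted by one bar (from_signals ignores repeated signals while already in a position):

entries = (~st.superl.isnull()).shift(1, fill_value=False)
exits = (~st.supers.isnull()).shift(1, fill_value=False)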

We’ll also apply the commission of 0.1%:
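
A minimal sketch; fees are given as a fraction, and freq is needed for time-based metrics such as the Sharpe ratio:

pf = vbt.Portfolio.from_signals(
    close,
    entries,
    exits,
    fees=0.001,   # 0.1% commission per order
    freq="1h"
)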

We’ve got a portfolio with two columns that can be analyzed with numerous built-in tools. For example, let’s calculate and display the statistics for the ETHUSDT symbol:
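
For example, stats accepts the column to compute the metrics for:

print(pf.stats(column="ETHUSDT"))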

Start                         2019-12-31 23:00:00+00:00
End 2021-12-31 22:00:00+00:00
Period 729 days 18:00:00
Start Value 100.0
End Value 1136.318395
Total Return [%] 1036.318395
Benchmark Return [%] 2750.572933
Max Gross Exposure [%] 100.0
Total Fees Paid 273.006977
Max Drawdown [%] 37.39953
Max Drawdown Duration 85 days 09:00:00
Total Trades 174
Total Closed Trades 174
Total Open Trades 0
Open Trade PnL 0.0
Win Rate [%] 43.103448
Best Trade [%] 33.286985
Worst Trade [%] -13.783496
Avg Winning Trade [%] 7.815551
Avg Losing Trade [%] -3.02012
Avg Winning Trade Duration 3 days 06:43:12
Avg Losing Trade Duration 1 days 07:55:09.090909090
Profit Factor 1.390995
Expectancy 5.955853
Sharpe Ratio 2.259173
Calmar Ratio 6.3245
Omega Ratio 1.103559
Sortino Ratio 3.279668
Name: ETHUSDT, dtype: object

Optimization

Optimization in vectorbt can be performed in two ways: iteratively and column-wise.

The first approach involves a simple loop that goes through every combination of the strategy's parameters and runs the whole logic. This would require you to manually generate a proper parameter grid and concatenate the results for analysis. On the upside, you would be able to use Hyperopt and other tools that work on a per-iteration basis.

The second approach is natively supported by vectorbt and involves stacking columns. If you have 2 symbols and 5 parameter combinations, vectorbt will generate 10 columns in total, one for each symbol and parameter combination, and backtest each column separately without leaving Numba (that's why most functions in vectorbt are specialized in processing two-dimensional data, by the way). Not only does this have a huge performance benefit for small to medium-sized data, it also enables parallelization with Numba and presentation of the results in a Pandas-friendly format.

Let's test the period values 4, 5, ..., 19 and the multiplier values 2.0, 2.1, ..., 4.0, which yields 336 parameter combinations in total. Since our indicator is now parametrized, we can pass those two parameter arrays directly to the SuperTrend.run method, instructing it to build the Cartesian product with the param_product=True flag:
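
Something like this; the parameter grids are built with NumPy (note that np.arange(4, 20) stops before 20):

periods = np.arange(4, 20)            # 4, 5, ..., 19
multipliers = np.arange(20, 41) / 10  # 2.0, 2.1, ..., 4.0

st = SuperTrend.run(
    high, low, close,
    period=periods,
    multiplier=multipliers,
    param_product=True   # test every (period, multiplier) combination
)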

The indicator did 672 iterations — 336 per symbol. Let’s see the columns that have been stacked:
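
The stacked column index can be inspected on any of the outputs:

print(st.supert.columns)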

MultiIndex([( 4, 2.0, 'BTCUSDT'),
( 4, 2.0, 'ETHUSDT'),
( 4, 2.1, 'BTCUSDT'),
( 4, 2.1, 'ETHUSDT'),
( 4, 2.2, 'BTCUSDT'),
...
(19, 3.8, 'ETHUSDT'),
(19, 3.9, 'BTCUSDT'),
(19, 3.9, 'ETHUSDT'),
(19, 4.0, 'BTCUSDT'),
(19, 4.0, 'ETHUSDT')],
names=['st_period', 'st_multiplier', 'symbol'], length=672)

Each of the DataFrames now has 672 columns. Let's plot the latest combination by specifying the column as a regular tuple:
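
Using the plot method sketched earlier:

st.plot(column=(19, 4.0, "ETHUSDT"))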

SuperTrend on period of 19, multiplier of 4, and symbol ETHUSDT

When stacking a huge number of columns, make sure that you are not running out of RAM. You can print the size of any pickleable object in vectorbt using the Pickleable.getsize method:
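
For example, for the indicator instance:

print(st.getsize())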

377.6 MB

The backtesting part remains the same, irrespective of the number of columns:
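
The same sketch as before, now over 672 columns per output; the two-column close broadcasts against the signals by matching the symbol level:

entries = (~st.superl.isnull()).shift(1, fill_value=False)
exits = (~st.supers.isnull()).shift(1, fill_value=False)
pf = vbt.Portfolio.from_signals(close, entries, exits, fees=0.001, freq="1h")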

Instead of computing all the statistics for each single combination, let’s plot a heatmap of their Sharpe values with the periods laid out horizontally and the multipliers laid out vertically. Since we have an additional column level that contains symbols, we’ll make it a slider:
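
A sketch using vectorbt's heatmap accessor, with the level names taken from the column index shown above:

pf.sharpe_ratio.vbt.heatmap(
    x_level="st_period",
    y_level="st_multiplier",
    slider_level="symbol"
)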

Heatmap with Sharpe values for all parameter combinations by symbol

We now have a nice overview of any parameter regions that performed well during the backtesting period, yay!

Conclusion

This example proved that technical analysis with Python doesn't have to be slow: there is a range of accelerators such as Cython, PyPy, and Numba available to bring the performance of our code on par with compiled languages, while we still enjoy the perks of the rich package ecosystem and great flexibility of this programming language. The vectorbt package takes a solid place in that ecosystem by allowing us to take advantage of this acceleration, for example, to run parameter optimization on arbitrary trading strategies and analyze the dynamics of entire markets in the blink of an eye.

View the original tutorial here.

Support

Please support me on GitHub Sponsors and contribute to the new, growing ecosystem for offline trading analytics. In return, you will get early access to vectorbt PRO (the actively developed, commercial successor of vectorbt, aimed at providing professional analytics tools for quants) as well as new exclusive learning content. In particular, you will learn how to

  1. design and unit test a streaming SuperTrend indicator that passes across the data only once and gets updated with every new data point
  2. make the SuperTrend indicator utilize all cores by using multithreading
  3. properly design a backtesting pipeline and perform 67,200 backtests on SuperTrend in just one second!

As always, happy coding ❤️
