GroupedCandleUniverse#

tradingstrategy.candle.GroupedCandleUniverse Python class in Trading Strategy framework.

class GroupedCandleUniverse[source]#

Bases: PairGroupedUniverse

A candle universe where each trading pair has its own candles.

This is helper class to create foundation for multi pair strategies.

For the data logistics purposes, all candles are lumped together in single columnar data blobs. However, it rarely makes sense to execute operations over different trading pairs. :py:class`GroupedCandleUniverse` creates trading pair id -> candle data grouping out from raw candle data.

Usage:

# Get candles for SUSHI-USDT

exchange_universe = client.fetch_exchange_universe()
raw_pairs = client.fetch_pair_universe().to_pandas()
raw_candles = client.fetch_all_candles(TimeBucket.d7).to_pandas()

pair_universe = PandasPairUniverse(raw_pairs)
candle_universe = GroupedCandleUniverse(raw_candles)

# Do some test calculations for a single pair
sushi_swap = exchange_universe.get_by_chain_and_name(ChainId.ethereum, "sushi")
sushi_usdt = pair_universe.get_one_pair_from_pandas_universe(sushi_swap.exchange_id, "SUSHI", "USDT")

raw_candles = client.fetch_all_candles(TimeBucket.d7).to_pandas()
candle_universe = GroupedCandleUniverse(raw_candles)
sushi_usdth_candles = candle_universe.get_candles_by_pair(sushi_usdt.pair_id)
__init__(df, time_bucket=TimeBucket.d1, timestamp_column='timestamp', index_automatically=True, fix_wick_threshold=(0.1, 1.9))#
Parameters:
  • time_bucket – What bar size candles we are operating at. Default to daily. TODO: Currently not used. Will be removed in the future versions.

  • timestamp_column – What column use to build a time index. Used for QStrader / Backtrader compatibility.

  • index_automatically – Convert the index to use time series. You might avoid this with QSTrader kind of data.

  • fix_wick_threshold (tuple | None) –

    Apply abnormal high/low wick fix filter.

    Percent value of maximum allowed high/low wick relative to close. By default fix values where low is 90% lower than close and high is 90% higher than close.

    See tradingstrategy.utils.groupeduniverse.fix_bad_wicks() for more information.

  • df (DataFrame) –

Methods

__init__(df[, time_bucket, ...])

param time_bucket:

clear_cache()

Clear candles cached by pair.

create_empty()

Return an empty GroupedCandleUniverse

create_empty_qstrader()

Return an empty GroupedCandleUniverse.

create_from_multiple_candle_datafarames(dfs)

Construct universe based on multiple trading pairs.

create_from_single_pair_dataframe(df)

Construct universe based on a single trading pair data.

forward_fill([columns, drop_other_columns])

Forward-fill sparse OHLCV candle data.

get_all_pairs()

Go through all liquidity samples, one DataFrame per trading pair.

get_all_samples_by_range(start, end)

Get list of candles/samples for all pairs at a certain range.

get_all_samples_by_timestamp(ts)

Get list of candles/samples for all pairs at a certain timepoint.

get_candle_count()

Return the dataset size - how many candles total

get_candles_by_pair(pair_id)

Get candles for a single pair.

get_closest_price(pair_id, when[, kind, ...])

Get the available liquidity for a trading pair at a specific timepoint or some candles before the timepoint.

get_columns()

Get column names from the underlying pandas.GroupBy object

get_last_entries_by_pair_and_timestamp(...)

Get samples for a single pair before a timestamp.

get_pair_count()

Return the number of pairs in this dataset

get_pair_ids()

Get all pairs present in the dataset

get_price_with_tolerance(pair_id, when, ...)

Get the price for a trading pair at a specific time point, or before within a time range tolerance.

get_prior_timestamp(ts)

Get the first timestamp in the index that is before the given timestamp.

get_sample_count()

Return the dataset size - how many samples total for all pairs

get_samples_by_pair(pair_id)

Get samples for a single pair.

get_single_pair_data([timestamp, ...])

Get all candles/liquidity samples for the single alone pair in the universe by a certain timestamp.

get_timestamp_range([use_timezone])

Return the time range of data we have for.

iterate_samples_by_pair_range(start, end)

Get list of candles/samples for all pairs at a certain range.

get_candle_count()[source]#

Return the dataset size - how many candles total

Return type:

int

get_candles_by_pair(pair_id)[source]#

Get candles for a single pair.

Returns:

Pandas dataframe object with the following columns

  • timestamp

  • open

  • high

  • low

  • close

Parameters:

pair_id (int) –

Return type:

Optional[DataFrame]

get_closest_price(pair_id, when, kind='close', look_back_time_frames=5)[source]#

Get the available liquidity for a trading pair at a specific timepoint or some candles before the timepoint.

Warning

This is a slow lookup method. You might want to use get_price_with_tolerance() instead.

Parameters:
  • pair_id (int) – Trading pair id

  • when (Timestamp) – Timestamp to query

  • kind – One of OHLC data points: “open”, “close”, “low”, “high”

  • look_back_timeframes – If there is no liquidity sample available at the exact timepoint, look to the past to the get the nearest sample. For example if candle time interval is 5 minutes and look_back_timeframes is 10, then accept a candle that is maximum of 50 minutes before the timepoint.

Returns:

We always return a price. In the error cases an exception is raised.

Raises:

CandleSampleUnavailable – There was no samples available with the given condition.

Return type:

float

get_price_with_tolerance(pair_id, when, tolerance, kind='close')[source]#

Get the price for a trading pair at a specific time point, or before within a time range tolerance.

The data may be sparse data. There might not be sample available in the same time point or immediate previous time point. In this case the method looks back for the previous data point within tolerance time range.

This method should be relative fast and optimised for any price, volume and liquidity queries.

Example:

test_price, distance = universe.get_price_with_tolerance(
    pair_id=1,
    when=pd.Timestamp("2020-02-01 00:05"),
    tolerance=pd.Timedelta(30, "m"))

# Returns closing price of the candle 2020-02-01 00:00,
# which is 5 minutes off when we asked
assert test_price == pytest.approx(100.50)
assert distance == pd.Timedelta("5m")
Parameters:
  • pair_id (int) – Trading pair id

  • when (Timestamp) – Timestamp to query

  • kind – One of OHLC data points: “open”, “close”, “low”, “high”

  • tolerance (Timedelta) – If there is no liquidity sample available at the exact timepoint, look to the past to the get the nearest sample. For example if candle time interval is 5 minutes and look_back_timeframes is 10, then accept a candle that is maximum of 50 minutes before the timepoint.

Returns:

Return (price, delay) tuple. We always return a price. In the error cases an exception is raised. The delay is the timedelta between the wanted timestamp and the actual timestamp of the candle.

Candles are always timestamped by their opening.

Raises:

CandleSampleUnavailable – There were no samples available with the given condition.

Return type:

Tuple[float, Timedelta]

static create_empty()[source]#

Return an empty GroupedCandleUniverse

Return type:

GroupedCandleUniverse

static create_empty_qstrader()[source]#

Return an empty GroupedCandleUniverse.

TODO: Fix QSTrader to use “standard” column names.

Return type:

GroupedCandleUniverse

static create_from_single_pair_dataframe(df)[source]#

Construct universe based on a single trading pair data.

Useful for synthetic data/testing.

Parameters:

df (DataFrame) –

Return type:

GroupedCandleUniverse

static create_from_multiple_candle_datafarames(dfs)[source]#

Construct universe based on multiple trading pairs.

Useful for synthetic data/testing.

Parameters:

dfs (Iterable[DataFrame]) – List of dataframes/series where each trading pair is as isolated OHLCV data feed.

Return type:

GroupedCandleUniverse