Benchmark suite for time series analytical databases #3782

evenyag (Maintainer) · 2024-04-23T12:41:10Z

We use TSBS and prometheus-benchmark to benchmark our database.

TSBS only supports a limited number of tables and cannot report system resource usage.
Prometheus-benchmark doesn't support ingesting and querying historical data. It also requires a lot of resources to run.

Both benchmarks are mainly designed for metrics workloads and lack the ability to benchmark analytical workloads. We might implement a new benchmark for GreptimeDB and other time series databases. The benchmark should cover more analytical workloads for time series data.

It should be easy to set up. It'd be helpful if it could simulate workloads of TSBS and prometheus-benchmark.

I created a repo https://github.com/GreptimeTeam/greptime-bench for this. We can continue the discussion in this thread or in the repo's issues.

Dysprosium0626 · 2024-05-16T09:39:18Z

We are aiming to design and implement a benchmark that evaluates the analytical performance of time series databases.

Here is my understanding of how to design for analytical workloads:

Dataset

TSBS provides metrics data from DevOps or IoT devices (e.g. CPU/memory utilization). There are other kinds of time series data we can take into consideration:

Events: e.g. users logging in/out of websites, IoT devices turning on/off. Event data typically consists of timestamps and event types.
Logs: e.g. Log context from server/application/database/network devices. The log data typically consists of timestamps, log level, log content and backtrace.

How to generate the dataset is one of the main concerns.

Deep learning (which I am not familiar with): refer to TSM-Bench. It seems more complicated, but the generated data may be more
Statistical methods: some papers use hidden Markov models to generate data, or extract patterns from existing data and generate more data with those patterns. TPC-DS uses synthetic datasets built from well-studied distributions such as the Normal or Poisson distributions. These are mathematically well defined and easy to implement in a data generator.
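As a minimal sketch of the statistical approach, the Python below draws a metric series from a Normal distribution and an event stream whose arrivals form a Poisson process (exponential inter-arrival times). All function names, parameters and defaults are hypothetical and chosen only for illustration; they do not come from TSBS, TPC-DS or greptime-bench.

```python
import random


def gen_metric(n, start_ts=0, step=10, mean=50.0, stddev=10.0, seed=42):
    """Return n (timestamp, value) points sampled every `step` seconds.

    Values are drawn from Normal(mean, stddev) and clamped to [0, 100],
    roughly imitating a utilization percentage (an assumption).
    """
    rng = random.Random(seed)
    return [
        (start_ts + i * step, min(100.0, max(0.0, rng.gauss(mean, stddev))))
        for i in range(n)
    ]


def gen_events(duration, rate=0.5, start_ts=0.0, types=("login", "logout"), seed=42):
    """Return (timestamp, event_type) tuples within [start_ts, start_ts + duration).

    Inter-arrival times are exponential with parameter `rate` (events per
    second), so arrivals follow a Poisson process.
    """
    rng = random.Random(seed)
    events = []
    ts = start_ts
    while True:
        ts += rng.expovariate(rate)
        if ts >= start_ts + duration:
            break
        events.append((ts, rng.choice(types)))
    return events
```

Seeding the generator keeps the dataset reproducible across runs, which matters when comparing results between databases.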

We should have options to control:

Data type: Metrics, Events, Logs
Scaling factor to control data size. Both domains and tuples should be scaled (refer to TPC-DS).
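One way these options could be expressed is sketched below. The class name, field names, base sizes and the square-root scaling rule are all assumptions made for illustration: the rule simply grows both the number of series (domain) and the rows per series (tuples) so that the total row count grows roughly linearly with the scale factor. It is not TPC-DS's actual scaling model.

```python
import math
from dataclasses import dataclass

VALID_TYPES = ("metrics", "events", "logs")


@dataclass(frozen=True)
class GeneratorConfig:
    data_type: str = "metrics"
    scale_factor: int = 1
    base_series: int = 100              # hypothetical series count at scale factor 1
    base_rows_per_series: int = 10_000  # hypothetical rows per series at scale factor 1

    def __post_init__(self):
        if self.data_type not in VALID_TYPES:
            raise ValueError(f"unknown data type: {self.data_type}")
        if self.scale_factor < 1:
            raise ValueError("scale_factor must be >= 1")

    @property
    def num_series(self):
        # Domain scaling: more distinct series as the scale factor grows.
        return round(self.base_series * math.sqrt(self.scale_factor))

    @property
    def rows_per_series(self):
        # Tuple scaling: more rows per series as the scale factor grows.
        return round(self.base_rows_per_series * math.sqrt(self.scale_factor))

    @property
    def total_rows(self):
        return self.num_series * self.rows_per_series
```

For example, `GeneratorConfig(data_type="logs", scale_factor=4)` doubles both dimensions and quadruples the total row count.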

Query

I collected some scenarios that exercise analytical performance in time series databases:

Data fetching: This is a very basic function, e.g. selecting data by time range with some filters and aggregations.
Anomaly detection: Detect abnormal values. This may involve downsampling.
Prediction: This may involve sliding windows. Some TSDBs have built-in or user-defined prediction functions.
Trending: downsampling
Value filling: upsampling

Pseudo code for some queries:

Data fetching

SELECT time, id 
FROM t 
WHERE time > ts_start 
AND time < ts_stop
AND a > value

Aggregation and Join

SELECT id, AVG(a), SUM(b)
FROM t
WHERE time > ts_start
AND time < ts_stop
GROUP BY id

SELECT t1.id, AVG(t1.a), SUM(t2.b)
FROM t1 JOIN t2 ON t1.a = t2.b
WHERE t1.time > ts_start
AND t1.time < ts_stop
GROUP BY t1.id

Downsampling

SELECT time, id, AVG(a), SUM(b)
FROM t 
WHERE time > ts_start 
AND time < ts_stop
GROUP BY id, time
SAMPLE BY 1H
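To make the intended result of the downsampling query concrete, here is a small reference implementation in Python that averages values per series in fixed time buckets. It is only a sketch: the function name and the `(timestamp, series_id, value)` tuple layout are assumptions, and a benchmark could use something similar to verify query results returned by a database.

```python
from collections import defaultdict


def downsample_avg(points, bucket_seconds=3600):
    """Average values per (bucket_start, series_id).

    `points` is an iterable of (ts_seconds, series_id, value) tuples.
    Each timestamp is aligned down to the start of its bucket.
    """
    acc = defaultdict(lambda: [0.0, 0])  # key -> [running sum, count]
    for ts, series_id, value in points:
        bucket_start = ts - ts % bucket_seconds
        entry = acc[(bucket_start, series_id)]
        entry[0] += value
        entry[1] += 1
    return {key: total / count for key, (total, count) in sorted(acc.items())}
```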

Upsampling

SELECT time, id 
FROM t 
WHERE time > ts_start 
AND time < ts_stop
SAMPLE BY 10s
FILL(LINEAR)
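The upsampling query above asks for a regular 10-second grid with gaps filled by linear interpolation. A minimal Python sketch of that behavior follows; the function name and input layout are assumptions, and real databases may differ in how they align the grid or treat the edges.

```python
def upsample_linear(points, step=10):
    """Resample (ts, value) points onto a regular grid with linear fill.

    `points` must be sorted by timestamp with distinct timestamps. The grid
    starts at the first timestamp and stops at or before the last one.
    """
    if not points:
        return []
    out = []
    i = 0  # index of the latest point whose timestamp is <= the grid timestamp
    ts = points[0][0]
    while ts <= points[-1][0]:
        while i + 1 < len(points) and points[i + 1][0] <= ts:
            i += 1
        t0, v0 = points[i]
        if t0 == ts or i + 1 == len(points):
            out.append((ts, v0))  # exact hit: keep the observed value
        else:
            t1, v1 = points[i + 1]
            out.append((ts, v0 + (v1 - v0) * (ts - t0) / (t1 - t0)))
        ts += step
    return out
```

For instance, given observations at 0 s, 30 s and 50 s, the function emits six points at 0, 10, 20, 30, 40 and 50 s, with the values at 10, 20 and 40 s interpolated.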

Test suite

Like TPC-DS, we can have the following tests:

Loading Test: evaluate the time to import raw data into the DB (single/multi-thread)
Power Test: single-thread performance
Throughput Test: multi-thread performance
Maybe we should add some tests for ETL performance evaluation.
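A rough sketch of how the power and throughput tests could be driven is shown below. `run_query` is a hypothetical callable that sends one query to the database under test and blocks until the result arrives; the function names and the stream model (each thread runs the full query list once) are assumptions for illustration.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def power_test(run_query, queries):
    """Run each query once on a single thread; return per-query latencies in seconds."""
    latencies = []
    for query in queries:
        start = time.perf_counter()
        run_query(query)
        latencies.append(time.perf_counter() - start)
    return latencies


def throughput_test(run_query, queries, streams=4):
    """Run `streams` concurrent copies of the query list.

    Returns (total_queries_executed, elapsed_seconds).
    """
    def run_stream(_):
        for query in queries:
            run_query(query)

    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=streams) as pool:
        list(pool.map(run_stream, range(streams)))  # wait for all streams
    elapsed = time.perf_counter() - start
    return streams * len(queries), elapsed
```

Threads are sufficient here because the client mostly waits on network I/O; a CPU-heavy client would need processes instead.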

Outputs

The outputs cannot be fully designed at the very beginning. In a nutshell, a benchmark is a tool to test performance, so the most important measurement is the time it takes to execute each query. Once we have the import, single-threaded, and multi-threaded execution times, we can derive metrics such as throughput and price/performance.
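As an illustration of how such metrics might be derived from the raw timings, consider the sketch below. The function name, the dictionary keys, and the choice of cost per unit of throughput as the price/performance figure are assumptions, not a definition taken from TPC-DS or any existing tool.

```python
def summarize(rows_loaded, load_seconds, total_queries, throughput_seconds, system_cost):
    """Derive headline metrics from raw benchmark timings.

    `system_cost` is whatever price the benchmark assigns to the tested
    setup, in any currency unit.
    """
    queries_per_sec = total_queries / throughput_seconds
    return {
        "load_rows_per_sec": rows_loaded / load_seconds,
        "queries_per_sec": queries_per_sec,
        # Lower is better: cost paid for each query/second of throughput.
        "cost_per_qps": system_cost / queries_per_sec,
    }
```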

Reference:

TSM-Bench: Benchmarking Time Series Database Systems for Monitoring Applications: https://www.vldb.org/pvldb/vol16/p3363-khelifati.pdf
https://www.odbms.org/2023/12/on-benchmarking-time-series-database-systems-for-monitoring-applications-qa-with-abdelouahab-khelifati-and-mourad/
SciTS: A Benchmark for Time-Series Databases in Scientific Experiments and Industrial Internet of Things: https://arxiv.org/pdf/2204.09795
YCSB-TS: https://github.com/TSDBBench/YCSB-TS
TS-Benchmark: A Benchmark for Time Series Databases: https://www.benchcouncil.org/bench2018/chenyueguo.pdf
The Making of TPC-DS: https://www.tpc.org/tpcds/presentations/the_making_of_tpcds.pdf


evenyag (Maintainer, Author) · 2024-05-16T13:35:08Z

Deep learning (which I am not familiar with): refer to TSM-Bench. It seems more complicated, but the generated data may be more

Statistical methods: some papers use hidden Markov models to generate data, or extract patterns from existing data and generate more data with those patterns. TPC-DS uses synthetic datasets built from well-studied distributions such as the Normal or Poisson distributions. These are mathematically well defined and easy to implement in a data generator.

The statistical method is easier to implement. We could also pre-generate some data and scale it to the factor we want.

Logs: e.g. Log context from server/application/database/network devices. The log data typically consists of timestamps, log level, log content and backtrace

For logs, the Web Server Access Log dataset is a well-known dataset. This is what simple-logging-benchmark uses.


waynexia (Maintainer) · 2024-05-21T12:12:58Z

https://github.com/xo/usql is a brilliant project that supports many database protocols. We can use it to run SQL. And because it is written in Go, its binary runs in a wide range of environments, which also makes it a good choice as a binary dependency.

WenyXu (Maintainer) · 2024-05-23T07:09:50Z

BTW, this paper and its corresponding benchmark tool may also be useful:
https://www.vldb.org/pvldb/vol17/p148-zeng.pdf
https://github.com/XinyuZeng/EvaluationOfColumnarFormats
