Add compare.py to compare the output of multiple benchmarks #5655

alamb · 2023-03-20T18:40:40Z

Which issue does this PR close?

Closes #5561

Rationale for this change

See #5561

What changes are included in this PR?

compare.py script from @Taza53 based on one from @isidentical (see Report and compare benchmark runs against two branches #5561 (comment))
Updated documentation

Are these changes tested?

Not really.

Are there any user-facing changes?

No

mustafasrepo · 2023-03-23T07:25:57Z

benchmarks/README.md

+```shell
+$ git checkout main
+# generate an output script in /tmp/output_main
+$ cargo run --release --bin tpch -- benchmark datafusion --iterations 5 --path /data --format parquet -o /tmp/output_main


I think `--path /data` should be replaced with `--path ./data` in this line. We could also change `--format parquet` to `--format tbl`, assuming the user hasn't run the conversion script (`tbl` is the format that `./tpch-gen.sh` produces).

mustafasrepo · 2023-03-23T07:26:23Z

benchmarks/README.md

+$ cargo run --release --bin tpch -- benchmark datafusion --iterations 5 --path /data --format parquet -o /tmp/output_main
+# generate an output script in /tmp/output_branch
+$ git checkout my_branch
+$ cargo run --release --bin tpch -- benchmark datafusion --iterations 5 --path /data --format parquet -o /tmp/output_my_branch


The same changes as in the suggestion above apply here.

Thank you for these suggestions; I have made them in dc5099d.

mustafasrepo · 2023-03-23T07:34:06Z

benchmarks/README.md

+```shell
+$ git checkout main
+# generate an output script in /tmp/output_main
+$ cargo run --release --bin tpch -- benchmark datafusion --iterations 5 --path /data --format parquet -o /tmp/output_main


Also, when I run this command, I receive an IO error unless /tmp/output_main already exists. Is this expected? If so, I think we should add `mkdir /tmp/output_main` above this line.
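A minimal sketch of the workaround described above, assuming the benchmark binary does not create the output directory itself. The `run_tpch_bench` helper name is hypothetical and not part of the PR, and the `--path ./data --format tbl` arguments follow the earlier review suggestion rather than the original diff.

```shell
# Hypothetical helper (not part of the PR): create the output directory
# before running the benchmark so the run does not fail with an IO error.
run_tpch_bench() {
    out_dir="$1"
    # -p creates missing parent directories and succeeds if the directory exists
    mkdir -p "$out_dir" || return 1
    cargo run --release --bin tpch -- benchmark datafusion \
        --iterations 5 --path ./data --format tbl -o "$out_dir"
}

# Usage, run from the benchmarks directory:
#   run_tpch_bench /tmp/output_main
```

Using `mkdir -p` rather than plain `mkdir` keeps the command safe to rerun, since it does not fail when the directory is already present.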

mustafasrepo · 2023-03-23T14:58:48Z

I added some minor comments. Other than those, this PR LGTM! Thanks @alamb for this PR. It is very useful for comparing results in a friendly report.

alamb · 2023-03-27T21:00:30Z

Thanks again for the review @mustafasrepo

alamb added 3 commits March 20, 2023 15:48

Add compare.py script and documentation

5645827

Add readme

9fd445b

prettier

bf5b1c8

alamb added the development-process label Mar 20, 2023

alamb changed the title ~~Alamb/compare~~ Add compare.py to compare the output of multiple benchmarks Mar 20, 2023

github-actions bot removed the development-process label Mar 20, 2023

alamb mentioned this pull request Mar 20, 2023

Report and compare benchmark runs against two branches #5561

Closed

jaylmiller mentioned this pull request Mar 20, 2023

Add -o option to all e2e benches #5658

Merged

mustafasrepo reviewed Mar 23, 2023


alamb added 2 commits March 27, 2023 16:03

Merge remote-tracking branch 'apache/main' into alamb/compare

24ecf3e

Updated per PR review

dc5099d

alamb merged commit b4dde57 into apache:main Mar 27, 2023

alamb deleted the alamb/compare branch March 27, 2023 21:00
