
compare.py: compute and print 'OVERALL GEOMEAN' aggregate #1289

Merged: 1 commit into main, Nov 24, 2021

Conversation

LebedevRI
Collaborator

@LebedevRI LebedevRI commented Nov 23, 2021

Despite the wide variety of features we provide,
some people still have the audacity to complain and demand more.

Concretely, I very often would like to see the overall result
of a benchmark: is the 'new' side better or worse, overall,
across all the non-aggregate time/CPU measurements?

This comes up for me most often when I want to quickly see
what effect some LLVM optimization change has on the benchmark.

The idea is straightforward: produce four lists,
wall times for the LHS benchmark, CPU times for the LHS benchmark,
wall times for the RHS benchmark, CPU times for the RHS benchmark;
then compute the geomean of each of those four lists,
and compute the two percentage changes, between

  • the geomean wall time of the LHS benchmark and the geomean wall time of the RHS benchmark
  • the geomean CPU time of the LHS benchmark and the geomean CPU time of the RHS benchmark
    and voila!
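The aggregation described above can be sketched in plain Python. This is a minimal illustration, not the actual compare.py code, and the benchmark values are made up:

```python
import math

def geomean(values):
    # Geometric mean computed in the log domain, so long lists of
    # large values do not overflow the running product.
    assert values, "geomean of an empty list is undefined"
    return math.exp(sum(math.log(v) for v in values) / len(values))

# Hypothetical non-aggregate wall times, already in a common unit.
lhs_wall = [10.0, 20.0, 40.0]
rhs_wall = [9.0, 18.0, 36.0]

# Percentage change of the RHS geomean relative to the LHS geomean.
change = geomean(rhs_wall) / geomean(lhs_wall) - 1.0
print(f"OVERALL GEOMEAN (wall): {change:+.4f}")  # -0.1000, i.e. 10% faster
```

The same computation is then repeated for the CPU-time lists.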

It is complicated by the fact that it needs to gracefully handle
different time units, so a pandas.Timedelta dependency is introduced.
That is the only library that does not barf on floating-point times;
I have tried numpy.timedelta64 (it only takes integers)
and Python's datetime.timedelta (it does not take nanoseconds),
and they won't do.
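As a small illustration of why pandas.Timedelta fits: it accepts fractional values in any time unit and exposes the duration as integer nanoseconds via `.value`, which is enough to bring mixed-unit measurements onto a common scale. The `to_ns` helper below is hypothetical, not part of the patch:

```python
import pandas as pd

def to_ns(value, unit):
    # pandas.Timedelta accepts floats; .value is the duration as
    # integer nanoseconds, giving a common unit for comparison.
    return pd.Timedelta(value, unit=unit).value

print(to_ns(2.5, "us"))  # 2500 -- a fractional microsecond value survives
print(to_ns(1, "ms"))    # 1000000
```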

Fixes #1147

@google-cla google-cla bot added the cla: yes label Nov 23, 2021
@LebedevRI LebedevRI requested a review from dmah42 November 23, 2021 22:00
@dmah42 dmah42 merged commit d6ba952 into google:main Nov 24, 2021
@LebedevRI LebedevRI deleted the compare-geomean branch November 24, 2021 11:07
jackgerrits added a commit to jackgerrits/benchmark that referenced this pull request Nov 29, 2021
google#1289 added `pandas` to the `gbench` module in the `tools/` directory. That PR added `pandas` to the root `requirements.txt` but not to the `requirements.txt` in the `tools/` directory.
sergiud pushed a commit to sergiud/benchmark that referenced this pull request Jan 13, 2022
@chfast
Contributor

chfast commented Jan 13, 2022

How to use this? I'm doing regular comparison and don't see any "OVERALL GEOMEAN".

@LebedevRI
Collaborator Author

> How to use this? I'm doing regular comparison and don't see any "OVERALL GEOMEAN".

It's not behind any option; it's just there. Perhaps the version of benchmark/tools you use doesn't have it yet?

@chfast
Copy link
Contributor

chfast commented Jan 14, 2022

Ah, I was on the master branch...


Successfully merging this pull request may close these issues.

[FR] compare.py: compute overall stats - e.g. geomean