save benchmarks in results and use for aggregate coverage calculation #650

DavidKorczynski · 2024-09-25T18:05:02Z

This fixes the discrepancy mentioned in #645 (comment). Saving the benchmark for each result has benefits beyond supporting the aggregate calculation: it also makes it easier to save results when auto-generation of benchmarks is used as part of the experiment (I use this a lot).

Note that the full benchmark .yaml from benchmarks-set/... is not saved; only the actual benchmark (one function or one test file) is saved per result dir.
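For readers who want a picture of the idea, here is a minimal sketch. It is not the code in this PR. It assumes each result dir gets one small file describing the single benchmark it was produced from, and that an aggregate step later reads those files back, here grouped by project. The names `save_benchmark`, `load_saved_benchmarks`, `group_by_project`, the `benchmark.json` file name, and the `project`/`function` keys are all hypothetical. JSON is used only to keep the sketch dependency-free; the PR itself refers to benchmark .yaml files.

```python
import json
from collections import defaultdict
from pathlib import Path

# Hypothetical file name; the PR description talks about .yaml benchmarks.
BENCHMARK_FILE = "benchmark.json"


def save_benchmark(result_dir: Path, benchmark: dict) -> Path:
    """Write the single benchmark behind one result into its result dir."""
    result_dir.mkdir(parents=True, exist_ok=True)
    out_path = result_dir / BENCHMARK_FILE
    out_path.write_text(json.dumps(benchmark, indent=2))
    return out_path


def load_saved_benchmarks(results_root: Path) -> list[dict]:
    """Read back the benchmark saved in each result dir under results_root."""
    return [
        json.loads(path.read_text())
        for path in sorted(results_root.glob(f"*/{BENCHMARK_FILE}"))
    ]


def group_by_project(results_root: Path) -> dict[str, list[dict]]:
    """Group saved benchmarks by project (assumed key: "project")."""
    grouped: dict[str, list[dict]] = defaultdict(list)
    for benchmark in load_saved_benchmarks(results_root):
        grouped[benchmark["project"]].append(benchmark)
    return dict(grouped)
```

Because the aggregate step reads only what sits next to each result, it does not need the original benchmarks-set/... input, which is what makes auto-generated benchmarks easy to handle in this sketch.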

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski · 2024-09-25T18:08:31Z

/gcbrun exp -n d-cov-44 -b comparison -ns 4 -rd

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski · 2024-09-25T18:58:10Z

/gcbrun exp -n d-cov-45 -b comparison -ns 4 -rd

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski · 2024-09-25T19:55:14Z

The experiment is looking good. There are still some minor discrepancies, but I will look at those further down the line: https://llm-exp.oss-fuzz.com/Result-reports/ofg-pr/2024-09-26-650-d-cov-45-comparison/index.html

fix style

f8b3b25

Signed-off-by: David Korczynski <[email protected]>

add some logging

8368b90

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski requested review from DonggeLiu and oliverchang September 25, 2024 19:55

AdamKorcz approved these changes Sep 25, 2024


AdamKorcz merged commit b6fd6c1 into main Sep 25, 2024
6 checks passed

AdamKorcz deleted the save-benchmarks branch September 25, 2024 19:58
