Small optimizations on iggen buffer handling #317

fknorr · 2024-12-04T11:20:08Z

perform_task_buffer_accesses updates last-writers twice to gracefully handle overlapping writes, which are an edge case. This PR quickly checks whether overlapping writes are present and sticks to a single update if there are none. By transposing the loop nest from chunk -> bid to bid -> chunk, we also avoid constructing another unordered_map.
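For readers unfamiliar with the generator, here is a rough, self-contained sketch of the two ideas: a cheap overlap check that selects a single-pass update, and a buffer-outer loop that keeps per-buffer state in a local instead of a map keyed by buffer id. It is not the code from this PR. Every name in it (interval, chunk, tagged_write, has_overlapping_writes, update_last_writers, ambiguous_writer) is hypothetical, regions are simplified to 1D half-open intervals, and the two-pass slow path is an invented placeholder that merely marks multiply-written elements as ambiguous.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// All names below are hypothetical and not taken from the Celerity code base.
using buffer_id = std::size_t;
constexpr int ambiguous_writer = -1; // invented sentinel for multiply-written elements

struct interval { std::size_t begin, end; }; // half-open [begin, end), assumed non-empty

struct write_access { buffer_id bid; interval region; };

struct chunk {
    int writer; // stand-in for the instruction executing this chunk
    std::vector<write_access> writes;
};

struct tagged_write { int writer; interval region; };

// Quick check: after sorting by start, any overlap shows up between neighbours.
bool has_overlapping_writes(std::vector<tagged_write> writes) {
    std::sort(writes.begin(), writes.end(),
        [](const tagged_write& a, const tagged_write& b) { return a.region.begin < b.region.begin; });
    for (std::size_t i = 1; i < writes.size(); ++i) {
        if (writes[i].region.begin < writes[i - 1].region.end) return true;
    }
    return false;
}

// last_writers[bid][i] holds the last writer of element i of buffer bid.
// Assumes every written region lies within the bounds of its buffer.
void update_last_writers(const std::vector<chunk>& chunks, std::vector<std::vector<int>>& last_writers) {
    // Buffer-outer loop: per-buffer state is a plain local, no map keyed by bid.
    for (buffer_id bid = 0; bid < last_writers.size(); ++bid) {
        std::vector<tagged_write> writes;
        for (const auto& c : chunks) {
            for (const auto& w : c.writes) {
                if (w.bid == bid) writes.push_back({c.writer, w.region});
            }
        }
        auto& lw = last_writers[bid];

        if (!has_overlapping_writes(writes)) {
            // Common case: disjoint writes, one update pass is enough.
            for (const auto& w : writes) {
                for (std::size_t i = w.region.begin; i < w.region.end; ++i) lw[i] = w.writer;
            }
            continue;
        }

        // Edge case (placeholder semantics): pass 1 counts writes per element,
        // pass 2 assigns a writer only where exactly one chunk wrote.
        std::vector<int> count(lw.size(), 0);
        for (const auto& w : writes) {
            for (std::size_t i = w.region.begin; i < w.region.end; ++i) ++count[i];
        }
        for (const auto& w : writes) {
            for (std::size_t i = w.region.begin; i < w.region.end; ++i) {
                lw[i] = (count[i] == 1) ? w.writer : ambiguous_writer;
            }
        }
    }
}
```

In this toy model, a chunk-outer loop would have to accumulate the per-buffer write lists in some container keyed by buffer id before the overlap check could run; iterating buffers first makes that container unnecessary.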

Results are not looking too impressive in the benchmark report, but I do get a consistent 4% speedup for RSim room_small, which is scheduler bound on gpuc3.

github-actions · 2024-12-04T11:25:15Z

Check-perf-impact results: (c8fb992b35322012b54e351345fdf71a)

✔️ No significant performance change in the microbenchmark set. You are good to go!

Relative execution time per category: (mean of relative medians)

command-graph : 1.01x
graph-nodes : 1.03x
grid : 1.01x
instruction-graph : 0.97x
scheduler : 0.98x
system : 0.98x
task-graph : 1.02x

coveralls · 2024-12-04T11:31:02Z

Pull Request Test Coverage Report for Build 12787152826

Details

36 of 36 (100.0%) changed or added relevant lines in 1 file are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.02%) to 95.067%

Totals
Change from base Build 12390656669:	0.02%
Covered Lines:	7087
Relevant Lines:	7192

💛 - Coveralls

GagaLP

Nicely done.
LGTM! 👍

psalz

LGTM! I've suggested two comment changes that I've added for my understanding while investigating how to implement replicated writes!

src/instruction_graph_generator.cc

This also avoids an unordered_map by transposing the perform_task_buffer_accesses loop.

fknorr added this to the 0.7.0 milestone Dec 4, 2024

fknorr requested review from psalz, PeterTh and GagaLP December 4, 2024 11:20

fknorr self-assigned this Dec 4, 2024

fknorr force-pushed the iggen-buffer-opt branch from e59eacf to 9785173 Compare December 4, 2024 11:21

celerity deleted a comment from github-actions bot Dec 4, 2024

fknorr force-pushed the iggen-buffer-opt branch 2 times, most recently from 0bd705c to a46a214 Compare December 4, 2024 11:24

GagaLP approved these changes Dec 4, 2024

psalz approved these changes Dec 17, 2024

src/instruction_graph_generator.cc (2 review threads, resolved)

fknorr force-pushed the iggen-buffer-opt branch from a46a214 to c088d0b Compare January 7, 2025 19:03

fknorr added 3 commits January 15, 2025 11:49

Optimization: exit early in establish_coherence_between_buffer_memories

9750a7a

Optimization: only update last-writers twice for overlapping writes

c5051fb

This also avoids an unordered_map by transposing the perform_task_buffer_accesses loop.

Update benchmark results for iggen buffer optimizations

3be3378

fknorr force-pushed the iggen-buffer-opt branch 2 times, most recently from 1ee0a18 to 3be3378 Compare January 15, 2025 11:15

fknorr merged commit 277403a into master Jan 15, 2025
32 checks passed

fknorr deleted the iggen-buffer-opt branch January 15, 2025 13:22
