Push result from FUs #792

tilk · 2025-01-23T18:35:52Z

This is my third attempt at changing the result direction in FUs from pull to push, and I think this is finally it. The change is very incremental - just enough to achieve the goal and nothing more. The change reduces the number of lines of code and even improves performance a little, thanks to removing some FIFOs. This is a good start for tuning the announcement part of the core.

~~Depends on #784.~~ Merged.

github-actions · 2025-01-23T18:50:49Z

Benchmarks summary

Performance benchmarks

aha-mont64	crc32	minver	nettle-sha256	nsichneu	slre	statemate	ud
▲ 0.417 (+0.000)	▲ 0.532 (+0.007)	▲ 0.372 (+0.002)	0.631 (0.000)	0.359 (0.000)	▼ 0.291 (-0.000)	0.328 (0.000)	▲ 0.443 (+0.003)

You can view all the metrics here.

Synthesis benchmarks (basic)

Device utilisation: (ECP5)	LUTs used as DFF: (ECP5)	LUTs used as carry: (ECP5)	LUTs used as ram: (ECP5)	Max clock frequency (Fmax)
▼ 13690 (-153)	▼ 4389 (-9)	1456 (0)	▼ 1068 (-96)	▲ 53 (+0)

Synthesis benchmarks (full)

Device utilisation: (ECP5)	LUTs used as DFF: (ECP5)	LUTs used as carry: (ECP5)	LUTs used as ram: (ECP5)	Max clock frequency (Fmax)
▼ 21176 (-2096)	▼ 7141 (-56)	▲ 1914 (+32)	▼ 1120 (-96)	▼ 42 (-2)

github-actions · 2025-01-24T09:42:06Z

Benchmarks summary

Performance benchmarks

aha-mont64	crc32	minver	nettle-sha256	nsichneu	slre	statemate	ud
▲ 0.417 (+0.000)	▲ 0.532 (+0.007)	▲ 0.372 (+0.002)	0.631 (0.000)	0.359 (0.000)	▼ 0.291 (-0.000)	0.328 (0.000)	▲ 0.443 (+0.003)

You can view all the metrics here.

Synthesis benchmarks (basic)

Device utilisation: (ECP5)	LUTs used as DFF: (ECP5)	LUTs used as carry: (ECP5)	LUTs used as ram: (ECP5)	Max clock frequency (Fmax)
▼ 14438 (-3404)	▼ 4389 (-9)	1456 (0)	▼ 1068 (-96)	▼ 48 (-1)

Synthesis benchmarks (full)

Device utilisation: (ECP5)	LUTs used as DFF: (ECP5)	LUTs used as carry: (ECP5)	LUTs used as ram: (ECP5)	Max clock frequency (Fmax)
▼ 21324 (-1271)	▼ 7141 (-56)	▲ 1914 (+32)	▼ 1120 (-96)	▲ 45 (+7)

scripts/synthesize.py

Hazardu · 2025-01-25T10:14:28Z

It appears that a lot of code in FuncUnit derived classes is duplicated,
things like assignment of

self.gen_params = gen_params
self.layouts = layouts = gen_params.get(FuncUnitLayouts)
self.issue = Method(i=layouts.issue)
self.push_result = Method(i=layouts.push_result)

are the exact same in almost all cases, with one exception being dummyLsu.py that renamed layouts to fu_layouts

tilk · 2025-01-25T10:29:32Z

@Hazardu

It appears that a lot of code in FuncUnit derived classes is duplicated,

Indeed. This could probably be resolved by adding a base class for FUs, which would include a common constructor. This is not the subject of this PR, however - maybe you would be interested in doing such a refactor?

Hazardu · 2025-01-25T10:34:04Z

Yes, i'll do it. Seems quite simple.

github-actions · 2025-01-25T17:35:56Z

Benchmarks summary

Performance benchmarks

aha-mont64	crc32	minver	nettle-sha256	nsichneu	slre	statemate	ud
▲ 0.417 (+0.000)	▲ 0.532 (+0.007)	▲ 0.373 (+0.002)	▲ 0.632 (+0.001)	▼ 0.359 (-0.000)	▼ 0.291 (-0.000)	▲ 0.328 (+0.000)	▲ 0.443 (+0.003)

You can view all the metrics here.

Synthesis benchmarks (basic)

Device utilisation: (ECP5)	LUTs used as DFF: (ECP5)	LUTs used as carry: (ECP5)	LUTs used as ram: (ECP5)	Max clock frequency (Fmax)
▼ 14682 (-3160)	▼ 4331 (-67)	1456 (0)	▼ 1136 (-28)	▲ 53 (+3)

Synthesis benchmarks (full)

Device utilisation: (ECP5)	LUTs used as DFF: (ECP5)	LUTs used as carry: (ECP5)	LUTs used as ram: (ECP5)	Max clock frequency (Fmax)
▲ 26783 (+4188)	▼ 6821 (-376)	▲ 1940 (+58)	▲ 1664 (+448)	▼ 37 (-1)

github-actions · 2025-01-25T18:51:59Z

Benchmarks summary

Performance benchmarks

aha-mont64	crc32	minver	nettle-sha256	nsichneu	slre	statemate	ud
▲ 0.417 (+0.000)	▲ 0.532 (+0.007)	▲ 0.373 (+0.002)	▲ 0.632 (+0.001)	▼ 0.359 (-0.000)	▼ 0.291 (-0.000)	▲ 0.328 (+0.000)	▲ 0.443 (+0.003)

You can view all the metrics here.

Synthesis benchmarks (basic)

Device utilisation: (ECP5)	LUTs used as DFF: (ECP5)	LUTs used as carry: (ECP5)	LUTs used as ram: (ECP5)	Max clock frequency (Fmax)
▼ 14655 (-3187)	▼ 4331 (-67)	1456 (0)	▼ 1136 (-28)	▼ 43 (-6)

Synthesis benchmarks (full)

Device utilisation: (ECP5)	LUTs used as DFF: (ECP5)	LUTs used as carry: (ECP5)	LUTs used as ram: (ECP5)	Max clock frequency (Fmax)
▲ 27330 (+4735)	▼ 6821 (-376)	▲ 1940 (+58)	▲ 1664 (+448)	▲ 39 (+1)

github-actions · 2025-01-25T19:06:51Z

Benchmarks summary

Performance benchmarks

aha-mont64	crc32	minver	nettle-sha256	nsichneu	slre	statemate	ud
▲ 0.417 (+0.000)	▲ 0.532 (+0.007)	▲ 0.373 (+0.002)	0.632 (0.000)	0.359 (0.000)	▼ 0.291 (-0.000)	0.328 (0.000)	▲ 0.443 (+0.003)

You can view all the metrics here.

Synthesis benchmarks (basic)

Device utilisation: (ECP5)	LUTs used as DFF: (ECP5)	LUTs used as carry: (ECP5)	LUTs used as ram: (ECP5)	Max clock frequency (Fmax)
▲ 13464 (+124)	▼ 4331 (-9)	▼ 1424 (-32)	▼ 1136 (-96)	▲ 49 (+0)

Synthesis benchmarks (full)

Device utilisation: (ECP5)	LUTs used as DFF: (ECP5)	LUTs used as carry: (ECP5)	LUTs used as ram: (ECP5)	Max clock frequency (Fmax)
▲ 25212 (+942)	▼ 6821 (-56)	1940 (0)	▼ 1664 (-96)	▲ 43 (+6)

tilk added refactor Doesn't change functionality, but makes stuff nicer benchmark Benchmarks should be run for this change labels Jan 23, 2025

tilk added 5 commits January 24, 2025 10:27

Work in progress

b0cd747

Fix JB test

0136889

Fix LSU atomic wrapper test

6cb65af

Optional result FIFO

98813a4

Remove fifos from multiplication and division

2218bf0

tilk force-pushed the push-result branch from 437aec2 to 2218bf0 Compare January 24, 2025 09:28

Hazardu reviewed Jan 25, 2025

View reviewed changes

scripts/synthesize.py Outdated Show resolved Hide resolved

Hazardu mentioned this pull request Jan 25, 2025

Refactor FuncUnit to remove duplicate code #793

Open

Rename

0853adc

piotro888 approved these changes Jan 25, 2025

View reviewed changes

Another rename

0bbf2cf

Hazardu approved these changes Jan 25, 2025

View reviewed changes

Merge branch 'master' into push-result

1ed1683

tilk merged commit e200b5b into kuznia-rdzeni:master Jan 25, 2025
14 checks passed

github-actions bot pushed a commit that referenced this pull request Jan 25, 2025

Push result from FUs (#792)

9a34d78

tilk mentioned this pull request Jan 27, 2025

Change get_result to send_result #382

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Push result from FUs #792

Push result from FUs #792

tilk commented Jan 23, 2025 •

edited

Loading

github-actions bot commented Jan 23, 2025

github-actions bot commented Jan 24, 2025

Hazardu commented Jan 25, 2025

tilk commented Jan 25, 2025

Hazardu commented Jan 25, 2025

github-actions bot commented Jan 25, 2025

github-actions bot commented Jan 25, 2025

github-actions bot commented Jan 25, 2025

Push result from FUs #792

Push result from FUs #792

Conversation

tilk commented Jan 23, 2025 • edited Loading

github-actions bot commented Jan 23, 2025

Benchmarks summary

Performance benchmarks

Synthesis benchmarks (basic)

Synthesis benchmarks (full)

github-actions bot commented Jan 24, 2025

Benchmarks summary

Performance benchmarks

Synthesis benchmarks (basic)

Synthesis benchmarks (full)

Hazardu commented Jan 25, 2025

tilk commented Jan 25, 2025

Hazardu commented Jan 25, 2025

github-actions bot commented Jan 25, 2025

Benchmarks summary

Performance benchmarks

Synthesis benchmarks (basic)

Synthesis benchmarks (full)

github-actions bot commented Jan 25, 2025

Benchmarks summary

Performance benchmarks

Synthesis benchmarks (basic)

Synthesis benchmarks (full)

github-actions bot commented Jan 25, 2025

Benchmarks summary

Performance benchmarks

Synthesis benchmarks (basic)

Synthesis benchmarks (full)

tilk commented Jan 23, 2025 •

edited

Loading