Update OpenFPGA #2983

amin1377 · 2025-04-16T15:48:17Z

No description provided.

The original implementation of APPack was focused on reconstructing a given flat placement. This can cause issues if the given flat placement disagrees with the decisions of the packer. Instead, updated APPack so that it treats the flat placement as a hint to help guide how it performs clustering. Added the following new features: - APPack computes the location of clusters based on the centroid of the molecules packed within. - APPack attenuates the gain terms of candidates based on their distance from the cluster. - APPack drops candidates which are too far from the cluster being created. Remove adding molecules near to the position of the cluster. This had similar affects to unrelated clustering and should be investigated separately later. With these changes to APPack, the AP flow now improves WL of circuits by 1-3% at the expense of up to 15% runtime compared to the default VPR flow.

Remove Redundant print_pb

[APPack] Updated How APPack Adheres to Given Placement

…rilog-to-routing into init_place_wl

Parsing Initial Placement WL and CPD

[RR Graph] RR Node Indices Value Type

…_packing Remove usage of atom to pb lookup from packing

Updated the partial legalizer to now take into account block types when spreading blocks. This will create windows around overfilled bins that is aware of which block types are overfilled and how large the window needs to be to accomodate them. It also takes these block types into account when spreading to only allow blocks to spread into sub-windows that they can exist in. This improves quality but was detremental to performance, so some performance improvements were needed. To improve the performance of the partial legalizer, I split the problem into groups of models which must be spread together. This allows us to create tighter windows and can make some parts of the legalizer more efficient. Create a model grouper class which forms the model pack patterns into a graph and find disconnected sub-graphs to form the model groups. Also improved the window generation by pre-clustering the overfilled bins before creating the windows. This sped up the window generation code since less windows overlap.

…lizer [AP][GlobalPlacement] Improved Partial Legalizer Legality

When no fixed blocks are provided by the user, the AP flow can still work. Currently, in the first iteration, the solver will put all blocks at 0,0 and use the legalized solution in the next iteration as fixed points. Instead of (0,0), it makes more sense to put the blocks in the center of the device. Also added a guess to the solver to help CG converge faster each iteration. Added a regression test to ensure that not describing the fixed blocks is supported.

[AP][Solver] Supporting Unfixed Blocks

…rilog-to-routing into openfpga_update

The SDF file generated by the post-implementation netlist writer was only using the max delays of timing connections in the timing graph. In the SDF file, it set all values of the rising and falling triples to the max delay. When using this SDF file for external timing analysis, the minimum timing (hold) paths were incorrect. Updated the netlist writer to work with triples instead of bare delays. This allows (minimum, typical, maximum) delays to be passed through the different functions and be printed cleanly. For standard delay signals in the circuit (not setup / hold times) Tatum provides the minimum delays. These are now being printed in the SDF file and the minimum timing paths are being found correctly in the external timing analyzer. Cleaned up some parts of the netlist printing code as well. 1) netlist_writer.cpp declared many functions in the global scope which may cause conflicts at link time in VTR. Put all of these methods in anonymous namespace to prevent this. 2) The code was casting the delays from seconds to picoseconds in strange places. This was tricky to work with since these are both stored as doubles. Changed all of the code to only work with delays in seconds, and only cast to picoseconds when printing. 3) General cleanup of the header file and the include files.

Thank you to Fred Tombs for pointing out this issue!

The old Initial Placer used in the AP flow was constructed within the initial placer of the non-AP flow. This forced the AP flow to try to place blocks one at a time with minimum displacement. This is non-ideal since blocks that were placed earlier were being getting first picks at locations, which may displace a future cluster which may be a better fit for that location. Separated out the AP initial placement code. For AP, initial placement is done in passes. The first pass will try to place clusters exactly at the tile that the centroid of all atoms within the cluster want to be placed (according to the global placement). Any clusters that could not be placed are reserved for the next pass. The second pass will allow clusters to be placed within 1 tile of their centroid. All subsequent passes will allow cluster to be placed exponentially farther from their centroid. The initial placement terminates when all clusters have been placed or if the max displacement is the size of the entire device. The clusters are sorted based on the size of the macro that contains them and the variance of the placement of the atoms within the macro. This allows large macro blocks with low variance to be placed first.

[STA] Updated SDF File Generation to Include Min Delays

[AP][InitialPlacement] Created Isolated AP Flow

Override edge attributes in RR graph

Fixed a couple of small known issues around the AP flow related to how we handle fixed blocks. Offset the fixed block locations by 0.5 such that they are no longer on the edge. Previously, fixed blocks were placed at the root location of tiles. This was a problem since atoms would want to be generally close to the fixed block and may be biased to the bottom/left tiles to the fixed-block tile. This does not handle large tiles, but will help in general. If no fixed blocks are provided, the AP solver will always produce the trivial solution (all blocks placed on top of one another anywhere on the device). We were wasting time running bound2bound to solve this and the solution was probably being put on the bottom-left corner (0,0) which is not ideal. Instead of running bound2bound during the first iteration in this case, just placed all blocks in the center of the device. This greatly speeds up the first iteration when no fixed blocks are provided.

…ve_ctx Remove PlacerMoveContext

…l_packer Remove atom_net global context mutation from packer

[AP] General Fixed/Unfixed Blocks Cleanup

…rilog-to-routing into openfpga_update

amin1377 and others added 30 commits March 18, 2025 16:24

[vpr][ap] remove redundant print_pb

4f648f2

Fix styling regressions

e2ac829

Add reset_bimap helper method to AtomPBBimap

9e0d48a

Remove copying empty bimap from global context to cluster legalizer

4c61867

Refactor is_atom_blk_in_pb function to get two t_pb* arguments

624f251

Merge branch 'master' into refactor_atom_pb_from_packing

0e6f62a

Fix minor styling issues

dfb6462

[vpr][pack] reomve redundant function calls

5f7b793

[vpr][place] fix estimated_wl var name

7110636

Merge pull request #2939 from verilog-to-routing/redundant_print_pb

3a19e8d

Remove Redundant print_pb

Merge branch 'master' into feature-appack

47fca21

Merge pull request #2934 from AlexandreSinger/feature-appack

ab25381

[APPack] Updated How APPack Adheres to Given Placement

Merge branch 'master' of https://github.com/verilog-to-routing/vtr-ve…

04bb518

…rilog-to-routing into init_place_wl

make format

0b526ab

[vpr][route] remove redundant functions from rr_graph2

4fdd98d

make format

4992203

[vpr][route] remove redundant functions from rr_graph2

11c9a57

[libs][rr_graph] change rr_node_indices value type to RRNodeId

1a17be2

fix formatting issues

9338a57

make format

c115a4a

Merge pull request #2938 from verilog-to-routing/init_place_wl

0af49af

Parsing Initial Placement WL and CPD

Merge pull request #2941 from verilog-to-routing/rr_node_indices

c6802d6

[RR Graph] RR Node Indices Value Type

Merge branch 'master' into refactor_atom_pb_from_packing

f77c3c7

Merge pull request #2932 from AmirhosseinPoolad/refactor_atom_pb_from…

ccb2396

…_packing Remove usage of atom to pb lookup from packing

Merge pull request #2942 from AlexandreSinger/feature-ap-partial-lega…

c9e6075

…lizer [AP][GlobalPlacement] Improved Partial Legalizer Legality

[vpr][rr_graph] fix comment

e824925

Merge pull request #2944 from AlexandreSinger/feature-ap-solver

b3d9694

[AP][Solver] Supporting Unfixed Blocks

soheilshahrouz and others added 4 commits April 19, 2025 12:32

Merge branch 'master' into ingest_per_edge_delay

2ef555f

fix typo

4c8d908

get_bb_from_scratch_() accepts use_ts as its argument

649f3b6

[libs][librrgraph] update echo file of rr graph

b013af7

amin1377 requested a review from tangxifan April 20, 2025 16:00

amin1377 and others added 23 commits April 20, 2025 09:05

Merge branch 'master' of https://github.com/verilog-to-routing/vtr-ve…

b8d0455

…rilog-to-routing into openfpga_update

[test][strong] update golden result

9db1939

[test] update strong tileable golden result

345d251

explain what RR edge override feature is useful for

d390291

[test][tileable] update golden results

40a71cc

add comment for MoveGenerator::first_rlim

8bcf4f4

[STA] Updated How Un-Initialized Delay Triples are Handled

e03cd90

Thank you to Fred Tombs for pointing out this issue!

add doxygen comment for X_coord, Y_coord, and layer_coord

118165e

remove X_coord and Y_coord from feasibe_region_move_generator

82bc33d

add comment explaining ts and permanent data members

900e2e2

make format

1706dd4

Merge pull request #2986 from AlexandreSinger/feature-open-sta

e4f4f4e

[STA] Updated SDF File Generation to Include Min Delays

Merge pull request #2988 from AlexandreSinger/feature-ap-initial-placer

3663572

[AP][InitialPlacement] Created Isolated AP Flow

Merge pull request #2930 from verilog-to-routing/ingest_per_edge_delay

c881146

Override edge attributes in RR graph

Merge pull request #2989 from verilog-to-routing/temp_remove_place_mo…

f71176c

…ve_ctx Remove PlacerMoveContext

Remove atom_net global context mutation from packer

f3b166e

Merge pull request #2984 from verilog-to-routing/wip_remove_mut_globa…

735448c

…l_packer Remove atom_net global context mutation from packer

Merge pull request #2990 from AlexandreSinger/feature-ap-fixed-blocks

1e479b9

[AP] General Fixed/Unfixed Blocks Cleanup

Merge branch 'master' of https://github.com/verilog-to-routing/vtr-ve…

c97ca21

…rilog-to-routing into openfpga_update

[vpr][tileable_rr_graph] fix rr_switch usage

dd08d1e

tangxifan merged commit cab1db1 into openfpga Apr 24, 2025
36 checks passed

tangxifan deleted the openfpga_update branch April 24, 2025 16:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update OpenFPGA #2983

Update OpenFPGA #2983

Uh oh!

amin1377 commented Apr 16, 2025

Uh oh!

Uh oh!

Uh oh!

Update OpenFPGA #2983

Update OpenFPGA #2983

Uh oh!

Conversation

amin1377 commented Apr 16, 2025

Uh oh!

Uh oh!

Uh oh!