fix: MinNamespace/MaxNamespace optimizations #1
In issue celestiaorg#216, it was raised that

> more time was spent allocating than hashing in the HashLeaf function.

The benchmark test described in that issue no longer exists, and when I run the BenchmarkComputeRoot benchmark in the nmt package HERE, I can't replicate memory allocation taking longer than the hashing, although this may be related to my test parameters. Below are flamegraphs of CPU/memory usage (I've updated the benchmark to use 20k leaf nodes with a 512-byte transaction size); a rough sketch of the setup follows them.
Base Mem Flamegraph
Base CPU Flamegraph
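For reference, here is roughly the shape of the benchmark behind these graphs. The nmt constructor, option, and Push/Root signatures below are assumptions about the package API (they vary between versions), so treat this as illustrative rather than the exact test:

```go
package nmt_test

import (
	"crypto/rand"
	"crypto/sha256"
	"testing"

	"github.com/celestiaorg/nmt"
)

// Sketch of the benchmark parameters described above: 20k leaves with
// 512-byte transactions. API details (New options, Push/Root signatures)
// are assumptions and may differ between nmt versions.
func BenchmarkComputeRoot(b *testing.B) {
	const (
		numLeaves = 20_000
		nsSize    = 8   // namespace ID size in bytes
		leafSize  = 512 // transaction size in bytes
	)
	// Pre-generate namespaced leaves so the timed loop measures tree
	// construction, not test-data creation. The first nsSize bytes of
	// each leaf are its namespace ID; all-zero namespaces keep the
	// leaves trivially ordered.
	leaves := make([][]byte, numLeaves)
	for i := range leaves {
		leaf := make([]byte, nsSize+leafSize)
		if _, err := rand.Read(leaf[nsSize:]); err != nil {
			b.Fatal(err)
		}
		leaves[i] = leaf
	}
	b.ResetTimer()
	for i := 0; i < b.N; i++ {
		tree := nmt.New(sha256.New(), nmt.NamespaceIDSize(nsSize))
		for _, leaf := range leaves {
			if err := tree.Push(leaf); err != nil {
				b.Fatal(err)
			}
		}
		if _, err := tree.Root(); err != nil {
			b.Fatal(err)
		}
	}
}
```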
I've instead focused on the second issue raised there: the min and max namespace functions were taking up a large amount of RAM.
Solution 1: Reduce the number of calls to MinNamespace & MaxNamespace
The HashNode function made four duplicate MinNamespace/MaxNamespace calls for the same node.
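Roughly, the change looks like the sketch below. The helper names and validation logic are illustrative stand-ins for the nmt Hasher internals, not the exact diff:

```go
package nmtsketch

import "bytes"

// Illustrative stand-ins for the Hasher's accessors: a node is laid out
// as minNs|maxNs|hash, so the namespaces are prefixes of the raw bytes.
func minNamespace(node []byte, nsLen int) []byte { return node[:nsLen] }
func maxNamespace(node []byte, nsLen int) []byte { return node[nsLen : 2*nsLen] }

// parentRange sketches the Solution 1 fix: each child's namespaces are
// derived once and reused for both sibling validation and the parent's
// range, instead of calling MinNamespace/MaxNamespace again for each use.
func parentRange(left, right []byte, nsLen int) (minNs, maxNs []byte, ok bool) {
	leftMinNs, leftMaxNs := minNamespace(left, nsLen), maxNamespace(left, nsLen)
	rightMinNs, rightMaxNs := minNamespace(right, nsLen), maxNamespace(right, nsLen)

	// Sibling validation, reusing the cached values.
	if bytes.Compare(leftMaxNs, rightMinNs) > 0 {
		return nil, nil, false // siblings out of namespace order
	}
	// With ordered siblings, the parent's range is [left min, right max],
	// again taken from the cached values rather than fresh calls.
	return leftMinNs, rightMaxNs, true
}
```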
After reducing these to a single call each, here is the new benchmark output (running `go test -run="none" -bench=. -benchtime=100x -benchmem`).

Solution 2: Update MinNamespace/MaxNamespace to return a slice of the existing array rather than creating a new underlying array
Copying the slice prevents possible array-capacity memory leaks if, for some reason, the namespace ID variable's lifetime is longer than the lifetime of the original slice, but I don't think that is the case here. Slicing with a max capacity (a full slice expression) also prevents appends to the namespace ID slice from editing the underlying ns|ns|hash array; a sketch of the change follows the flamegraphs. Benchmark output after this change:
New Mem Flamegraph
New CPU Flamegraph
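Here is the before/after shape of Solution 2, sketched against the ns|ns|hash node layout (names are illustrative, not the actual method bodies):

```go
// Before: MinNamespace-style accessor that copies the namespace into a
// freshly allocated slice; every call allocates nsLen bytes.
func minNamespaceCopy(node []byte, nsLen int) []byte {
	out := make([]byte, nsLen)
	copy(out, node[:nsLen])
	return out
}

// After: return a view into the node's existing backing array. The
// three-index (full slice) expression caps the capacity at nsLen, so an
// append on the returned ID allocates a new array instead of overwriting
// the adjacent maxNs/hash bytes of the node.
func minNamespaceSlice(node []byte, nsLen int) []byte {
	return node[:nsLen:nsLen]
}
```

Because `node[:nsLen:nsLen]` has len == cap, any append on the returned ID must reallocate, which is what protects the underlying ns|ns|hash array. The trade-off is that the returned ID now shares memory with the node, which only matters if the ID outlives the node (the memory-leak concern above).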
The improvements made by the first solution are made redundant by the second, but I was initially reluctant to replace the slice copy in case it would lead to a memory leak. As I got more familiar with the package, I became more confident that slicing the array may be more appropriate.
Note: since pushed data is already validated in the Push method, adding an UnsafeHashNode that skips validation was also an approach I considered, but I felt that this could lead to more complications and would not make any notable difference compared to either of the other solutions. Having removed the ValidateSiblings function, I also removed its associated tests, but it looks like the same test cases also exist in TestValidateNodes HERE.