Debug Assertion failure in btree_pack_post_loop() -> mini_destroy_unused() reporting mismatch in (mini->num_extents == mini->num_batches) #458

gapisback · 2022-10-13T16:42:50Z

Describe the bug

While trying to setup a manual unit-test to repro issue #457, ran into the following stack using debug build of Splinter:

Program received signal SIGABRT, Aborted.
__pthread_kill_implementation (no_tid=0, signo=6, threadid=140737349751616) at ./nptl/pthread_kill.c:44
44	./nptl/pthread_kill.c: No such file or directory.
(gdb) bt
#0  __pthread_kill_implementation (no_tid=0, signo=6, threadid=140737349751616) at ./nptl/pthread_kill.c:44
#1  __pthread_kill_internal (signo=6, threadid=140737349751616) at ./nptl/pthread_kill.c:78
#2  __GI___pthread_kill (threadid=140737349751616, signo=signo@entry=6) at ./nptl/pthread_kill.c:89
#3  0x00007ffff7d15476 in __GI_raise (sig=sig@entry=6) at ../sysdeps/posix/raise.c:26
#4  0x00007ffff7cfb7f3 in __GI_abort () at ./stdlib/abort.c:79
#5  0x0000555555559c72 in platform_assert_false (filename=0x7ffff7fa19a0 "src/mini_allocator.c", linenumber=702,
    functionname=0x7ffff7fa1fb0 <__FUNCTION__.10> "mini_destroy_unused",
    expr=0x7ffff7fa1bb0 "(mini->num_extents == mini->num_batches)",
    message=0x7ffff7fa1b88 "num_extents=%lu, num_batches=%lu\n") at src/platform_linux/platform.c:344
#6  0x00007ffff7f656ef in mini_destroy_unused (mini=0x7fffffffe0a0) at src/mini_allocator.c:702
#7  0x00007ffff7f5f608 in btree_pack_post_loop (req=0x7fffffffbc40, last_key=...) at src/btree.c:2895
#8  0x00007ffff7f5fac2 in btree_pack (req=0x7fffffffbc40) at src/btree.c:2977
#9  0x00007ffff7f802ed in trunk_compact_bundle (arg=0x5555557ca2c0, scratch_buf=0x7ffff79e8ae8) at src/trunk.c:4840
#10 0x00007ffff7f97bea in task_group_perform_one (group=0x7ffff79e7800) at src/task.c:687
#11 0x00007ffff7f97d59 in task_perform_one (ts=0x7ffff79e5040) at src/task.c:713
#12 0x00007ffff7f8562b in trunk_insert (spl=0x7fffe6ef5040, key=0x7fffffffe350 "\b28223148", data=...)
    at src/trunk.c:6192
#13 0x00007ffff7f6989b in splinterdb_insert_message (kvs=0x55555555f700, key=..., msg=...) at src/splinterdb.c:710
#14 0x00007ffff7f69949 in splinterdb_insert (kvsb=0x55555555f700, key=..., value=...) at src/splinterdb.c:718
#15 0x00005555555586a4 in ctest_missing_in_use_virtual_fn_test_basic_flow_run (
    data=0x55555555e220 <ctest_missing_in_use_virtual_fn_test_basic_flow_data>)
    at tests/unit/missing_in_use_virtual_fn_test.c:74
#16 0x0000555555556eb2 in ctest_main (argc=1, argv=0x7fffffffe768) at tests/unit/main.c:305
#17 0x0000555555556974 in main (argc=1, argv=0x7fffffffe768) at tests/unit/main.c:151

The extended assertion message reports:

Inserted 28 million KV-pairs ...

Assertion failed at src/mini_allocator.c:702:mini_destroy_unused(): "(mini->num_extents == mini->num_batches)". num_extents=9, num_batches=8

Reproduction steps

Under construction: Run a single-client inserting 10s of millions of simple KV pairs into Splinter.
Checked that it is not a case of device being full.

Updated (gapisback):

To help move the fix/dev along, I pushed this branch: agurajada/458-mini_destroy_unused-Assert
Containing a new stress test: large_inserts_bugs_stress_test.c. It's pretty simple but useful to repro this problem with debug builds.
...

Expected behavior

Assertion should not occur.

Additional context

From code inspection, mini->num_batches seems to be fixed when btree_create calls mini_init, here:

1172    // set up the mini allocator
1173    mini_init(mini,
1174              cc,
1175              cfg->data_cfg,
1176              root.addr + btree_page_size(cfg),
1177              0,
1178              BTREE_MAX_HEIGHT,
1179              type,
1180              type == PAGE_TYPE_BRANCH);
1181
1182    return root.addr;

BTREE_MAX_HEIGHT is #defined to 8, which is consistent with the value of num_batches=8 seen when the assert trips.

The text was updated successfully, but these errors were encountered:

This commit adds a stress test to exercise large #s of inserts, from a main connection to Splinter, or from multiple threads. Some test-configuration is provided o --num-inserts; default 20 Million o --num-threads default 8, each thread inserts --num-insert KV-pairs All combinations of seq/random keys/value-pairs are supported. Currently, no new bugs were discovered with this test. Usage: ./unit/large_inserts_bugs_stress_test [--num-inserts <x-Millions>] [--num-threads 10] [--verbose-progress] One test case, currently skipped, caught issue #458.

gapisback · 2023-03-08T20:07:16Z

The specific assertion failure unearthed by this new test case has already been fixed in /main under:

fde3643 Rob Johnson 4 months ago Wed, 16-Nov-2022, 11:27:44 AM (Authored: Wed, 16-Nov-2022, 11:27:44 AM)
 port over mini_allocator assertion fix (#476)

An attempt was made to re-run this new test case with latest code from /main @ SHA 4b6e0b1 and several new issues were discovered. Those are being triaged and fixed one-at-a-time under issue #545.

So, closing this issue out and will track the test-case stabilization and eventually hopefully integration to /main under that newer issue.

Makefile fix for new test file.

- Minor off-by-1 fix in btree_pack_can_fit_tuple() - Fix page-addr computation in routing_filter_prefetch() Test moves along further, but still runs into other failures.

…serts() to run cleanly. Some attempted fixes to get test case to run cleanly: test_seq_htobe32_key_random_6byte_values_inserts() - Minor off-by-1 fix in btree_pack_can_fit_tuple() - Fix page-addr computation in routing_filter_prefetch() Test moves along further, but still runs into other failures.

…efetch()

(#458) Attempted fixes to get test_issue_458_mini_destroy_unused_debug_assert() to run cleanly. Test case was written to repro issue #458, but is still unstable. Some attempted fixes to get it to run cleanly. - Minor off-by-1 fix in btree_pack_can_fit_tuple() - Fix page-addr computation in routing_filter_prefetch() Test moves along further, but still runs into other failures. Re-enable test_seq_key_seq_values_inserts_threaded(); seems to work. Revisit. Add test_seq_htobe32_key_random_6byte_values_inserts() to repro PG-assertion. Add a new test case to load int32 key stored in htobe32() format, with a randomly generated 6-byte value. This reproduces the segv (release build) and assertion seen in Postgres benchmarking. Consistently use CTEST_LOG_INFO() rather than platform_default_log() Skip some tests that are failing. Revert fix in btree_pack_can_fit_tuple() - clang-format fixes. Change debug_assert() to platform_assert()

(#458) Attempted fixes to get test_issue_458_mini_destroy_unused_debug_assert() to run cleanly. Test case was written to repro issue #458, but is still unstable. Some attempted fixes to get it to run cleanly. - Minor off-by-1 fix in btree_pack_can_fit_tuple() - Fix page-addr computation in routing_filter_prefetch() Test moves along further, but still runs into other failures. Re-enable test_seq_key_seq_values_inserts_threaded(); seems to work. Revisit. Add test_seq_htobe32_key_random_6byte_values_inserts() to repro PG-assertion. Add a new test case to load int32 key stored in htobe32() format, with a randomly generated 6-byte value. This reproduces the segv (release build) and assertion seen in Postgres benchmarking. Consistently use CTEST_LOG_INFO() rather than platform_default_log() Skip some tests that are failing. Revert fix in btree_pack_can_fit_tuple() - clang-format fixes. Change debug_assert() to platform_assert() - Test has been renamed to large_inserts_stress_test

…int object mgmt. Stabilize large inserts stress tests. Skip some failing cases. (#458) Attempted fixes to get test_issue_458_mini_destroy_unused_debug_assert() to run cleanly. Test case was written to repro issue #458, but is still unstable. Some attempted fixes to get it to run cleanly. - Minor off-by-1 fix in btree_pack_can_fit_tuple() - Fix page-addr computation in routing_filter_prefetch() Test moves along further, but still runs into other failures. Re-enable test_seq_key_seq_values_inserts_threaded(); seems to work. Revisit. Add test_seq_htobe32_key_random_6byte_values_inserts() to repro PG-assertion. Add a new test case to load int32 key stored in htobe32() format, with a randomly generated 6-byte value. This reproduces the segv (release build) and assertion seen in Postgres benchmarking. Consistently use CTEST_LOG_INFO() rather than platform_default_log() Skip some tests that are failing. Revert fix in btree_pack_can_fit_tuple() - clang-format fixes. Change debug_assert() to platform_assert() - Test has been renamed to large_inserts_stress_test -- Add more diags to track shm-free calls. Rework large inserts stress test cases. This commit is an expedition to triage better shmem OOMs that we are routinely seeing in few of the test cases in large_inserts_stress_test.c . These used to work just fine with 4 GiB shmem in dev branch, but now even with 8 GiB shmem we are running out of memory. Some tracing changes are added to pass-down func/line# where alloc / free is occurring from. And to better report the call-site-info where large-fragments' handling occurs. No real logic change; tests are still unstable. Few cases are marked SKIP, to reduce noise during debugging. -- Minor rework of alloc/free interfaces to use PROCESS_PRIVATE_HEAP_ID -- Change free methods to track 'size' of memory chunk being freed. Plumb thru various free()-related interfaces tracking the 'size' of the memory fragment being freed. Track distribution of # of free calls for memory fragments of diff sizes. Enhance OOM-diags to report these metrics. We can now see that large # of small <= 32-byte fragments are being allocated and freed. Fix platform_realloc() interface, to return new size of aligned memory fragment. This way, when writable_buffer_ensure_space() invokes it, we do proper tracking of the 64-byte memory fragment, rather than mis-tracking it as a 32-byte fragment. - Fix bug in releasing memory from task_system. - Correct couple of uses of platform_free() to now use _free_mem(). - Fix one more case of free_mem() for fingerprint array. -- Implement small free fragment recycling. Add tracing. Use mutex v/s spinlock. This commit adds support for tracking small free fragments in lists, with a fragment header tracking {next, size} info. Add diagnostics to track free fragment usage metrics. Change from use of spinlock to a mutex to protect shared memory stats / lists mgmt. (Otherwise, for multi-threaded test runs, we were running into CPU pegged situation.) -- Suppress assert on size > 0 in platform_shm_track_free() -- Add platform_mem_frag{}, passed to platform_free() to free memory objects This commit starts to bring back the interface to platform_free() and keep platform_free_mem() as an internal interface. Callers that allocate memory fragment for an opaque type, say an array of n-bytes, will now have to package their memory fragment's info in a new platform_mem_frag{} struct. platform_free() is enhanced to recognize an object of this type, and will do the necessary unpacking to call the lower free methods. Fix a bug in trunk_unmount() to free memory for stats-fragments. -- Percolate errors from shared memory destroy all the way to splinterdb_close() As part of dismantling the shared segment, we [already] check to see if there are "un-freed" large-fragments lying around. This indicates some code-flow / logic error in use of platform_free() or platform_free_mem() interfaces. This commit adds the plumbing to percolate this error condition all the way up the top to splinterdb_close(). That has been changed from a void function to one returning platform_status. Other minor fixes, comment cleanup. -- Update few unit-tests to check for rc from splinterdb_close(). -- Re-enable few cases in large_inserts_stress_test. Bump up cache to 4GiB, drop shmem to 1GiB -- Fix to get large_inserts_stress test to run in MSAN builds. -- Implement fingerprint object mgmt apis. Test seems to work. -- Fix call to realloc() that trips assertions. Tighten tests to check rc. Update few unit-tests to check rc from platform_heap_destroy(). If any test does not dismantle sub-systems, test run with --use-shmem will trip up as platform_heap_destroy() will return a failure. -- Fix failing unit-tests. Tighten platform_free() to require non-NULL ptr to free. -- Make fp_array{} a derived object from platform_mem_frag{} To eliminate use of platform_free_mem() interface, define fingerprint fp_array{} as an object extended from a memory fragment. Rework interfaces. Add alias refcount diagnostics. -- Rename to platform_memfrag(). Add more tracing. Fix bugs. -- Purge all refs to platform_free_mem(). Replace with mem_frag{}. Needs to be tested / debugged. -- Rework stats-gathering struct. Fix one debug tests bug. -- Add repro test_fp_num_tuples_out_of_bounds_bug_trunk_build_filters() and instrumentation. Fix failing assert from trunk_build_filters(); Code reorgn issue. -- Fix frags_inuse and frags_inuse_HWM accounting bugs. -- Fix assertion failure in trunk_build_filters() -> fingerprint_unalias(). Consolidate outputs when asserts in fingerprint methods trip.

…int object mgmt. Stabilize large inserts stress tests. Skip some failing cases. (#458) Attempted fixes to get test_issue_458_mini_destroy_unused_debug_assert() to run cleanly. Test case was written to repro issue #458, but is still unstable. Some attempted fixes to get it to run cleanly. - Minor off-by-1 fix in btree_pack_can_fit_tuple() - Fix page-addr computation in routing_filter_prefetch() Test moves along further, but still runs into other failures. Re-enable test_seq_key_seq_values_inserts_threaded(); seems to work. Revisit. Add test_seq_htobe32_key_random_6byte_values_inserts() to repro PG-assertion. Add a new test case to load int32 key stored in htobe32() format, with a randomly generated 6-byte value. This reproduces the segv (release build) and assertion seen in Postgres benchmarking. Consistently use CTEST_LOG_INFO() rather than platform_default_log() Skip some tests that are failing. Revert fix in btree_pack_can_fit_tuple() - clang-format fixes. Change debug_assert() to platform_assert() - Test has been renamed to large_inserts_stress_test -- Add more diags to track shm-free calls. Rework large inserts stress test cases. This commit is an expedition to triage better shmem OOMs that we are routinely seeing in few of the test cases in large_inserts_stress_test.c . These used to work just fine with 4 GiB shmem in dev branch, but now even with 8 GiB shmem we are running out of memory. Some tracing changes are added to pass-down func/line# where alloc / free is occurring from. And to better report the call-site-info where large-fragments' handling occurs. No real logic change; tests are still unstable. Few cases are marked SKIP, to reduce noise during debugging. -- Minor rework of alloc/free interfaces to use PROCESS_PRIVATE_HEAP_ID -- Change free methods to track 'size' of memory chunk being freed. Plumb thru various free()-related interfaces tracking the 'size' of the memory fragment being freed. Track distribution of # of free calls for memory fragments of diff sizes. Enhance OOM-diags to report these metrics. We can now see that large # of small <= 32-byte fragments are being allocated and freed. Fix platform_realloc() interface, to return new size of aligned memory fragment. This way, when writable_buffer_ensure_space() invokes it, we do proper tracking of the 64-byte memory fragment, rather than mis-tracking it as a 32-byte fragment. - Fix bug in releasing memory from task_system. - Correct couple of uses of platform_free() to now use _free_mem(). - Fix one more case of free_mem() for fingerprint array. -- Implement small free fragment recycling. Add tracing. Use mutex v/s spinlock. This commit adds support for tracking small free fragments in lists, with a fragment header tracking {next, size} info. Add diagnostics to track free fragment usage metrics. Change from use of spinlock to a mutex to protect shared memory stats / lists mgmt. (Otherwise, for multi-threaded test runs, we were running into CPU pegged situation.) -- Suppress assert on size > 0 in platform_shm_track_free() -- Add platform_mem_frag{}, passed to platform_free() to free memory objects This commit starts to bring back the interface to platform_free() and keep platform_free_mem() as an internal interface. Callers that allocate memory fragment for an opaque type, say an array of n-bytes, will now have to package their memory fragment's info in a new platform_mem_frag{} struct. platform_free() is enhanced to recognize an object of this type, and will do the necessary unpacking to call the lower free methods. Fix a bug in trunk_unmount() to free memory for stats-fragments. -- Percolate errors from shared memory destroy all the way to splinterdb_close() As part of dismantling the shared segment, we [already] check to see if there are "un-freed" large-fragments lying around. This indicates some code-flow / logic error in use of platform_free() or platform_free_mem() interfaces. This commit adds the plumbing to percolate this error condition all the way up the top to splinterdb_close(). That has been changed from a void function to one returning platform_status. Other minor fixes, comment cleanup. -- Update few unit-tests to check for rc from splinterdb_close(). -- Re-enable few cases in large_inserts_stress_test. Bump up cache to 4GiB, drop shmem to 1GiB -- Fix to get large_inserts_stress test to run in MSAN builds. -- Implement fingerprint object mgmt apis. Test seems to work. -- Fix call to realloc() that trips assertions. Tighten tests to check rc. Update few unit-tests to check rc from platform_heap_destroy(). If any test does not dismantle sub-systems, test run with --use-shmem will trip up as platform_heap_destroy() will return a failure. -- Fix failing unit-tests. Tighten platform_free() to require non-NULL ptr to free. -- Make fp_array{} a derived object from platform_mem_frag{} To eliminate use of platform_free_mem() interface, define fingerprint fp_array{} as an object extended from a memory fragment. Rework interfaces. Add alias refcount diagnostics. -- Rename to platform_memfrag(). Add more tracing. Fix bugs. -- Purge all refs to platform_free_mem(). Replace with mem_frag{}. Needs to be tested / debugged. -- Rework stats-gathering struct. Fix one debug tests bug. -- Add repro test_fp_num_tuples_out_of_bounds_bug_trunk_build_filters() and instrumentation. Fix failing assert from trunk_build_filters(); Code reorgn issue. -- Fix frags_inuse and frags_inuse_HWM accounting bugs. -- Fix assertion failure in trunk_build_filters() -> fingerprint_unalias(). Consolidate outputs when asserts in fingerprint methods trip. -- Rework fingerprint unalias'ing in trunk_bundle_build_filters(); clang compile fails w/ older code.

gapisback added the bug Something isn't working label Oct 13, 2022

gapisback mentioned this issue Oct 13, 2022

Signal in trunk_should_reclaim_space() -> allocator_in_use() due to NULL virtual 'in_use' function pointer. #457

Closed

gapisback added a commit that referenced this issue Oct 13, 2022

Add large_inserts_bugs_stress_test.c, for issue #458.

df8be15

gapisback mentioned this issue Oct 28, 2022

Assertion failure while running multi-thread concurrent inserts workload: trunk_split_leaf(): "(num_leaves + trunk_num_pivot_keys(spl, parent) <= spl->cfg.max_pivot_keys)" #467

Open

gapisback mentioned this issue Nov 6, 2022

Enable use of background threads in Splinter. Engage tests to verify basic stability of this existing feature. #469

Merged

gapisback assigned rtjohnso Nov 15, 2022

gapisback mentioned this issue Mar 2, 2023

Assertion failure routing_filter_prefetch() -> cache_get(), reporting invalid input page addr. #545

Closed

gapisback closed this as completed Mar 8, 2023

gapisback added a commit that referenced this issue Mar 24, 2023

Add large_inserts_bugs_stress_test.c, for issue #458.

287d37b

Makefile fix for new test file.

gapisback mentioned this issue Mar 29, 2023

Debug assert fails in routing_filter_add(), "(index_no / addrs_per_page < pages_per_extent)": Causes large inserts workload to fail. #560

Open

gapisback added a commit that referenced this issue Mar 29, 2023

(#458) Partial back-port of SHA 4257e30; Fix bug in routing_filter_pr…

b95cb26

…efetch()

gapisback added a commit that referenced this issue Mar 29, 2023

(#458) Partial back-port of SHA 4257e30; Fix bug in routing_filter_pr…

54061c8

…efetch()

gapisback mentioned this issue Apr 5, 2023

Arithmetic error in computation of page-addr in routing_filter_prefetch() causes debug-assert, or can lead to unending looping hang. #561

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Debug Assertion failure in btree_pack_post_loop() -> mini_destroy_unused() reporting mismatch in (mini->num_extents == mini->num_batches) #458

Debug Assertion failure in btree_pack_post_loop() -> mini_destroy_unused() reporting mismatch in (mini->num_extents == mini->num_batches) #458

gapisback commented Oct 13, 2022 •

edited

Loading

gapisback commented Mar 8, 2023

Debug Assertion failure in btree_pack_post_loop() -> mini_destroy_unused() reporting mismatch in (mini->num_extents == mini->num_batches) #458

Debug Assertion failure in btree_pack_post_loop() -> mini_destroy_unused() reporting mismatch in (mini->num_extents == mini->num_batches) #458

Comments

gapisback commented Oct 13, 2022 • edited Loading

Describe the bug

Reproduction steps

Expected behavior

Additional context

gapisback commented Mar 8, 2023

gapisback commented Oct 13, 2022 •

edited

Loading