Investigate unstable benchmark results on macOS #1648

andygrove · 2025-04-14T20:46:19Z

Describe the bug

I am running TPC-H benchmarks on macOS using the instructions in #1647.

q13 sometimes takes 7 seconds and sometimes takes more than 5 minutes when Comet is enabled. I do not see any spilling. I am testing with Comet with the DataFusion 47 upgrade from #1563

Steps to reproduce

No response

Expected behavior

No response

Additional context

No response

andygrove · 2025-04-14T21:03:16Z

This comparison of two runs suggests that the issue is with a hash aggregate.

mbutrovich · 2025-04-14T21:04:04Z

I wonder if it's related to the Vec resize issues that @Kontinuation was seeing in #1511 (comment)

parthchandra · 2025-04-14T22:47:00Z

Why wouldn't we see the issue every time?

andygrove · 2025-04-15T15:30:11Z

The issue seems at least partly due to a mix of performance and efficiency cores being used, which we do not have any control over.

andygrove · 2025-04-15T15:31:39Z

thermal management (CPU throttling) could also be a factor

andygrove · 2025-04-15T16:41:40Z

I switched from 100 GB to 10 GB data set and the situation is even worse. q13 hangs and there are no stats at all in the plan.

edit: it did eventually complete, and another run was fast, so this is just more of the same instability

kazuyukitanimura · 2025-04-17T20:23:03Z

The plan is to put a disclaimer

mbutrovich · 2025-04-21T20:11:54Z

I profiled it and we're getting crushed in OS mutexes in the allocator. I noticed there's some support for mimalloc in Comet already that's not really documented. DF enables it by default for its benchmarks. I am testing a build with make release COMET_FEATURES=mimalloc to see if the issue goes away.

mbutrovich · 2025-04-21T20:54:00Z

Anecdotally, my performance issues on macOS disappear when using mimalloc. We should have a larger discussion about where third-party allocators fit into the picture of optimizing Comet (off-heap, etc.).

parthchandra · 2025-04-22T00:00:04Z

That's a great find. Arrow Cpp discussion: https://lists.apache.org/thread/dts9ggvkthczfpmd25wrz449mxod76o2
Also, I always thought we were using mimalloc (but never actually checked)!

andygrove · 2025-04-22T14:06:45Z

For my local Linux benchmark, I saw performance improve from ~275 s to ~266 s when using mimalloc

Dandandan · 2025-04-22T14:49:57Z

FYI: We found mimalloc to be unstable for running long term (keeps memory allocated but seems doesn't release it over time => running into OOMs much more easily). Switching to jemalloc fixed this.

Kontinuation · 2025-04-22T16:10:41Z

We prefer to use jemalloc with Comet by setting LD_PRELOAD, as it returns freed memory to the operating system more aggressively than ptmalloc, resulting in a more predictable RSS for the Spark executor process. Additionally, jemalloc's memory profiler is helpful for diagnosing OOM issues.

However, I'm uncertain whether jemalloc can still be used with LD_PRELOAD when Comet is compiled with mimalloc. The last time I attempted to dynamically override malloc it didn't work.

mbutrovich · 2025-04-22T16:25:57Z

I've also only ever used jemalloc in the past. I'm not sure what the discussion was at the time to pursue mimalloc for Comet. @mdcallag has some recent writing on the topic, althrough RocksDB is a different workload than what we're doing:

https://smalldatum.blogspot.com/2025/04/battle-of-mallocators.html
https://smalldatum.blogspot.com/2025/04/battle-of-mallocators-part-2.html

The Germans 🇩🇪 did a bake-off a few years ago and they chose jemalloc for Umbra (and likely CedarDB): https://www.adms-conf.org/2019-camera-ready/durner_adms19.pdf

mbutrovich · 2025-04-22T16:26:47Z

However, I'm uncertain whether jemalloc can still be used with LD_PRELOAD when Comet is compiled with mimalloc. The last time I attempted to dynamically override malloc it didn't work.

Yeah I don't think you want to mix and match like that.

Does LD_PRELOAD also change the allocator for the JVM?

Kontinuation · 2025-04-22T16:34:32Z

Does LD_PRELOAD also change the allocator for the JVM?

Memory allocated by Unsafe_AllocateMemory0 (Arrow native memory, Spark off-heap memory) uses the allocator. The large JVM heap regions seems to be directly allocated using syscalls such as mmap, which is not affected by allocator.

alamb · 2025-04-29T13:27:10Z

Related discord thread: https://discord.com/channels/885562378132000778/1363995762182193373

andygrove added the bug Something isn't working label Apr 14, 2025

andygrove self-assigned this Apr 14, 2025

andygrove added this to the 0.8.0 milestone Apr 14, 2025

andygrove mentioned this issue Apr 15, 2025

docs: Add instructions on running TPC-H on macOS #1647

Merged

andygrove modified the milestones: 0.8.0, 0.9.0 Apr 17, 2025

mbutrovich mentioned this issue Apr 23, 2025

feat: add jemalloc as optional custom allocator #1679

Merged

andygrove closed this as completed in #1679 Apr 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Investigate unstable benchmark results on macOS #1648

Investigate unstable benchmark results on macOS #1648

andygrove commented Apr 14, 2025

andygrove commented Apr 14, 2025

mbutrovich commented Apr 14, 2025

parthchandra commented Apr 14, 2025

andygrove commented Apr 15, 2025

andygrove commented Apr 15, 2025

andygrove commented Apr 15, 2025 •

edited

Loading

kazuyukitanimura commented Apr 17, 2025

mbutrovich commented Apr 21, 2025

mbutrovich commented Apr 21, 2025

parthchandra commented Apr 22, 2025

andygrove commented Apr 22, 2025

Dandandan commented Apr 22, 2025

Kontinuation commented Apr 22, 2025

mbutrovich commented Apr 22, 2025

mbutrovich commented Apr 22, 2025

Kontinuation commented Apr 22, 2025

alamb commented Apr 29, 2025

Investigate unstable benchmark results on macOS #1648

Investigate unstable benchmark results on macOS #1648

Comments

andygrove commented Apr 14, 2025

Describe the bug

Steps to reproduce

Expected behavior

Additional context

andygrove commented Apr 14, 2025

mbutrovich commented Apr 14, 2025

parthchandra commented Apr 14, 2025

andygrove commented Apr 15, 2025

andygrove commented Apr 15, 2025

andygrove commented Apr 15, 2025 • edited Loading

kazuyukitanimura commented Apr 17, 2025

mbutrovich commented Apr 21, 2025

mbutrovich commented Apr 21, 2025

parthchandra commented Apr 22, 2025

andygrove commented Apr 22, 2025

Dandandan commented Apr 22, 2025

Kontinuation commented Apr 22, 2025

mbutrovich commented Apr 22, 2025

mbutrovich commented Apr 22, 2025

Kontinuation commented Apr 22, 2025

alamb commented Apr 29, 2025

andygrove commented Apr 15, 2025 •

edited

Loading