[Elixir/phoenix] Implementing suggestions from @josevalim #9302
Conversation
This is beautiful, thank you for working on it! ❤️ The only other feedback I have, which we should explore in a separate pull request anyway, is to increase the pool size. For multiple queries, we are not getting more performance from high concurrency numbers, and I assume this is either because our pool size is too small (we use 50, some of the Rust repos use 1024) or our pooling is a bottleneck (it is a single process). I am away from my computer with 10 cores, so when I am back home, I want to run some benchmarks on top of Ecto pooling and try to get some numbers. In particular, I may want to add multiple pools inside Ecto, and then we can try running 8 pools with 32 connections each or similar. But we should probably wait for this PR to be merged and see a trial run, so we can see baseline numbers. :) If you want, I can keep you posted on the Ecto efforts. Have a fantastic week!
@srcrip I just wanted to drop a note that this PR was already in flight to do this.
Correct. transaction is checkout+begin+commit, but if you are only doing reads, you don't need begin+commit. :)
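For illustration, a minimal sketch of that idea, assuming an Ecto repo module named `Repo` and the benchmark's `World` schema (both names are placeholders for whatever the app actually uses):

```elixir
# Sketch: fetch several random rows on one checked-out connection,
# skipping the BEGIN/COMMIT round trips a transaction would add.
ids = Enum.map(1..20, fn _ -> :rand.uniform(10_000) end)

worlds =
  Repo.checkout(fn ->
    Enum.map(ids, fn id -> Repo.get!(World, id) end)
  end)
```

All of the reads reuse the same connection for the duration of the function, without any transaction bookkeeping.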
I have added multi-pool support to Ecto. My suggestion is that, once this PR is merged and we measure its new baseline, we should update the pool configuration.
That is exactly 512 connections. We can also try other combinations.
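For illustration only, one combination that yields 512 total connections, assuming the multi-pool support is exposed as a `:pool_count` repo option (app and repo names below are placeholders):

```elixir
# config/runtime.exs (sketch): 8 pools x 64 connections each = 512 total.
import Config

config :my_app, MyApp.Repo,
  pool_size: 64,
  pool_count: 8
```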
Thank you @atavistock and @josevalim, yes that makes sense. I also agree that we probably need to increase the pool size. When I made some initial improvements, I got way better results on my machine than in the last runs on https://tfb-status.techempower.com/, and I'm not really sure why. If you look at the results there, it seems like it's gotten worse over the past few runs.
@NateBrady23 I think this one should be good to merge, and we'll have a follow-up based on the Ecto changes that @josevalim did.
@srcrip some numbers have come out for using transactions and it already does better, so I am sure we are on the right track: https://www.techempower.com/benchmarks/#section=test&runid=176ba510-3607-4faa-996e-74f0778b88d4&hw=ph&test=query Btw, it seems the plaintext benchmark is failing on the continuous benchmarks. Any idea why?
      returning: false
    )

    json(conn, world_updates)
  end

  def plaintext(conn, _params) do
    text(conn, "Hello, World!")
text/2 includes "charset=utf-8" in the content type, and the other benchmarks don't seem to do that (and this will matter for the plaintext numbers). So it probably makes sense for us to set the content-type explicitly instead:

    text(conn, "Hello, World!")

becomes:

    conn
    |> put_resp_header("content-type", "text/plain")
    |> send_resp(200, "Hello, World!")
WDYT?
Makes sense to me. Adding that now.
@josevalim I think there's something up with the compression headers on the plaintext benchmark. I tried to properly disable it in Bandit, as that is what is required, but I'm not sure if I made a mistake in the implementation. Edit: ahhh, I didn't see your comment about the charset thing. That makes a lot more sense.
@srcrip I double-checked your compression configs and they look right to me (the one inside ...).
Okay, I've also pulled the version of Ecto with the multi-pool support. I tested different permutations of count and size, and arrived at 24 pools with 64 connections each, which seems to give the best performance within the connection limit for the Docker container. Having fewer pools started to create contention for connections; having more pools seemed to reduce throughput. I also tested fewer overall connections (16/64 and 32/32), but 24/64 seemed to be the right balance on my MacBook M2.
How many cores do you have on your M2? I am thinking that for TechEmpower, we probably want the pool count to be double the number of cores (in their case, that would be 56), with 15 connections on each?
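A quick sketch of that sizing rule (pool count = double the core count, 15 connections per pool), again assuming a `:pool_count` option and placeholder app/repo names:

```elixir
# config/runtime.exs (sketch): derive pool count from the cores available at boot.
import Config

config :my_app, MyApp.Repo,
  pool_count: System.schedulers_online() * 2,  # e.g. 56 on a 28-core server
  pool_size: 15                                # connections per pool
```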
@NateBrady23 I've seen some links to run ids. Do those include these pull requests, and how can I find the most current one?
@atavistock This will be included in the next full run that starts on https://tfb-status.techempower.com
Awesome job @atavistock and @srcrip, Phoenix is doing much better in the benchmarks now! 🎉 In some of them we are putting up almost 10x what we got before! https://www.techempower.com/benchmarks/#section=test&runid=1300aee6-f9c9-42a2-8d17-252ba597202f&hw=ph&test=db Although it unfortunately still fails at the plaintext one... 🤔
Good progress!

So first the good news: I can reproduce the plaintext error locally. Now the bad news: the problem seems to be within Bandit.

Testing with ApacheBench and 160 concurrent workers, 10,000 connections, and zero throttling:

Cowboy (via Plug.Cowboy) starts to slow down noticeably almost immediately, and after 2,000 requests it is crawling. To its credit, it does eventually return a valid response for every request, with a p95 of 14 seconds.

In contrast, Bandit is clearly faster at lower numbers and just blazes through requests, but around 7,000 requests it completely collapses and stops accepting new connections for 30+ seconds or so.

The largest test here goes up to 256 workers with 10 threads and sends over 50,000 requests, which is much more than my little load test.

I am filing an issue with Bandit on this and am open to any ideas on how to help resolve it.
Great digging! Mat (from Bandit) is super responsive, so please open up an issue and share it with us here (or ping me there, I will be glad to contribute).
@atavistock Btw, do you think it is worth doing a run with ...?
@josevalim Still trying to figure out why the ...
Awesome, thank you for your continuous effort on this!
@josevalim / @srcrip / @mtrudel Just thought it was worth pointing out that the new public benchmarks are out: https://www.techempower.com/benchmarks/#hw=ph&test=composite&section=data-r23
Those are great improvements across the board. Fantastic work @atavistock and everyone involved!
Much better! I do wonder, @josevalim and @atavistock, if there's still something slowing down the plaintext benchmark though. One of the Rails benchmarks is just slightly outperforming the Phoenix+Bandit one, so it seems like maybe there's still something happening there?
I see that. The code feels like a bit of a cheat, as they're not really even using Rails, but routing to a proc that returns a canned Rack response. They're doing the same for the JSON test (see their routes.rb).
I dropped a comment on the other PR asking for some clarification. If it is ok to forgo the Rails request/response objects, then we can likely do something similar and use canned responses from Bandit.
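If that turns out to be acceptable, a hypothetical Elixir equivalent could be a bare plug that short-circuits the plaintext route before it ever reaches the Phoenix router (the module name and path below are made up for illustration, not what this PR implements):

```elixir
defmodule Bench.CannedPlaintext do
  @moduledoc "Sketch: answer /plaintext with a canned response, bypassing the router."
  @behaviour Plug
  import Plug.Conn

  @impl true
  def init(opts), do: opts

  @impl true
  def call(%Plug.Conn{request_path: "/plaintext"} = conn, _opts) do
    conn
    |> put_resp_header("content-type", "text/plain")
    |> send_resp(200, "Hello, World!")
    |> halt()
  end

  def call(conn, _opts), do: conn
end
```

Plugged into the endpoint ahead of the router, this would behave much like the routes.rb trick above.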
Conversation in #9198 led to some meaningful optimizations.
Specifically: ...