Optimise shops page: Enable injected enterprise data to be scoped to specific enterprise ids #12755

johansenja · 2024-08-09T10:00:18Z

What? Why?

When visiting https://openfoodnetwork.org.uk/shops, the page load is very slow. Given that this page is linked to from the
CTA above the fold on https://openfoodnetwork.org.uk/, it seems ideal that should load quickly.

It appears that part of the problem is that the InjectionHelper (via OpenFoodNetwork::EnterpriseInjectionData) fetches information about all enterprises (even ones which don't need to be serialised).

In some cases there could be a lot of rows pulled from the DB into memory, especially in the cases where there are joins (eg. Spree::Taxon.supplied_taxons pulls in all taxons of all products for all enterprises) - and then sometimes there are application-side operations happening on them too (eg. looping or creating hashes).

I opted to make this scoping optional where possible to try to ensure that they are non-breaking for other areas

I did a bit of benchmarking locally using

ab -n 20 -c 1 -H 'Accept: text/html'  -H 'Cookie: ...' 'http://localhost:3000/shops#/'

for this branch against master, and these were the results:

master (1940ms page load average):

This is ApacheBench, Version 2.3 <$Revision: 1903618 $>
Copyright 1996 Adam Twiss, Zeus Technology Ltd, http://www.zeustech.net/
Licensed to The Apache Software Foundation, http://www.apache.org/

Benchmarking localhost (be patient).....done


Server Software:        
Server Hostname:        localhost
Server Port:            3000

Document Path:          /shops#/
Document Length:        44150 bytes

Concurrency Level:      1
Time taken for tests:   38.817 seconds
Complete requests:      20
Failed requests:        19
   (Connect: 0, Receive: 0, Length: 19, Exceptions: 0)
Total transferred:      922991 bytes
HTML transferred:       884035 bytes
Requests per second:    0.52 [#/sec] (mean)
Time per request:       1940.863 [ms] (mean)
Time per request:       1940.863 [ms] (mean, across all concurrent requests)
Transfer rate:          23.22 [Kbytes/sec] received

Connection Times (ms)
              min  mean[+/-sd] median   max
Connect:        0    0   0.0      0       0
Processing:  1697 1941 218.5   1815    2531
Waiting:     1696 1941 218.5   1815    2531
Total:       1697 1941 218.5   1815    2532

Percentage of the requests served within a certain time (ms)
  50%   1815
  66%   2015
  75%   2095
  80%   2132
  90%   2259
  95%   2532
  98%   2532
  99%   2532
 100%   2532 (longest request)

this branch (686ms page load average):

This is ApacheBench, Version 2.3 <$Revision: 1903618 $>
Copyright 1996 Adam Twiss, Zeus Technology Ltd, http://www.zeustech.net/
Licensed to The Apache Software Foundation, http://www.apache.org/

Benchmarking localhost (be patient).....done


Server Software:        
Server Hostname:        localhost
Server Port:            3000

Document Path:          /shops#/
Document Length:        43823 bytes

Concurrency Level:      1
Time taken for tests:   13.730 seconds
Complete requests:      20
Failed requests:        19
   (Connect: 0, Receive: 0, Length: 19, Exceptions: 0)
Total transferred:      914886 bytes
HTML transferred:       879896 bytes
Requests per second:    1.46 [#/sec] (mean)
Time per request:       686.478 [ms] (mean)
Time per request:       686.478 [ms] (mean, across all concurrent requests)
Transfer rate:          65.07 [Kbytes/sec] received

Connection Times (ms)
              min  mean[+/-sd] median   max
Connect:        0    0   0.2      0       1
Processing:   543  686  91.4    671     955
Waiting:      543  686  91.4    670     955
Total:        543  686  91.4    671     956

Percentage of the requests served within a certain time (ms)
  50%    671
  66%    684
  75%    745
  80%    757
  90%    814
  95%    956
  98%    956
  99%    956
 100%    956 (longest request)

What should we test?

The /shops page is the main one
The groups page (eg /groups/1#/map) - from inject_group_enterprises
The order show or edit page (eg /orders/R888114387) - from inject_enterprise_and_relatives

The other 2 pages incidentally also seem to be quicker for me, by a similar amount

Release notes

Changelog Category (reviewers may add a label for the release notes):

Technical changes only

Dependencies

Documentation updates

johansenja · 2024-08-09T10:28:29Z

spec/models/spree/taxon_spec.rb

-      let!(:p1) {
-        create(:simple_product, primary_taxon_id: t1.id, supplier_id: e.id)
-      }
+RSpec.describe Spree::Taxon do


These are a lot of whitespace changes in this file (changed from module Spree nesting to RSpec.describe Spree::Taxon to avoid linting problems) - I would recommend hiding whitespace to view, to distinguish the functional differences

rioug

That's awesome ! thanks @johansenja
Keep them coming 😄

spec/lib/open_food_network/enterprise_injection_data_spec.rb

dacook

I've often noticed how slow it was and thought "surely we can improve this..". So thanks for taking on this big task to improve the performance!

Firstly, I have a few general notes:

When adding specs to cover existing code, it's best to add them in an earlier commit (you can re-arrange commits with git). Then, when you add your changes we can more clearly see what behaviour was existing, and what features you've added.
It's worth running the linter each time you commit, so that you can catch these adjustments earlier, and keep the git history simple

I also have a suggestion, for bonus points:
We have an rspec matcher to check what DB queries are running. This seems like a great opportunity to test that it's running only the necessary queries. See here for examples, or a real example here
This could be added in a separate PR though; I think it's worth proceeding with manual testing.

app/models/spree/shipping_method.rb

spec/models/order_cycle_spec.rb

dacook · 2024-08-13T00:48:27Z

lib/open_food_network/enterprise_injection_data.rb

-      @shipping_method_services ||= CacheService.cached_data_by_class("shipping_method_services",
-                                                                      Spree::ShippingMethod) do
+      @shipping_method_services ||= CacheService.cached_data_by_class(
+        "shipping_method_services_#{@enterprise_ids.hash}",


I looked up the hash method to make sure I understood it, and discovered that it only creates a reproducible hash within the current process (ref https://stackoverflow.com/questions/2505824/how-does-rubys-string-hash-method-work)
For example:

~/projects/openfoodnetwork $ ruby -e "puts [1,2,3].hash" 1856927473914424695 ~/projects/openfoodnetwork $ ruby -e "puts [1,2,3].hash" 3622279804335312461

As far as I can tell, we have shared cache between server processes (with redis_cache_store), so the benefit of caching will be reduced. It means we'll end up with perhaps 4 times the cache entries (assuming 4 puma processes which don't often crash and restart).

Hmm it should be fine :)
It's probably more performant than generating a more persistent hash.

Ah, that's a good point. My hope here would be that caching wouldn't even be needed if fetching and processing the data is fast enough - currently, CacheService.cached_data_by_class performs an extra DB query just to generate the cache key cached_class.maximum(:updated_at).to_f; on my machine it normally takes a few ms but has taken up to 45ms (though maybe it would be more consistent with an index on updated_at), but it's a similar time to actually pulling all the rows anyway:

Though I'm sure production data would be a bit different!

As far as I can tell, this is the only code that uses CacheService.cached_data_by_class, so there could be an opportunity to simplify the CacheService if the EnterpriseInjectionData no longer needed it

Ah, I see. It seems pointless making an extra DB query to find out if we need to make the real query, when we could just make the real query. Perhaps with further optimisation, the real query might be quite efficient (I just added a comment to Spree::Taxon.supplied_taxons)

johansenja · 2024-08-13T14:02:26Z

Thanks for the review. It's useful to know about the rspec matchers, because I have a few other changes in mind which could benefit from them. With these changes, I don't think I'll be chaning the number of queries, though there are a few areas which might (see #12724 - there are definitely some n+2s in there but I think I have a better solution for it)

app/models/spree/taxon.rb

dacook · 2024-08-14T00:06:57Z

lib/open_food_network/enterprise_injection_data.rb

-      @shipping_method_services ||= CacheService.cached_data_by_class("shipping_method_services",
-                                                                      Spree::ShippingMethod) do
+      @shipping_method_services ||= CacheService.cached_data_by_class(
+        "shipping_method_services_#{@enterprise_ids.hash}",


Ah, I see. It seems pointless making an extra DB query to find out if we need to make the real query, when we could just make the real query. Perhaps with further optimisation, the real query might be quite efficient (I just added a comment to Spree::Taxon.supplied_taxons)

dacook

Great, thank you!

dacook · 2024-08-15T04:28:24Z

Just needs a conflict fix, then we can do a manual test.

drummer83 · 2024-08-18T12:44:40Z

Hi @johansenja,
There's a conflict here. Could you please have a look?
I will try to test as soon as possible then.
Thanks!

johansenja · 2024-08-19T13:36:36Z

Yup no problem, I have just rebased against master and pushed again. Many thanks!

johansenja · 2024-08-19T19:53:19Z

I am not sure about the new test failure now - I was wondering if it could be a flake, but it seems suspcious that it is in the order cycles area, though I'm not yet sure how my changes here would affect it. I'll take another look in due course

drummer83 · 2024-08-19T23:42:11Z

Ja, I guess they are flaky. It's all green now.

Co-authored-by: Gaetan Craig-Riou <[email protected]>

drummer83 · 2024-08-22T16:26:15Z

Hi @johansenja,
Thank you so much for working on the performance! It's so important, but often neglected unfortunately.
I staged your PR to our staging server and couldn't spot any issues. It's hard to tell if there is a significant improvement on the page loading time. I don't have the skills to use any tools for that.
However, I didn't find any problems, so this PR is ready to being merged.

For now I will move it to Ready To Go, to double check if we need another approval after you fixed the conflict.
@openfoodfoundation/core-devs Could you please check if another approval is required? Otherwise please go ahead and merge.

Thanks again! 🥳

mkllnk

To test performance, roughly, I logged into uk-staging and did a few requests:

time for i in `seq 10`; do curl 'https://staging.openfoodnetwork.org.uk/shops#/' > /dev/null; done

The results are between 2.5s and 3.0s for ten requests but most of the time 2.9s. That's on another pull request.

After staging this pull request, the ten requests finish in 2.5s to 2.7s. That's an improvement.

github-actions bot assigned johansenja Aug 9, 2024

johansenja force-pushed the optimise-shops-page6 branch from 1bec8fd to cdfa30c Compare August 9, 2024 10:23

johansenja commented Aug 9, 2024

View reviewed changes

rioug approved these changes Aug 12, 2024

View reviewed changes

spec/lib/open_food_network/enterprise_injection_data_spec.rb Outdated Show resolved Hide resolved

rioug added the technical changes only These pull requests do not contain user facing changes and are grouped in release notes label Aug 12, 2024

dacook self-requested a review August 13, 2024 00:15

dacook approved these changes Aug 13, 2024

View reviewed changes

dacook reviewed Aug 14, 2024

View reviewed changes

sigmundpetersen requested a review from dacook August 14, 2024 09:36

dacook approved these changes Aug 15, 2024

View reviewed changes

johansenja force-pushed the optimise-shops-page6 branch from 797afe6 to ce9c5b6 Compare August 15, 2024 13:55

johansenja requested a review from dacook August 15, 2024 13:57

dacook approved these changes Aug 16, 2024

View reviewed changes

dacook changed the title ~~Enable injected enterprise data to be scoped to specific enterprise ids~~ Optimise shops page: Enable injected enterprise data to be scoped to specific enterprise ids Aug 16, 2024

drummer83 added the feedback-needed label Aug 18, 2024

johansenja force-pushed the optimise-shops-page6 branch from ce9c5b6 to 18c3bc2 Compare August 19, 2024 13:33

sigmundpetersen removed the feedback-needed label Aug 21, 2024

johansenja and others added 7 commits August 22, 2024 17:41

Scope injected enterprise properties to specific enterprises

b73e529

Add unit tests

1b8e256

Undo minor unneeded changes

66b519b

Add enterprise_ids to cache key

96f9894

Use subject method in spec

1621f97

Co-authored-by: Gaetan Craig-Riou <[email protected]>

Add spec for confirming correct values when scoped

ce6ae04

Optimise Spree::Taxon.supplied_taxons

4718fdb

drummer83 force-pushed the optimise-shops-page6 branch from 18c3bc2 to 4718fdb Compare August 22, 2024 15:41

drummer83 self-assigned this Aug 22, 2024

drummer83 added pr-staged-au staging.openfoodnetwork.org.au and removed pr-staged-au staging.openfoodnetwork.org.au labels Aug 22, 2024

drummer83 added the feedback-needed label Aug 22, 2024

mkllnk added the pr-staged-uk staging.openfoodnetwork.org.uk label Aug 22, 2024

mkllnk approved these changes Aug 22, 2024

View reviewed changes

mkllnk merged commit 7f09044 into openfoodfoundation:master Aug 22, 2024
56 checks passed

mkllnk removed pr-staged-uk staging.openfoodnetwork.org.uk feedback-needed labels Aug 22, 2024

mkllnk mentioned this pull request Sep 20, 2024

Micromarché Hubs on coopcircuits.fr opens very slowly #12870

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimise shops page: Enable injected enterprise data to be scoped to specific enterprise ids #12755

Optimise shops page: Enable injected enterprise data to be scoped to specific enterprise ids #12755

johansenja commented Aug 9, 2024

johansenja Aug 9, 2024 •

edited

Loading

rioug left a comment

dacook left a comment

dacook Aug 13, 2024

johansenja Aug 13, 2024

dacook Aug 14, 2024

johansenja commented Aug 13, 2024

dacook Aug 14, 2024

dacook left a comment

dacook commented Aug 15, 2024

drummer83 commented Aug 18, 2024

johansenja commented Aug 19, 2024

johansenja commented Aug 19, 2024

drummer83 commented Aug 19, 2024

drummer83 commented Aug 22, 2024

mkllnk left a comment

Optimise shops page: Enable injected enterprise data to be scoped to specific enterprise ids #12755

Optimise shops page: Enable injected enterprise data to be scoped to specific enterprise ids #12755

Conversation

johansenja commented Aug 9, 2024

What? Why?

What should we test?

Release notes

Dependencies

Documentation updates

johansenja Aug 9, 2024 • edited Loading

Choose a reason for hiding this comment

rioug left a comment

Choose a reason for hiding this comment

dacook left a comment

Choose a reason for hiding this comment

dacook Aug 13, 2024

Choose a reason for hiding this comment

johansenja Aug 13, 2024

Choose a reason for hiding this comment

dacook Aug 14, 2024

Choose a reason for hiding this comment

johansenja commented Aug 13, 2024

dacook Aug 14, 2024

Choose a reason for hiding this comment

dacook left a comment

Choose a reason for hiding this comment

dacook commented Aug 15, 2024

drummer83 commented Aug 18, 2024

johansenja commented Aug 19, 2024

johansenja commented Aug 19, 2024

drummer83 commented Aug 19, 2024

drummer83 commented Aug 22, 2024

mkllnk left a comment

Choose a reason for hiding this comment

johansenja Aug 9, 2024 •

edited

Loading