src: avoid allocation in the Size method for BASE64URL and BASE64 #53550

lemire · 2024-06-22T17:26:39Z

For large base64 strings, this PR multiplies the performance of Buffer.from(..., "base64"); by making the Size function simpler and non-allocating.

This will reduce the gap with Bun from being over 3x slower to being about only 40% slower for large inputs. For modest inputs, Node.js is still about 2x slower than Bun. Note that both Bun and Node.js use the same underlying library (simdutf) for base64 decoding so any difference is entirely due to the runtime (not to the base64 decoding per se).

Benchmark (from bun):

import { bench, run } from "mitata";
function makeBenchmark(size, isToString) {
  const base64Input = Buffer.alloc(size, "latin1").toString("base64");
  const base64From = Buffer.from (base64Input, "base64");
  bench(`Buffer. from(${size} bytes, 'base64')`, () => {
      Buffer.from(base64Input, "base64");
  });
}
[128, 1024, 32 * 1024, 1024 * 1024 * 8]. forEach (s => makeBenchmark(s, false)) ;
await run();

Node.js 22:

cpu: Apple M2
runtime: node v22.3.0 (arm64-darwin)

benchmark                                  time (avg)             (min … max)       p75       p99      p999
----------------------------------------------------------------------------- -----------------------------
Buffer. from(128 bytes, 'base64')         110 ns/iter       (101 ns … 457 ns)    110 ns    140 ns    208 ns
Buffer. from(1024 bytes, 'base64')        306 ns/iter       (239 ns … 453 ns)    323 ns    426 ns    453 ns
Buffer. from(32768 bytes, 'base64')     9'087 ns/iter     (7'333 ns … 317 µs)  8'709 ns 27'750 ns 51'041 ns
Buffer. from(8388608 bytes, 'base64')   3'273 µs/iter   (3'144 µs … 3'564 µs)  3'387 µs  3'546 µs  3'564 µs

Node.js with this PR:

cpu: Apple M2
runtime: node v23.0.0-pre (arm64-darwin)

benchmark                                  time (avg)             (min … max)       p75       p99      p999
----------------------------------------------------------------------------- -----------------------------
Buffer. from(128 bytes, 'base64')         111 ns/iter       (101 ns … 470 ns)    112 ns    145 ns    224 ns
Buffer. from(1024 bytes, 'base64')        308 ns/iter       (256 ns … 475 ns)    312 ns    452 ns    475 ns
Buffer. from(32768 bytes, 'base64')     7'384 ns/iter  (7'091 ns … 12'399 ns)  7'268 ns 11'472 ns 12'399 ns
Buffer. from(8388608 bytes, 'base64')   1'431 µs/iter   (1'358 µs … 2'034 µs)  1'453 µs  1'627 µs  2'034 µs

Bun canary (upcoming release)

cpu: Apple M2
runtime: bun 1.1.16 (arm64-darwin)

benchmark                                  time (avg)             (min … max)       p75       p99      p999
----------------------------------------------------------------------------- -----------------------------
Buffer. from(128 bytes, 'base64')         115 ns/iter   (89.42 ns … 1'054 ns)  92.67 ns    880 ns  1'034 ns
Buffer. from(1024 bytes, 'base64')        226 ns/iter       (174 ns … 923 ns)    184 ns    720 ns    913 ns
Buffer. from(32768 bytes, 'base64')     4'045 ns/iter   (3'904 ns … 4'504 ns)  4'083 ns  4'404 ns  4'504 ns
Buffer. from(8388608 bytes, 'base64')     998 µs/iter     (861 µs … 1'590 µs)  1'165 µs  1'460 µs  1'590 µs

To get the Bun results, you need the canary which you may get with bun upgrade --canary.

lemire · 2024-06-22T17:47:01Z

For people who care about history, Size has been expensive since the very beginning (we trace it back to the initial commit by @isaacs in 2013):

     case BASE64: {
       String::AsciiValue value(str);
       data_size = base64_decoded_size(*value, value.length());
       break;
     }

mcollina

lgtm

mildsunrise · 2024-06-22T22:02:18Z

for the record, Size() wasn't just doing an unnecessary allocation, it was also returning the wrong size, causing us to allocate a much larger buffer than necessary. simdutf::base64_length_from_binary calculates the length after encoding; what we need here is the reverse. @lemire has replaced the code with this calculation:

str->Length() % 4 <= 1 ? str->Length() / 4 * 3
    : str->Length() / 4 * 3 + (str->Length() % 4) - 1)

which is only an upper bound, as it doesn't look at the actual data, and so it assumes the worst case (all characters are data characters, i.e. no padding or whitespace). the math looks correct to me.

bear in mind that while Size() is allowed to return an upper bound, it is expected to return an exact prediction most of the time. if this is not the case, we do another allocation + memcpy to a new backing store with the actual size. for base64url the prediction is usually correct as it rarely has padding, but for base64 it will miss 2/3 of the time (or always, if the input contains whitespace).

lemire · 2024-06-22T22:08:47Z

@mildsunrise

As far as I can tell, Node never returned the exact size. Doing so requires scanning the entire input, checking for characters to discard.

mildsunrise · 2024-06-22T22:09:44Z

wasn't base64_decoded_size supposed to do exactly that?

jasnell · 2024-09-08T03:40:23Z

PR is currently blocked from landing due to unreliable CI

lemire · 2024-09-08T14:34:52Z

@jasnell Indeed.

anonrig · 2024-09-08T15:44:25Z

@lemire can you rebase and force-push if you don't mind?

lemire · 2024-09-08T17:34:06Z

@anonrig I synced.

lemire · 2024-09-08T21:51:05Z

@anonrig Looks like it is turning green... what did you do???? ❤️

anonrig · 2024-09-08T21:58:07Z

@anonrig Looks like it is turning green... what did you do???? ❤️

set bunch of tests as flaky - #54802 🫡

lemire · 2024-09-09T13:26:28Z

@anonrig It still won't complete the tests though.

nodejs-github-bot · 2024-09-09T13:43:55Z

CI: https://ci.nodejs.org/job/node-test-pull-request/62191/

lemire · 2024-09-09T20:37:32Z

@anonrig Stuck.

anonrig · 2024-09-09T20:39:10Z

@anonrig Stuck.

It seems that all macOS machines are down/offline at the moment. nodejs/build#3887

nodejs-github-bot · 2024-09-10T01:25:21Z

CI: https://ci.nodejs.org/job/node-test-pull-request/62227/

lemire · 2024-09-10T15:48:54Z

@anonrig This will never go through, will it?

nodejs-github-bot · 2024-09-11T14:11:54Z

CI: https://ci.nodejs.org/job/node-test-pull-request/62322/

aduh95 · 2024-09-17T15:14:31Z

This needs a rebase.

lemire · 2024-09-17T15:26:33Z

Made obsolete by 8191e1f (@anonrig).

I am dropping this PR.

anonrig · 2024-09-17T15:32:35Z

This PR avoids calling simdutf on base64 encodings. It is not obsolete!

lemire · 2024-09-17T16:05:52Z

@anonrig Here is what the PR was...

It looks like this was resolved in...

8191e1f

lemire · 2024-09-17T16:06:15Z

@anonrig It is possible I missed something, if so, let me know.

src: avoid allocation in the Size method for BASE64URL and BASE64

dcb9718

nodejs-github-bot added buffer Issues and PRs related to the buffer subsystem. c++ Issues and PRs that require attention from people who are familiar with C++. needs-ci PRs that need a full CI run. labels Jun 22, 2024

lemire added the request-ci Add this label to start a Jenkins CI on a PR. label Jun 22, 2024

lemire requested review from anonrig, addaleax and joyeecheung June 22, 2024 17:28

github-actions bot removed the request-ci Add this label to start a Jenkins CI on a PR. label Jun 22, 2024

This comment was marked as outdated.

Sign in to view

anonrig approved these changes Jun 22, 2024

View reviewed changes

anonrig requested review from mcollina and jasnell June 22, 2024 17:35

lemire added 2 commits June 22, 2024 14:34

simplify the function

a69442c

lint

cf1d6b9

anonrig approved these changes Jun 22, 2024

View reviewed changes

more simplification

fb76dbd

lemire requested a review from anonrig June 22, 2024 18:52

fix typo

2588eb9

anonrig approved these changes Jun 22, 2024

View reviewed changes

lemire added 2 commits June 22, 2024 15:16

explicit cast

b6a694d

lint

9ae78a4

mcollina approved these changes Jun 22, 2024

View reviewed changes

mcollina added the request-ci Add this label to start a Jenkins CI on a PR. label Jun 22, 2024

github-actions bot removed the request-ci Add this label to start a Jenkins CI on a PR. label Jun 22, 2024

This comment was marked as outdated.

Sign in to view

Merge branch 'main' into fix_problem_with_size_base64

5f44e48

This comment was marked as outdated.

Sign in to view

anonrig approved these changes Sep 8, 2024

View reviewed changes

anonrig added the request-ci Add this label to start a Jenkins CI on a PR. label Sep 8, 2024

github-actions bot removed the request-ci Add this label to start a Jenkins CI on a PR. label Sep 9, 2024

aduh95 removed the author ready PRs that have at least one approval, no pending requests for changes, and a CI started. label Sep 17, 2024

Merge branch 'main' into fix_problem_with_size_base64

3ad4996

lemire closed this Sep 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src: avoid allocation in the Size method for BASE64URL and BASE64 #53550

src: avoid allocation in the Size method for BASE64URL and BASE64 #53550

lemire commented Jun 22, 2024 •

edited

Loading

This comment was marked as outdated.

lemire commented Jun 22, 2024

mcollina left a comment

This comment was marked as outdated.

mildsunrise commented Jun 22, 2024 •

edited

Loading

lemire commented Jun 22, 2024

mildsunrise commented Jun 22, 2024

This comment was marked as outdated.

This comment was marked as outdated.

This comment was marked as outdated.

jasnell commented Sep 8, 2024

This comment was marked as outdated.

lemire commented Sep 8, 2024

anonrig commented Sep 8, 2024

lemire commented Sep 8, 2024

This comment was marked as outdated.

lemire commented Sep 8, 2024

anonrig commented Sep 8, 2024

lemire commented Sep 9, 2024

nodejs-github-bot commented Sep 9, 2024

lemire commented Sep 9, 2024

anonrig commented Sep 9, 2024

nodejs-github-bot commented Sep 10, 2024

lemire commented Sep 10, 2024

nodejs-github-bot commented Sep 11, 2024

aduh95 commented Sep 17, 2024

lemire commented Sep 17, 2024

anonrig commented Sep 17, 2024

lemire commented Sep 17, 2024

lemire commented Sep 17, 2024

src: avoid allocation in the Size method for BASE64URL and BASE64 #53550

src: avoid allocation in the Size method for BASE64URL and BASE64 #53550

Conversation

lemire commented Jun 22, 2024 • edited Loading

This comment was marked as outdated.

lemire commented Jun 22, 2024

mcollina left a comment

Choose a reason for hiding this comment

This comment was marked as outdated.

mildsunrise commented Jun 22, 2024 • edited Loading

lemire commented Jun 22, 2024

mildsunrise commented Jun 22, 2024

This comment was marked as outdated.

This comment was marked as outdated.

This comment was marked as outdated.

jasnell commented Sep 8, 2024

This comment was marked as outdated.

lemire commented Sep 8, 2024

anonrig commented Sep 8, 2024

lemire commented Sep 8, 2024

This comment was marked as outdated.

lemire commented Sep 8, 2024

anonrig commented Sep 8, 2024

lemire commented Sep 9, 2024

nodejs-github-bot commented Sep 9, 2024

lemire commented Sep 9, 2024

anonrig commented Sep 9, 2024

nodejs-github-bot commented Sep 10, 2024

lemire commented Sep 10, 2024

nodejs-github-bot commented Sep 11, 2024

aduh95 commented Sep 17, 2024

lemire commented Sep 17, 2024

anonrig commented Sep 17, 2024

lemire commented Sep 17, 2024

lemire commented Sep 17, 2024

lemire commented Jun 22, 2024 •

edited

Loading

mildsunrise commented Jun 22, 2024 •

edited

Loading