This is very early in development; it only kinda works.
Currently, overhead when running with per-request isolation is about 0.1ms for trivial programs and about 0.4ms for a React server-rendering benchmark. This is an order of magnitude less than using a V8 isolate or process forking, which likely makes this the fastest JS runtime with per-request isolation for substantial programs. (WebAssembly can be faster for trivial ones.)
In the long run it's not clear yet whether this should be a custom wrapped runtime or just a Deno extension.
- The deno_runtime crate is not yet stable, which could make keeping this up to date tricky.
- However, customising v8 build settings could be useful.
- And it might need to be custom to make upstream requests using fetch work.
- Add support for basic HTTP serving using httparse to provide a better comparison running outside of TinyKVM; see the sketch after this list. (V8 will run GC on a background thread by default, so it would be helpful to have a single-threaded comparator.)
- Build/find a comparator for v8 isolates and forking.
- Work out how to do perf recording when running in TinyKVM.
- Support headers, streaming, etc.
- Make backend requests work.
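A minimal sketch of what that single-threaded comparator could look like, assuming the `httparse` crate as a dependency and a plain blocking `TcpListener` (the address and response body are placeholders, not the project's actual code):

```rust
use std::io::{Read, Write};
use std::net::TcpListener;

fn main() -> std::io::Result<()> {
    // Single thread, no background GC: requests are handled inline,
    // which makes latency numbers easier to compare against TinyKVM runs.
    let listener = TcpListener::bind("127.0.0.1:8080")?;
    for stream in listener.incoming() {
        let mut stream = stream?;
        let mut buf = [0u8; 4096];
        let n = stream.read(&mut buf)?;
        let mut headers = [httparse::EMPTY_HEADER; 32];
        let mut req = httparse::Request::new(&mut headers);
        // Respond only once the request head has parsed completely.
        if let Ok(httparse::Status::Complete(_)) = req.parse(&buf[..n]) {
            let body = b"hello";
            let head = format!(
                "HTTP/1.1 200 OK\r\nContent-Length: {}\r\n\r\n",
                body.len()
            );
            stream.write_all(head.as_bytes())?;
            stream.write_all(body)?;
        }
    }
    Ok(())
}
```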
Run with `wrk -t1 -c1 -d10s --latency`.

All runs use `DENO_V8_FLAGS=--max-old-space-size=64,--max-semi-space-size=64`.
Configuration | Mean | 50% | 99% |
---|---|---|---|
renderer.js ephemeral=true dynamic | 1.08ms | 990us | 1.93ms |
renderer.js ephemeral=true static | 0.98ms | 920us | 1.74ms |
renderer.js ephemeral=false static | 950us | 715us | 5.86ms |
deno --allow-net renderer.js | 582us | 571us | 689us |
- I guess the high variability for `ephemeral=false` is a result of single-threaded GC.
Configuration | Mean | 50% | 99% |
---|---|---|---|
output.js ephemeral=true dynamic | 240us | 161us | 370us |
output.js ephemeral=true static | 146us | 145us | 245us |
output.js ephemeral=false static | 187us | 109us | 2.02ms |
output.rs ephemeral=true dynamic | 78us | 76us | 124us |
output.rs ephemeral=true static | 75us | 73us | 121us |
output.rs ephemeral=false static | 70us | 67us | 116us |
output.synth | 39us | 39us | 59us |
deno --allow-net --allow-read output.js | 45us | 44us | 67us |
Configuration | Mean | 50% | 99% |
---|---|---|---|
main.js ephemeral=true dynamic | 236us | 233us | 370us |
main.js ephemeral=true static | 234us | 232us | 303us |
main.js ephemeral=false | 106us | 79us | 1.04ms |
rust ephemeral=true dynamic | 93us | 82us | 134us |
rust ephemeral=true static | 82us | 82us | 106us |
rust ephemeral=false | 59us | 58us | 85us |
synth | 30us | 29us | 40us |
deno --allow-net main.js | 15us | 15us | 20us |
- Varnish outputs more headers by default which makes a noticeable difference for hello-world timings.
- Slows down `ephemeral=true`.
- Still a handful of errors with `ephemeral=false`. (This is new, need to track down.)
- Doesn't seem to make a difference to `ephemeral=true` performance.
- Still a handful of errors with `ephemeral=false`.
Build with `GN_ARGS="..."`
- Runs well with `"address_space": 8000` instead of `66000`. 1.62ms median.
- Also runs with `"address_space": 4000` but then performance is much worse at 4ms median.
Note: do not put rustflags in `.cargo/config.toml` as that causes a build error:

```
error: cannot produce proc-macro for asn1-rs-derive v0.4.0 as the target x86_64-unknown-linux-gnu does not support these crate types
```
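For reference, this is the kind of `.cargo/config.toml` entry that triggers the error (a hypothetical sketch; pass the flag via `RUSTFLAGS` on the command line instead, as below):

```toml
# Hypothetical config that reproduces the proc-macro error above.
# With crt-static applied globally, cargo also builds proc-macro crates
# (which must be dynamic libraries) with the static-CRT flag.
[target.x86_64-unknown-linux-gnu]
rustflags = ["-C", "target-feature=+crt-static"]
```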
Uses the static glibc approach from *Building static Rust binaries for Linux*. By not using musl we avoid a lengthy v8 build and reuse the published glibc build artifacts from https://github.com/denoland/rusty_v8/releases/.
```
RUSTFLAGS="-C target-feature=+crt-static" cargo build --release --target x86_64-unknown-linux-gnu
ldd ./target/x86_64-unknown-linux-gnu/release/deno-varnish # statically linked
./target/x86_64-unknown-linux-gnu/release/deno-varnish state request $PWD/main.js
```
This produces the following output with the expected segmentation fault.
```
Running file:///[...]/deno_varnish/main.js...
Hello from deno_varnish
Segmentation fault (core dumped)
```
Inspecting the core dump with gdb shows that the segfault happens when the KVM asm API is called.
```
# Enable core dumps on Ubuntu 24.04
sudo sysctl -w kernel.core_pattern=core.%u.%p.%t
ulimit -c unlimited

gdb ./target/x86_64-unknown-linux-gnu/release/deno-varnish core[...]
[...]
Program terminated with signal SIGSEGV, Segmentation fault.
#0  0x[...] in deno_varnish::varnish::set_backend_get ()
```
- Use the `rust-gdb` wrapper.
- Unsure how to load libc debug symbols when debugging a crt-static build.
  - Not sufficient to just install `libc6-dbg`.
I've been using podman, but this should also work under docker.

I installed the static version of podman from https://github.com/mgoltzsche/podman-static on Ubuntu 24.04 and followed the apparmor profile instructions. I also replaced `docker.io` with `mirror.gcr.io` in `/etc/containers/registries.conf`.
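One way that registry swap can look (a guess at the exact stanza; the note above only records the substitution itself):

```toml
# /etc/containers/registries.conf — resolve unqualified image names
# via Google's Docker Hub mirror instead of docker.io.
unqualified-search-registries = ["mirror.gcr.io"]
```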
Enable huge pages on the host:

```
echo 2048 | sudo tee /sys/kernel/mm/hugepages/hugepages-2048kB/nr_hugepages
```
Build:

```
podman build -t deno-varnish .
```

Then run concurrently:

```
podman run --rm -p 127.0.0.1:8080:8080 -e VARNISH_HTTP_PORT=8080 --device /dev/kvm --group-add keep-groups --name deno-varnish deno-varnish
podman exec -it deno-varnish varnishlog
curl http://localhost:8080/hello
```
- Use the `rust-gdb` wrapper.
- Configure with `"allow_debug": true`. Watch the connection timeout along with the timeouts from `"max_boot_time"` and `"max_request_time"`. This allows remotely debugging requests with GDB: the request to be debugged has to cause a breakpoint (in the C API this is done with `sys_breakpoint()`), and the GDB instance must load the program using `file myprogram` before it can remotely connect using `target remote :2159` (see the session sketch below).
- Does not work in combination with `tinykvm.start` in `vcl_init`.
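Putting that together, the remote session looks roughly like this (`myprogram` is the placeholder name from the description above):

```
gdb
(gdb) file myprogram
(gdb) target remote :2159
```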
Add `--gdbjit_full` to the v8 flags; `--gdbjit` doesn't really do anything. No need to rebuild.
Slows things down. Can take a minute or so to hit a breakpoint that would otherwise be instantaneous.
This is fixed by specifying memory limits for v8 in the environment:

```
"DENO_V8_FLAGS=--max-heap-size=64,--max-old-space-size=64"
```
Even after ensuring the `RcHttpRecord` is dropped we still run out of memory.
Run with `"ephemeral": false` and repro with:

```
wrk -t 1 -c 1 http://127.0.0.1:8080/deno
```
If we do want hugepages we can configure malloc to use them:

```
"GLIBC_TUNABLES=glibc.malloc.hugetlb=2:glibc.malloc.mmap_threshold=2097152"
```
Causes a check to fail in `mutex.cc:75`. Presumably down to the v8 update from 134.5.0 to 135.1.0.
Should check where each of these is used. The easiest way seems to be running under gdb with `catch syscall openat`.
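For example (the binary path comes from the build steps above; the gdb invocation itself is standard, not something recorded in these notes):

```
gdb --args ./target/x86_64-unknown-linux-gnu/release/deno-varnish state request $PWD/main.js
(gdb) catch syscall openat
(gdb) run
```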
None of these seem to be absolutely necessary.
Everything under `/proc` is misleading as it reflects the host process, not the current process running within the VM.

```
"/proc/self/mountinfo",
"/proc/self/maps", # Used by rust panic handler setup and v8 setup
"/proc/self/cgroup",
"/proc/stat",
```
Most of these under `/sys` seem to be used by a transitive dependency of swc for the ts transform.

```
"/sys/devices/system/cpu/online",
"/sys/devices/system/cpu/cpu0/tsc_freq_khz", # probably safe
"/sys/fs/cgroup/cgroup.controllers",
"/sys/fs/cgroup/cpu.max",
```
Delete the target directory and `cargo build` with env:

```
V8_FROM_SOURCE=1 V8_FORCE_DEBUG=1 PRINT_GN_ARGS=1 GN_ARGS="line_tables_only=false no_inline_line_tables=false symbol_level=2"
```
TinyKVM now lets us specify `address_space` separately from `max_memory`.
Seems OK to allocate 70GB of address space to avoid this for now.
Tracked down why here and in follow-ups: varnish/tinykvm#23 (comment)
- Would gdb `maintenance info sections` let us know what is causing memory to be paged in?
- Cannot use `/proc/self/maps` as that reflects the varnish process on the host.
Only triggered after switching to `wait_for_requests_paused`, which works in the rust demo.
```
* << BeReq >> 3
- Begin bereq 2 pass
- VCL_use boot
- Timestamp Start: 1742507690.861852 0.000000 0.000000
- BereqMethod GET
- BereqURL /foo
- BereqProtocol HTTP/1.1
- BereqHeader Host: localhost:8080
- BereqHeader User-Agent: curl/8.5.0
- BereqHeader Accept: */*
- BereqHeader X-Forwarded-For: 192.168.50.20
- BereqHeader Via: 1.1 61ebf08ab1c1 (Varnish/7.6)
- BereqHeader X-Varnish: 3
- VCL_call BACKEND_FETCH
- VCL_return fetch
- Timestamp Fetch: 1742507690.862047 0.000194 0.000194
- VCL_Log deno-varnish says: Running
- VCL_Log deno-varnish says: file:///main.js
- VCL_Log deno-varnish says: ...
- VCL_Log deno-varnish says: Hello from deno_varnish
- VCL_Log deno-varnish says: before wait_for_requests_paused
- Error Backend VM memory exception: page_at: page directory not present (addr: 0x1000003000, size: 0x40000000)
- VCL_Log deno-varnish says: CR0: 0x80040033 CR3: 0x7000000000
- VCL_Log deno-varnish says: CR2: 0x0 CR4: 0x350620
- VCL_Log deno-varnish says: RAX: 0x0 RBX: 0x0 RCX: 0x0
- VCL_Log deno-varnish says: RDX: 0x0 RSI: 0x0 RDI: 0x0
- VCL_Log deno-varnish says: RIP: 0x0 RBP: 0x0 RSP: 0x0
- VCL_Log deno-varnish says: SS: 0x23 CS: 0x2B DS: 0x23 FS: 0x0 GS: 0x0
- VCL_Log deno-varnish says: FS BASE: 0x65E1040 GS BASE: 0x5030
- VCL_Log deno-varnish says: [0] 0x 0
- VCL_Log deno-varnish says: CR0: 0x80040033 CR3: 0x7000000000
- VCL_Log deno-varnish says: CR2: 0x0 CR4: 0x350620
- VCL_Log deno-varnish says: RAX: 0x0 RBX: 0x0 RCX: 0x0
- VCL_Log deno-varnish says: RDX: 0x0 RSI: 0x0 RDI: 0x0
- VCL_Log deno-varnish says: RIP: 0x0 RBP: 0x0 RSP: 0x0
- VCL_Log deno-varnish says: SS: 0x23 CS: 0x2B DS: 0x23 FS: 0x0 GS: 0x0
- VCL_Log deno-varnish says: FS BASE: 0x65E1040 GS BASE: 0x5030
- VCL_Log deno-varnish says: [0] 0x 0
- BerespProtocol HTTP/1.1
- BerespStatus 500
- BerespReason Internal Server Error
- BerespHeader Content-Length: 0
- BerespHeader Last-Modified: Thu, 20 Mar 2025 21:54:52 GMT
- Timestamp Beresp: 1742507692.147852 1.285999 1.285804
- BerespHeader Date: Thu, 20 Mar 2025 21:54:52 GMT
- VCL_call BACKEND_RESPONSE
- VCL_return deliver
- Timestamp Process: 1742507692.147862 1.286010 0.000010
- Filters
- Storage malloc Transient
- Fetch_Body 3 length -
- Timestamp BerespBody: 1742507692.147911 1.286059 0.000048
- Length 0
- BereqAcct 0 0 0 0 0 0
- End
```
Does not prevent it from working. Syscall numbers per https://filippo.io/linux-syscall-table/.
```
Info: Child (15) said deno-varnish: Unhandled system call 230 # clock_nanosleep
Info: Child (15) said deno-varnish: Unhandled system call 332 # statx
```
Avoid calling `.enable_all()` when building the tokio runtime, as that includes `.enable_io()`, which triggers this error (a sketch of the workaround follows the log below).
```
* << BeReq >> 3
...
- VCL_Log deno-varnish says: Running
- VCL_Log deno-varnish says: file:///main.js
- VCL_Log deno-varnish says: ...
- VCL_Log deno-varnish says:
thread 'main' panicked at /usr/local/cargo/registry/src/index.crates.io-1949cf8c6b5b557f/tokio-1.44.1/src/signal/unix.rs:60:53:
failed to create UnixStream: Os { code: 38, kind: Unsupported, message: "Function not implemented" }
- VCL_Log deno-varnish says: note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
```
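A minimal sketch of that workaround, assuming a current-thread runtime suffices here (the builder calls are the standard tokio API; everything else is placeholder):

```rust
use tokio::runtime::Builder;

fn main() {
    // Enable only the time driver; skipping .enable_all() means
    // .enable_io() is never turned on, which is what trips the
    // unsupported UnixStream signal setup shown in the log above.
    let rt = Builder::new_current_thread()
        .enable_time()
        .build()
        .expect("failed to build tokio runtime");

    rt.block_on(async {
        // drive the JS event loop / request handling here
    });
}
```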
Fixed in varnish/tinykvm#21.
```
* << BeReq >> 3
...
- Error VM 'deno-varnish' exception: Too many relocations
- Error KVM: Unable to reserve VM for index 0, program deno-varnish
...
```