Clean up and compress some pre-join packets #15881

sfan5 · 2025-03-08T17:36:31Z

TOCLIENT_ANNOUNCE_MEDIA:

string list is compressed by zstd
hashes no longer base64 encoded (lol)

TOCLIENT_MEDIA:

file contents are compressed by zstd

TOCLIENT_ITEMDEF and NODEDEF:

compressed by zstd instead of zlib

Why do these matter?
All of them need to happen before the client can join a game session (minus media transfer if cached), and our network layer isn't that good at bulk data transfer.
So if we reduce the time spent here it will lead to faster join times.

What about media compression?
On the ~400 MB cache/media folder I have amassed, zstd takes two(!) seconds to compress everything and reduces it to 65% of the original size.
Zstd will efficiently skip compression if it deems the file to be incompressible.
If we are ever too concerned about the compression time, the server could very well just (pre-)cache all media files.
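
A minimal sketch of what such a (pre-)cache could look like, assuming the server keys media by some stable identifier such as the file hash. `CompressedMediaCache` and its `Compressor` callback are hypothetical names for illustration and are not part of this PR; the real compressor would be whatever zstd wrapper the server already uses.

```cpp
#include <cstddef>
#include <functional>
#include <string>
#include <unordered_map>
#include <utility>

// Hypothetical helper, not part of the PR: memoizes compressed media blobs
// so each file is compressed at most once per server run.
class CompressedMediaCache {
public:
	// Assumption: the compressor maps raw bytes to compressed bytes.
	using Compressor = std::function<std::string(const std::string &)>;

	explicit CompressedMediaCache(Compressor compress) :
		m_compress(std::move(compress)) {}

	// Returns the compressed form of `raw`. The compressor only runs on the
	// first request for a given key (e.g. the file's hash).
	const std::string &get(const std::string &key, const std::string &raw)
	{
		auto it = m_cache.find(key);
		if (it == m_cache.end())
			it = m_cache.emplace(key, m_compress(raw)).first;
		return it->second;
	}

	std::size_t size() const { return m_cache.size(); }

private:
	Compressor m_compress;
	std::unordered_map<std::string, std::string> m_cache;
};
```

The trade-off is memory: the cache holds a second copy of every media file that has been requested, so this only pays off if compression time actually shows up in profiles.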

To do

This PR is Ready for Review.

How to test

Test combinations of old and new clients and servers.

SmallJoker · 2025-03-08T18:08:21Z

src/network/clientpackethandler.cpp

+			std::istringstream iss(data, std::ios::binary);
+			std::ostringstream oss(std::ios::binary);
+			decompressZstd(iss, oss);
+			data = oss.str();


Perhaps it's time for a nice wrapper function? We're effectively copying the data five times:

1. readLongString (compressed)
2. istringstream constructor (compressed)
3. is.read(input_buffer, bufsize); (compressed)
4. os.write(output_buffer, output.pos); (decompressed)
5. data assignment (decompressed)

Could be simplified down to:

1. (no copy) Read raw data from NetworkPacket (compressed)
2. Write data to an output stream (decompressed)
3. data assignment (decompressed)

Zero-copy string extraction/input for streams only arrived in C++20: https://en.cppreference.com/w/cpp/io/basic_stringstream/str (or C++26 for string views).
We could implement this ourselves as described here: http://videocortex.io/2017/custom-stream-buffers/

readLongString could be avoided if the function returned a string_view.
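
As a rough illustration of the custom stream buffer idea, here is a minimal C++17 sketch of a read-only buffer over a string_view. `ViewBuf` and `read_all` are hypothetical names, not existing helpers in the codebase; the assumption is that a function taking a `std::istream &` (as `decompressZstd` appears to in the diff above) could read from it directly.

```cpp
#include <cstddef>
#include <istream>
#include <streambuf>
#include <string>
#include <string_view>

// Hypothetical helper (not in the PR): a read-only stream buffer that
// exposes existing bytes to std::istream without copying them.
class ViewBuf : public std::streambuf {
public:
	explicit ViewBuf(std::string_view data)
	{
		// setg() wants non-const pointers. The bytes are never written
		// through them because no put area is installed.
		char *begin = const_cast<char *>(data.data());
		setg(begin, begin, begin + data.size());
	}
};

// Usage sketch: drain the view through a std::istream, the same way a
// decompressor taking std::istream & would consume it.
inline std::string read_all(std::string_view input)
{
	ViewBuf buf(input);
	std::istream is(&buf);
	std::string out(input.size(), '\0');
	is.read(out.data(), static_cast<std::streamsize>(out.size()));
	out.resize(static_cast<std::size_t>(is.gcount()));
	return out;
}
```

This removes the istringstream constructor copy. Seeking (`seekg`/`tellg`) is not supported in this form since `seekoff`/`seekpos` are not overridden, and the viewed data must outlive the buffer.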

See also: #14086

sfan5 · 2025-03-08T22:34:52Z

> TOCLIENT_ITEMDEF and NODEDEF: compressed by zstd instead of zlib

During my research for #15885 I also kept a copy of definitions of every server so I have some nice data for this, too.
data basis: uncompressed itemdef and nodedef of 294 servers
total size: 2.4 GB
smallest size: 7 B for nodedef (huh?), 868 B for itemdef
biggest size: 26 MB for nodedef, 14 MB for itemdef
average size: 5 MB for nodedef, 2 MB for itemdef

anyway the important part:

zlib reduces this pile to only 129 MB in 21 seconds
zstd gets it down to 116 MB and takes only 3 seconds

conclusion: (without further tuning) zstd isn't the compression wonder, but it is a speed wonder. 😄

Desour

Code looks fine. Haven't tested.

Do we maybe want to have a flag per file for whether a media file is compressed? The server could then decide if it wants to compress.

In TOCLIENT_MEDIA, the names could also be stored separately, and compressed together.

Is the doc in networkprotocol.h supposed to only document the newest protocol version?

src/server.cpp

src/network/networkprotocol.h

sfan5 · 2025-03-10T18:06:00Z

> Do we maybe want to have a flag per file for whether a media file is compressed? The server could then decide if it wants to compress.

Data that zstd doesn't want to compress seems to be passed through with 10 bytes added. It's also still fast at doing this.
So I don't see a benefit either in data size or speed.

If speed is really, really a problem, the zstd frame format is not complicated: you can just paste the right bytes in front of uncompressed data to turn it into valid zstd.
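
To make the frame idea concrete, here is a sketch that wraps bytes in a zstd frame consisting only of raw (uncompressed) blocks. The layout follows my reading of RFC 8878 and should be checked against a real decoder before use; `wrap_as_raw_zstd_frame` is a hypothetical name and is not part of this PR. Since a block is limited to 128 KiB, larger inputs need one 3-byte block header per chunk rather than a single prefix.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <string>
#include <string_view>

// Hypothetical helper (not in the PR): emits a zstd frame made of Raw_Block
// entries only, so no compression work is done.
// Assumption: layout per RFC 8878; verify with a real zstd decoder.
inline std::string wrap_as_raw_zstd_frame(std::string_view data)
{
	constexpr std::size_t MAX_BLOCK = 128 * 1024; // Block_Maximum_Size

	std::string out;
	out.reserve(data.size() + 6 + 3 * (data.size() / MAX_BLOCK + 1));

	// Magic number 0xFD2FB528, stored little-endian.
	out.append("\x28\xB5\x2F\xFD", 4);
	// Frame_Header_Descriptor: no content size, no checksum, no dictionary,
	// not single-segment, so a Window_Descriptor byte follows.
	out.push_back('\x00');
	// Window_Descriptor: exponent 7, mantissa 0 -> 1 << (10 + 7) = 128 KiB.
	out.push_back('\x38');

	std::size_t pos = 0;
	do {
		const std::size_t n = std::min(MAX_BLOCK, data.size() - pos);
		const bool last = (pos + n == data.size());
		// Block_Header (3 bytes, little-endian):
		// bit 0 = Last_Block, bits 1-2 = Block_Type (0 = raw), bits 3+ = size.
		const std::uint32_t hdr =
			(last ? 1u : 0u) | (static_cast<std::uint32_t>(n) << 3);
		out.push_back(static_cast<char>(hdr & 0xFF));
		out.push_back(static_cast<char>((hdr >> 8) & 0xFF));
		out.push_back(static_cast<char>((hdr >> 16) & 0xFF));
		out.append(data.substr(pos, n));
		pos += n;
	} while (pos < data.size());
	return out;
}
```

With this layout the overhead is 6 bytes of frame header plus 3 bytes per 128 KiB block. Empty input still produces one empty last block.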

> In TOCLIENT_MEDIA, the names could also be stored separately, and compressed together.

Yeah, but there aren't that many per bunch, actually.

> Is the doc in networkprotocol.h supposed to only document the newest protocol version?

I think so. We have git for the history.

sfan5 added 3 commits March 8, 2025 17:58

aaa

4dd7c3b

docs

9c73d22

more stuff

38ed268

sfan5 added @ Network Performance labels Mar 8, 2025

SmallJoker reviewed Mar 8, 2025

fixed it

d1265e4

sfan5 force-pushed the seiso branch from a704827 to d1265e4 Compare March 8, 2025 19:18

Desour reviewed Mar 9, 2025

src/server.cpp (outdated)

src/network/networkprotocol.h (outdated)

bbb

04be946
