optimize unnecessary require operation in write_ascii_slow #835

pyb1993 · 2021-06-14T16:31:55Z

you can see the discussion here
This PR is to reduce the require operation when serialize huge ascii string
You can use

mvn -f benchmarks/pom.xml compile exec:java -Dexec.args="-f 4 -wi 5 -i 3 -t 2 -w 2s -r 2 HugeStringBenchmark.writeAsciiHuge"
to test the performance (with and without this optimization), I have show a short result in the discussion

theigl · 2021-06-22T09:51:32Z

@pyb1993: I have not forgotten about this PR. I'm currently on vacation and will take a look when I'm back in 2 weeks.

theigl · 2021-07-05T11:10:43Z

@pyb1993: I just looked into your PR and can reproduce the performance gains. However, I'm not sure about applying your changes, since they only affect a rare corner case. As you wrote on the mailing list:

But If there is such a case(and I do a benchmark for this rare case), the performance will have huge improvement in writeAscii.

the length of string is larger than the initial output buffer size. (which will cause the unecessary require operation(allocate and copy))

the output is not reused, new output each time (I think the best practice is reused the output if possible)

If performance is important, the output should be pooled and re-used. Otherwise you will see a lot of allocation and a lot of GC pressure. Your change is simple enough, but it still makes the code slightly harder to understand and I'm not sure if any real user would benefit from the it.

pyb1993 · 2021-07-06T06:47:48Z

Hi，@theigl I know this is really a rare case，but I found some other case that this function “maxallowedRequired” may be used，such as the writeInts method，which directly use capcacity to compare a threshold。 maybe there are more exra case that can benefit from this function。this function should be a general function to be used in such situation。

…

------------------ Original ------------------ From: Thomas Heigl ***@***.***> Date: Mon,Jul 5,2021 7:10 PM To: EsotericSoftware/kryo ***@***.***> Cc: .... ***@***.***>, Mention ***@***.***> Subject: Re: [EsotericSoftware/kryo] optimize unnecessary require operation in write_ascii_slow (#835) @pyb1993: I just looked into your PR and can reproduce the performance gains. However, I'm not sure about applying your changes, since they only affect a rare corner case. As you wrote on the mailing list: But If there is such a case(and I do a benchmark for this rare case), the performance will have huge improvement in writeAscii. the length of string is larger than the initial output buffer size. (which will cause the unecessary require operation(allocate and copy)) the output is not reused, new output each time (I think the best practice is reused the output if possible) If performance is important, the output should be pooled and re-used. Otherwise you will see a lot of allocation and a lot of GC pressure. Your change is simple enough, but it still makes the code slightly harder to understand and I'm not sure if any real user would benefit from the it. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or unsubscribe.

pyb1993 · 2021-07-11T08:38:08Z

@pyb1993: I just looked into your PR and can reproduce the performance gains. However, I'm not sure about applying your changes, since they only affect a rare corner case. As you wrote on the mailing list:

But If there is such a case(and I do a benchmark for this rare case), the performance will have huge improvement in writeAscii.

the length of string is larger than the initial output buffer size. (which will cause the unecessary require operation(allocate and copy))

the output is not reused, new output each time (I think the best practice is reused the output if possible)

If performance is important, the output should be pooled and re-used. Otherwise you will see a lot of allocation and a lot of GC pressure. Your change is simple enough, but it still makes the code slightly harder to understand and I'm not sure if any real user would benefit from the it.

Hi, how do you think about his PR? as what I talked above, I think there are a lot similar situation that
capacity >= count << k in writeLongs, writeInts, and writeDoubles

theigl · 2021-07-14T14:19:41Z

@pyb1993: I'm still +0 on this. It can improve things for edge cases, but I'm not sure the change is worth it. I'll try to get some more feedback on this.

pyb1993 added 2 commits June 15, 2021 00:06

optimize unnecessary require operation in write_ascii_slow

836ab60

remove some typo and blank

a8bc025

pyb1993 force-pushed the optimize_write_ascii branch from 2d12505 to a8bc025 Compare June 15, 2021 05:37

theigl added the enhancement label Jul 23, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

optimize unnecessary require operation in write_ascii_slow #835

optimize unnecessary require operation in write_ascii_slow #835

pyb1993 commented Jun 14, 2021

theigl commented Jun 22, 2021

theigl commented Jul 5, 2021

pyb1993 commented Jul 6, 2021 via email •

edited

Loading

pyb1993 commented Jul 11, 2021

theigl commented Jul 14, 2021

optimize unnecessary require operation in write_ascii_slow #835

Are you sure you want to change the base?

optimize unnecessary require operation in write_ascii_slow #835

Conversation

pyb1993 commented Jun 14, 2021

theigl commented Jun 22, 2021

theigl commented Jul 5, 2021

pyb1993 commented Jul 6, 2021 via email • edited Loading

pyb1993 commented Jul 11, 2021

theigl commented Jul 14, 2021

pyb1993 commented Jul 6, 2021 via email •

edited

Loading