Simplifying the lazy formula in evaluate #4868

FauziAkram · 2023-11-09T11:01:16Z

Simplifying the lazy formula in evaluate, by removing one multiplication operation.

Passed STC:
LLR: 2.93 (-2.94,2.94) <-1.75,0.25>
Total: 44320 W: 11342 L: 11132 D: 21846
Ptnml(0-2): 163, 4947, 11731, 5155, 164
https://tests.stockfishchess.org/tests/view/654ab7f0136acbc57352aba7

Passed LTC:
LLR: 2.94 (-2.94,2.94) <-1.75,0.25>
Total: 53268 W: 13292 L: 13102 D: 26874
Ptnml(0-2): 35, 5908, 14556, 6102, 33
https://tests.stockfishchess.org/tests/view/654b6863136acbc57352b9c3

Passed also the test with adj=OFF:
LLR: 2.93 (-2.94,2.94) <-1.75,0.25>
Total: 91776 W: 23436 L: 23276 D: 45064
Ptnml(0-2): 344, 10619, 23818, 10747, 360
https://tests.stockfishchess.org/tests/view/654e31c5136acbc57352f2e8

bench: 1484225

FauziAkram · 2023-11-09T18:02:01Z

@snicolet Can you please give us your thoughts about this simplification?

locutus2 · 2023-11-10T06:35:47Z

Also i have started a test against this PR which completly remove the shuffling part.
STC passed here https://tests.stockfishchess.org/tests/view/654cfec8136acbc57352d9c1
LTC runs here https://tests.stockfishchess.org/tests/view/654db061136acbc57352e7d5

vdbergh · 2023-11-10T12:24:36Z

I wonder if this should not be tested with adjudication off.

XInTheDark · 2023-11-10T13:19:22Z

Yes testing with adjudication off would be quite useful in this instance.

locutus2 · 2023-11-10T16:14:05Z

@vdbergh
That seems a good idea. I stopped my LTC of the complete removal.

snicolet · 2023-11-11T12:08:29Z

@snicolet Can you please give us your thoughts about this simplification?

We introduced this term to be 100% sure that Stockfish would never enter a shuffling sub-branch without progress during search because of SimpleEval, but do whatever you want if you want to remove that :-)

FauziAkram · 2023-11-11T14:34:41Z

The test with adj=OFF is underway:
https://tests.stockfishchess.org/tests/view/654e31c5136acbc57352f2e8

@vondele in case also this test passes, is an LTC test with adj=OFF needed?

FauziAkram · 2023-11-11T18:43:18Z

Passed also the test with adj=OFF:
LLR: 2.93 (-2.94,2.94) <-1.75,0.25>
Total: 91776 W: 23436 L: 23276 D: 45064
Ptnml(0-2): 344, 10619, 23818, 10747, 360
https://tests.stockfishchess.org/tests/view/654e31c5136acbc57352f2e8

Vizvezdenec · 2023-11-13T04:11:09Z

I don't really like this stuff because indeed this part of code was never to gain elo but rather to not evaluate the same stuck position with lazy eval into oblivion.
I would prefer such things to be tested with elo gain bounds like verification search removal and stuff.

vdbergh · 2023-11-13T06:55:22Z

@Vizvezdenec I agree in principle. But there should be some evidence that this extra term really works.

It is trivial to show that removal of verification search is not good since it makes SF blind to zugzwang. One would like to see similar evidence for the shuffling term, before including it.

peregrineshahin · 2023-11-13T07:05:20Z

It is trivial to show that removal of verification search is not good since it makes SF blind to zugzwang

I don't think that's true at all.

peregrineshahin · 2023-11-13T07:10:54Z

I don't think that's true at all.

In the sense that it is not trivial. We don't have a script to test. and we don't have the data. so basically there is no actual procedure, to verify that one would go to the most obscure commits for pb0, and still the data and conclusions are very shaky.

peregrineshahin · 2023-11-13T07:14:28Z

The positions were tested using Arena GUI not even a Python script, Who on the earth uses Arena to test positions for development?

vdbergh · 2023-11-13T07:17:31Z

I don't think that's true at all.

In the sense that it is not trivial. We don't have a script to test. and we don't have the data. so basically there is no actual procedure, to verify that one would go to the most obscure commits for pb0, and still the data and conclusions are very shaky.

Well take any position whose solution depends on zugzwang and SF will not be able to solve it without verification search. So yes it is trivial. This is the reason why in the past attempts to remove verification search were quickly reverted after users started complaining.

But I agree that it would be good to have an explicit list of zugzwang positions which can be used to validate changes in nulmove pruning.

Vizvezdenec · 2023-11-13T08:26:11Z

I don't think that's true at all.

In the sense that it is not trivial. We don't have a script to test. and we don't have the data. so basically there is no actual procedure, to verify that one would go to the most obscure commits for pb0, and still the data and conclusions are very shaky.

Well take any position whose solution depends on zugzwang and SF will not be able to solve it without verification search. So yes it is trivial. This is the reason why in the past attempts to remove verification search were quickly reverted after users started complaining.

But I agree that it would be good to have an explicit list of zugzwang positions which can be used to validate changes in nulmove pruning.

This is not what actually happens. Stockfish is able to search quite a lot of positions that depend on zugzwang even without any verification search whatsoever.
Same goes for other engines that actually don't have verification search naturally.

vdbergh · 2023-11-13T08:46:47Z

I guess it depends on what you call "quite a lot". It's been a long time since I looked at this but I recall that SF was unable to do any endgame problem (even trivial ones) whose solution depends on zugzwang with verification search disabled. Perhaps this has changed but I would be surprised.

vondele · 2023-11-13T10:16:55Z

so, let's leave null moves out of the picture. The simpleEval threshold appears to have right now 3 kinds of safeguards, ensuring it is not used in winning positions (the addition of bestValue), it is not used when the root position is imbalanced (the additional of simpleEval for the root), as well as the shuffling term.

Do we have (or can construct) any position right now where complete removal (not this scaling down) of the shuffling term leads to a wrong evaluation? Probably a balanced draw position where an Q-capture leads to a draw fortress could do ?

FauziAkram · 2023-11-20T15:04:01Z

Ok, to proceed with this PR, launched the simplification removing the whole shuffling term altogether, and testing with adj OFF
https://tests.stockfishchess.org/tests/view/655b7544136acbc573540dfd

FauziAkram · 2023-11-22T10:56:14Z

Also the complete removal of shuffling, with adj OFF passed the test:
LLR: 2.94 (-2.94,2.94) <-1.75,0.25>
Total: 120928 W: 30658 L: 30530 D: 59740
Ptnml(0-2): 423, 14058, 31375, 14184, 424
https://tests.stockfishchess.org/tests/view/655b7544136acbc573540dfd

And since no one presented any position where this change leads to a wrong evaluation, maybe we can proceed in merging.

Disservin · 2024-01-21T11:17:37Z

Outdated, code no longer exists after dual net merge, so closing.

bench: 1484225

4e77951

bench: 1484225

Disservin added the bench-change Changes the bench label Nov 20, 2023

Disservin closed this Jan 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplifying the lazy formula in evaluate #4868

Simplifying the lazy formula in evaluate #4868

FauziAkram commented Nov 9, 2023 •

edited

Loading

FauziAkram commented Nov 9, 2023

locutus2 commented Nov 10, 2023

vdbergh commented Nov 10, 2023

XInTheDark commented Nov 10, 2023

locutus2 commented Nov 10, 2023

snicolet commented Nov 11, 2023 •

edited

Loading

FauziAkram commented Nov 11, 2023

FauziAkram commented Nov 11, 2023

Vizvezdenec commented Nov 13, 2023

vdbergh commented Nov 13, 2023

peregrineshahin commented Nov 13, 2023 •

edited

Loading

peregrineshahin commented Nov 13, 2023 •

edited

Loading

peregrineshahin commented Nov 13, 2023

vdbergh commented Nov 13, 2023 •

edited

Loading

Vizvezdenec commented Nov 13, 2023 •

edited

Loading

vdbergh commented Nov 13, 2023 •

edited

Loading

vondele commented Nov 13, 2023

FauziAkram commented Nov 20, 2023

FauziAkram commented Nov 22, 2023

Disservin commented Jan 21, 2024

Simplifying the lazy formula in evaluate #4868

Simplifying the lazy formula in evaluate #4868

Conversation

FauziAkram commented Nov 9, 2023 • edited Loading

FauziAkram commented Nov 9, 2023

locutus2 commented Nov 10, 2023

vdbergh commented Nov 10, 2023

XInTheDark commented Nov 10, 2023

locutus2 commented Nov 10, 2023

snicolet commented Nov 11, 2023 • edited Loading

FauziAkram commented Nov 11, 2023

FauziAkram commented Nov 11, 2023

Vizvezdenec commented Nov 13, 2023

vdbergh commented Nov 13, 2023

peregrineshahin commented Nov 13, 2023 • edited Loading

peregrineshahin commented Nov 13, 2023 • edited Loading

peregrineshahin commented Nov 13, 2023

vdbergh commented Nov 13, 2023 • edited Loading

Vizvezdenec commented Nov 13, 2023 • edited Loading

vdbergh commented Nov 13, 2023 • edited Loading

vondele commented Nov 13, 2023

FauziAkram commented Nov 20, 2023

FauziAkram commented Nov 22, 2023

Disservin commented Jan 21, 2024

FauziAkram commented Nov 9, 2023 •

edited

Loading

snicolet commented Nov 11, 2023 •

edited

Loading

peregrineshahin commented Nov 13, 2023 •

edited

Loading

peregrineshahin commented Nov 13, 2023 •

edited

Loading

vdbergh commented Nov 13, 2023 •

edited

Loading

Vizvezdenec commented Nov 13, 2023 •

edited

Loading

vdbergh commented Nov 13, 2023 •

edited

Loading