-
Notifications
You must be signed in to change notification settings - Fork 2.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Again about the benchmark #5157
Comments
Yes the comment is outdated but a table not being used anymore and a useless prefetch not being present doesn't make the engine slower, the change was non functional. |
SF 16, still used a mixed of classical and main net, where the classical part is probably giving the other additional speedup. SF 16 bench was a mix of positions with NNUE and classical, I think 50/50 split. SF 16.1 uses the main net and the small net, with classical completely removed... |
We should probably simplify materialKey away though. |
Before removing classical eval:Simplify PvNode reduction Jul 11,2023 With removing classical eval: Jul 11, 2023 Same net I don't understand why. |
As Disservin already said, it is because of 50% of the bench positions used only the classical evaluation, which was much faster |
Done |
I will answer you, though I'm already regretting it (interaction with you is sickening). |
Lines 1056 to 1059 in 6a8767a
|
|
OK: so, for two elo points, do you degrade performance so fearfully? |
What |
how is this relevant to your problem about the benchmark nps? |
hce wasn't removed to gain 2 Elo, it was removed even if removing it lost 2 Elo, because with long enough tc the impact was 0. |
In general arguing about hce removal now is useless, it's a done deal, new and better solutions now fill the void the hce removal left, devs are working on sf 17 and stuff that was definitive in sf 16 is definitely not relevant now. |
For me, it is crazy and I don't agree at all, but clearly do as you wish. |
you don't understand anything you talk about, that's all there is to it, as usual. |
This comment was marked as off-topic.
This comment was marked as off-topic.
I don't even get your point... on my benchmarks #5157 (comment), HCE was removed with some loss at STC, the higher marginal higher NPS which Stockfish had before were countervailed with the stronger evaluation. |
I develop a fork of stockfish with the goal of making it understandable/useful to the OTB player, not necessarily, therefore, stronger, and I wondered why this frightening degradation of performance. That's all. |
how is a 2 Elo impact at bullet time controls a "frightening degradation of performance"?, why are we acting like the basically 0 Elo impact (at the very very very long time controls you otb players love) wasn't made even less impactful by the fact smallnet exists? the truth of the matter is that sf is now stronger than when hce was still present and it's stronger than it would be if hce was still present. |
NPS doesn't mean anything, if you want high nps go back to pure alpha beta but lose accuracy of the evaluation. Your speed comparison that you posted earlier were wrong since it used a mixed bench, which is not even what stockfish did during regular play. In my two tests, the difference was like 5%, which is not "frightening degradation of performance" imo. |
This comment was marked as spam.
This comment was marked as spam.
It doesn't matter what anyone thinks, this is about numbers, nps, elo and time controls. This is equivalent to math, it doesn't matter if i think that 3 + 2 is 7 or if my dream is for 12 * 10 to be 97, this is factual information that can only be correctly or incorrectly interpreted, there's no room for opinions here. |
Commits on Jan 14, 2024 before refactor after refactor Same net and bench |
that's a supposedly 0.4% speed drop, it's too small of a drop to be measured with a single bench run or even 3 or 5, it's easily explained away by noise and error bars in the bench measurement. |
A single bench run isn't very reliable. You could probably redo a single bench at some point and get reversed results. You should be using one of the many speedup scripts to better determine the effect of a patch on the overall performance. |
Here you can find all the info on how to properly measure the speed of stockfish: https://github.com/official-stockfish/Stockfish/wiki/Advanced-topics#measure-the-speed-of-stockfish, coincidentally it also says |
Describe the issue
No: imho, the problem isn't the network size.
I think the refactoring with the worker isn't no funcitonal changes.
For example, in position.cpp there are the following lines:
but the materialTable has been deleted and there is no prefetch.
Expected behavior
not this behaviour
Steps to reproduce
./Stockfish16 bench
./Stockfish16.1 bench
Anything else?
No response
Operating system
All
Stockfish version
S
The text was updated successfully, but these errors were encountered: