-
Notifications
You must be signed in to change notification settings - Fork 106
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bladebit-cuda-v3.1.0-windows-x86-64 very slowly #448
Comments
Assuming your drive Q: is a slow HDD, you have the issue right there. To prevent this, use another fast SSD, or the same as your temp SSD as final directory. |
I have already tried it and do not see a big difference. Maybe 5 minutes less. I can also observe, that fast SSD will be used very slow. Can it be that windows or the code reduces the speed? With Gigahorce Plotter it takes only 7 min with the same hardware... |
I remember i had some issues with the short version of the parameters. I think some didnt work. So i mostly used the long version of parameters. -t1 Frist drive is the temp second drive is the final drive (use same or different fast SSD here) Give it a try. |
Still the same. What I can observe: Gigahorse uses common RAM (up to 60 GB) and bladebit do not. Only RAM use goes high... |
Can you post the terminal output thats is in front of you posted output. this might give some extra information. |
Me too. I have tried 256g ram and 128g+nvme, the speed is the same. Bladebit Chia Plotter [Global Plotting Config] [Bladebit CUDA Plotter] Selected cuda device 0 : Tesla P4 Allocating buffers (this may take a few seconds)... Generating plot 1 / 1: 841186af4a31f234ea83d3546801c33ff0ed28ef62262754cb6ddb1acdce7d39 Generating F1 H:\plot-k32-c07-2024-01-04-22-59-841186af4a31f234ea83d3546801c33ff0ed28ef62262754cb6ddb1acdce7d39.plot.tmp -> H:\plot-k32-c07-2024-01-04-22-59-841186af4a31f234ea83d3546801c33ff0ed28ef62262754cb6ddb1acdce7d39.plot Final plot table sizes: |
with this code: .\bladebit_cuda.exe -f xch -c xch --threads 14 -n 1 --compress 3 cudaplot --disk-128 -t1 Z:\TMP\ Z:\NFT\ Bladebit Chia Plotter [Global Plotting Config] [Bladebit CUDA Plotter] Selected cuda device 0 : NVIDIA GeForce RTX 3070 Allocating buffers (this may take a few seconds)... Generating plot 1 / 1: 86f5af3f8c8fd54db8626565b11fb072f47f9d5ec412b37208094a4612d7528e Generating F1 Z:\NFT\plot-k32-c03-2024-01-07-17-52-86f5af3f8c8fd54db8626565b11fb072f47f9d5ec412b37208094a4612d7528e.plot.tmp -> Z:\NFT\plot-k32-c03-2024-01-07-17-52-86f5af3f8c8fd54db8626565b11fb072f47f9d5ec412b37208094a4612d7528e.plot Final plot table sizes: |
I don't see anything obvious other than your times being much to high. Nvidia driver up do date? I know they had issues with older drivers and bladebit. Worth a check! Check you NVME SSD speed. e.g. Crystal Disk Mark. Its very odd to me that the step " Completed writing plot" took 39.16 seconds for you. If you run bladebit in powershell (my suggestion) you should run powershell with admin rights. Also odd, but probably nothing: I don't use the last backslash in the command path It might also help to open windows recource manager while plotting to locate where the bottleneck is. The second entry wit a time "Table 2 completed in" should be at something like 20 seconds or less. |
oh and a warning. check your plots when done. like deep check them. don't settle for the default 30 checks. go like 100 or 200 checks. I had so many bad plots in an earlier version. they all would pass the 30 checks. but going to 200 they showed to be faulty. |
Thank you for ideas. |
i think there is some issue sometimes with slow writes to NVMe disks. have you tired do turn off direct-io? |
.\bladebit_cuda.exe -f xch -c xch --no-direct-io --threads 14 -n 1000 --compress 3 cudaplot --disk-128 -t1 Z:\TMP Z:\NFT |
With this code line:
.\bladebit_cuda -f xch -c xch -z 7 -n 90 -w cudaplot --disk-128 -t1 Z:\Tmp\ -t2 Z:\Tmp\ Q:\NFT\
it takes up to 50 min for one plot.
System: Win 10 pro, 128 GB Ram, Ryzen 7 5800x, RTX3070, Z:\ - Gen 4 NVMe
What do I do wrong?
Generating plot 14 / 90: a87ce0887756f8dcda26bb50dd57fc1928ce6dba1e6d6c7522b873a3ffe5912a
Plot temporary file: Q:\NFT\plot-k32-c07-2023-12-28-09-40-a87ce0887756f8dcda26bb50dd57fc1928ce6dba1e6d6c7522b873a3ffe5912a.plot.tmp
Generating F1
Finished F1 in 31.75 seconds.
Table 2 completed in 74.02 seconds with 4294890872 entries.
Table 3 completed in 426.05 seconds with 4294837918 entries.
Table 4 completed in 470.47 seconds with 4294781997 entries.
Table 5 completed in 353.66 seconds with 4294636582 entries.
Table 6 completed in 554.12 seconds with 4294187494 entries.
Table 7 completed in 205.78 seconds with 4293486129 entries.
Finalizing Table 7
Finalized Table 7 in 10.29 seconds.
Completed Phase 1 in 2132.26 seconds
Marked Table 6 in 11.01 seconds.
Marked Table 5 in 25.55 seconds.
Marked Table 4 in 18.59 seconds.
Marked Table 3 in 22.13 seconds.
Completed Phase 2 in 77.28 seconds
Compressing Table 2 and 3...
Step 1 completed step in 144.55 seconds.
Step 2 completed step in 14.79 seconds.
Completed table 2 in 159.34 seconds with 3439742752 / 4294837918 entries ( 80.09% ).
Compressing tables 3 and 4...
Step 1 completed step in 233.54 seconds.
Step 2 completed step in 30.60 seconds.
Step 3 completed step in 67.32 seconds.
Completed table 3 in 331.46 seconds with 3465825632 / 4294781997 entries ( 80.70% ).
Compressing tables 4 and 5...
Step 1 completed step in 37.38 seconds.
Step 2 completed step in 16.94 seconds.
Step 3 completed step in 101.94 seconds.
Completed table 4 in 156.28 seconds with 3532540674 / 4294636582 entries ( 82.25% ).
Compressing tables 5 and 6...
Step 1 completed step in 20.95 seconds.
Step 2 completed step in 16.97 seconds.
Step 3 completed step in 116.36 seconds.
Completed table 5 in 154.29 seconds with 3712840674 / 4294187494 entries ( 86.46% ).
Compressing tables 6 and 7...
Step 1 completed step in 61.23 seconds.
Step 2 completed step in 47.44 seconds.
Step 3 completed step in 203.17 seconds.
Completed table 6 in 311.86 seconds with 4293486129 / 4293486129 entries ( 100.00% ).
Serializing P7 entries
Completed serializing P7 entries in 42.87 seconds.
Completed Phase 3 in 1156.13 seconds
Completed Plot 1 in 3365.67 seconds ( 56.09 minutes )
The text was updated successfully, but these errors were encountered: