[BUG] Hauler Performance Issues #327
-
Environmental Info:
Hauler Version:
Describe the Bug:
Steps to Reproduce:
Expected Behavior:
Actual Behavior:
Additional Context:
We should use this case to start tracking all possible performance issues with Hauler with full environment specs and details to start narrowing down exactly what's happening |
Beta Was this translation helpful? Give feedback.
Replies: 19 comments
-
MS-01
using https://gist.github.com/clemenko/11edaa5f5c84c2f5f603257dcff6787d |
Beta Was this translation helpful? Give feedback.
-
vSphere Lab 4core X 4GB RAM
|
Beta Was this translation helpful? Give feedback.
-
I dont see the issue here. Also based on the test above key validation is actualy faster..... Proxmox 4CPU/4GB with product with key real 3m55.884s |
Beta Was this translation helpful? Give feedback.
-
Proxmox 4x8
|
Beta Was this translation helpful? Give feedback.
-
System Overview:
[ec2-user@ip-172-31-91-194 ~]$ cat /etc/os-release
NAME="Amazon Linux"
VERSION="2023"
ID="amzn"
ID_LIKE="fedora"
VERSION_ID="2023"
PLATFORM_ID="platform:al2023"
PRETTY_NAME="Amazon Linux 2023.5.20240624"
ANSI_COLOR="0;33"
CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023"
HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/"
DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/"
SUPPORT_URL="https://aws.amazon.com/premiumsupport/"
BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023"
VENDOR_NAME="AWS"
VENDOR_URL="https://aws.amazon.com/"
SUPPORT_END="2028-03-15"
---------
[ec2-user@ip-172-31-91-194 ~]$ uname -a
Linux ip-172-31-91-194.ec2.internal 6.1.94-99.176.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Jun 18 14:57:56 UTC 2024 x86_64 x86_64 x86_64 GNU/Linux
---------
docker.io
[ec2-user@ip-172-31-91-194 ~]$ time hauler store sync -f airgap_hauler.yaml
real 3m35.088s
user 1m2.236s
sys 0m12.466s
TOTAL | 8.5 GB
---------
rgcrprod.azurecr.us
[ec2-user@ip-172-31-91-194 ~]$ time hauler store sync -f carbide.yaml -s carbide-store
real 4m23.001s
user 1m37.314s
sys 0m14.929s
TOTAL | 8.7 GB
---------
rgcrprod.azurecr.us with carbide-key.pub
[ec2-user@ip-172-31-91-194 ~]$ time hauler store sync -f carbide-key.yaml -s carbide-key-store
real 4m29.187s
user 1m50.926s
sys 0m17.000s
TOTAL | 8.7 GB |
Beta Was this translation helpful? Give feedback.
-
System Overview:
[azureuser@hauler-testing ~]$ cat /etc/os-release
NAME="Rocky Linux"
VERSION="9.3 (Blue Onyx)"
ID="rocky"
ID_LIKE="rhel centos fedora"
VERSION_ID="9.3"
PLATFORM_ID="platform:el9"
PRETTY_NAME="Rocky Linux 9.3 (Blue Onyx)"
ANSI_COLOR="0;32"
LOGO="fedora-logo-icon"
CPE_NAME="cpe:/o:rocky:rocky:9::baseos"
HOME_URL="https://rockylinux.org/"
BUG_REPORT_URL="https://bugs.rockylinux.org/"
SUPPORT_END="2032-05-31"
ROCKY_SUPPORT_PRODUCT="Rocky-Linux-9"
ROCKY_SUPPORT_PRODUCT_VERSION="9.3"
REDHAT_SUPPORT_PRODUCT="Rocky Linux"
REDHAT_SUPPORT_PRODUCT_VERSION="9.3"
---------
[azureuser@hauler-testing ~]$ uname -a
Linux hauler-testing 5.14.0-362.8.1.el9_3.x86_64 #1 SMP PREEMPT_DYNAMIC Wed Nov 8 17:36:32 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
---------
docker.io
[azureuser@hauler-testing ~]$ time hauler store sync -f airgap_hauler.yaml
real 2m16.121s
user 1m8.562s
sys 0m16.214s
TOTAL | 8.5 GB
---------
rgcrprod.azurecr.us
[azureuser@hauler-testing ~]$ time hauler store sync -f carbide.yaml -s carbide-store
real 3m17.944s
user 1m41.441s
sys 0m17.702s
TOTAL | 8.7 GB
---------
rgcrprod.azurecr.us with carbide-key.pub
[azureuser@hauler-testing ~]$ time hauler store sync -f carbide-key.yaml -s carbide-key-store
real 4m05.921s
user 1m58.424s
sys 0m21.370s
TOTAL | 8.7 GB |
Beta Was this translation helpful? Give feedback.
-
Results from DigitalOcean comparing a yaml pointing at docker/quay vs azure https://gist.github.com/clemenko/f1a2389d34c9d69eafb08fe342b790e1
This is without ANY public key. |
Beta Was this translation helpful? Give feedback.
-
System Overview:
[zackbradys@hauler ~]$ cat /etc/os-release
NAME="Rocky Linux"
VERSION="9.4 (Blue Onyx)"
ID="rocky"
ID_LIKE="rhel centos fedora"
VERSION_ID="9.4"
PLATFORM_ID="platform:el9"
PRETTY_NAME="Rocky Linux 9.4 (Blue Onyx)"
ANSI_COLOR="0;32"
LOGO="fedora-logo-icon"
CPE_NAME="cpe:/o:rocky:rocky:9::baseos"
HOME_URL="https://rockylinux.org/"
BUG_REPORT_URL="https://bugs.rockylinux.org/"
SUPPORT_END="2032-05-31"
ROCKY_SUPPORT_PRODUCT="Rocky-Linux-9"
ROCKY_SUPPORT_PRODUCT_VERSION="9.4"
REDHAT_SUPPORT_PRODUCT="Rocky Linux"
REDHAT_SUPPORT_PRODUCT_VERSION="9.4"
---------
[zackbradys@hauler ~]$ uname -a
Linux hauler 5.14.0-427.20.1.el9_4.x86_64 #1 SMP PREEMPT_DYNAMIC Fri Jun 7 14:51:39 UTC 2024 x86_64 x86_64 x86_64 GNU/Linux
---------
docker.io
[zackbradys@hauler ~]$ time hauler store sync -f airgap_hauler.yaml
real 3m9.761s
user 0m29.725s
sys 0m22.380s
TOTAL | 8.5 GB
---------
rgcrprod.azurecr.us
[zackbradys@hauler ~]$ time hauler store sync -f carbide.yaml -s carbide-store
real 6m32.380s
user 0m42.627s
sys 0m25.667s
TOTAL | 8.7 GB
---------
rgcrprod.azurecr.us with carbide-key.pub
[zackbradys@hauler ~]$ time hauler store sync -f carbide-key.yaml -s carbide-key-store
real 7m24.888s
user 0m51.636s
sys 0m19.589s
TOTAL | 8.7 GB |
Beta Was this translation helpful? Give feedback.
-
this is clearly an azure issue |
Beta Was this translation helpful? Give feedback.
-
another test from Digital Ocean
|
Beta Was this translation helpful? Give feedback.
-
macos
|
Beta Was this translation helpful? Give feedback.
-
on harvester, ubuntu
|
Beta Was this translation helpful? Give feedback.
-
https://learn.microsoft.com/en-us/azure/container-registry/container-registry-skus What tier are we using? |
Beta Was this translation helpful? Give feedback.
-
I was able to track down the slowness on my network. My homelab was stuck on 100Mb due to a bad cable on from my core switch to my patch panel. After swapping that cable, my homelab is back to 1G and my Hauler stores of the full Rancher product are around 2 hours. For those reporting performance issues with Hauler, please use the following test script (for Rocky) and provide the following information. If you're testing this on Ubuntu, adjust the script as needed. Test Script for Rocky: https://gist.github.com/clemenko/11edaa5f5c84c2f5f603257dcff6787d
|
Beta Was this translation helpful? Give feedback.
-
@HoustonDad Here are the results. ----- speed test ----- ----- without product ----- real 8m38.141s ----- with product ----- real 0m10.670s hauler: with product with key real 0m10.683s Fedora release 40 (Forty) on Windows 11 Pro I edited the manifest for Rancher 2.7.14 and took out 250+ lines and it still took 9 hours to download. The manifest for 2.7.14 is around 953 lines, correct? |
Beta Was this translation helpful? Give feedback.
-
@c-b-r It looks like something in that test failed out:
Both of those ran for only 10 seconds, when it should have been at least 2 hours. Could you try to run some of those commands in the script manually to see what failed, fix that and run the test again? Thanks! |
Beta Was this translation helpful? Give feedback.
-
Test Results from another system: Platform (bare metal, vSphere, Proxmox, etc)
CPU/RAM (4CPU x 4G RAM, etc)
Speedtest results
Geographic area that you're pulling from (State / City would be amazing to help track down bad routes)
|
Beta Was this translation helpful? Give feedback.
-
Omaha NE. I was talking to you earlier in the day about minifying the manifest file. I'll try to run the script again tonight, I just ran it when I left without watching it. |
Beta Was this translation helpful? Give feedback.
-
Closing until we get more reports of performance issues. |
Beta Was this translation helpful? Give feedback.
Closing until we get more reports of performance issues.