[pfsync multiqueue] tcp connections dropped when pfsync is active and multiqueue is enabled #8059
Open
2 tasks done
Labels
support
Community support
Important notices
Before you add a new report, we ask you kindly to acknowledge the following:
Describe the bug
When HA is configured and pfsync is active a low precentage (2-4%) of TCP connections are dropped.
Tip: to validate your setup was working with the previous version, use opnsense-revert (https://docs.opnsense.org/manual/opnsense_tools.html#opnsense-revert)
To Reproduce
Steps to reproduce the behavior:
fail=0;success=0;while :;do curl -o /dev/null -m 30 https://fsn1-speed.hetzner.com/1GB.bin && echo "success $((++success)) fail $fail" || echo "success $success fail $((++fail))";done
"success 12539 fail 514"
When the issue occurs, curl's current download rate drops to 0, no more packets are recieved.
Assumption: the TCP state is removed and all packets are dropped.
Expected behavior
"success 12539 fail 0"
No dropped TCP connections. When pfsync is disabled, no connections are dropped.
Describe alternatives you considered
Sync compatibility 24.1 or 24.7 does not change anything, neither does multicast vs. unicast pfsync change anything. Only disabling pfsync or setting the unicast ips to other ips than the master or backup.
I recreated the same setup with voldemort 2.7.2 to rule out general pfsync issues or issues from environment/load. This setup does not show the issue on same machine type and virtualization environment, even on the same hypervisor and bridge over several hours.
Additional context
The issue is probably there for several months if not years. We have had issues especially with docker layer updates stalling. We could never pinpoint this until now. This also goes away when turning off pfsync.
Environment
Software version used and hardware type if relevant, e.g.:
OPNsense 24.7.8
Proxmox VE 8.2.7
The text was updated successfully, but these errors were encountered: