Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Server doesn't work (nicely) with multiple replicas #42

Open
pat-s opened this issue May 13, 2023 · 2 comments
Open

Server doesn't work (nicely) with multiple replicas #42

pat-s opened this issue May 13, 2023 · 2 comments
Labels
bug 🐛️ Something isn't working

Comments

@pat-s
Copy link
Collaborator

pat-s commented May 13, 2023

When enabling it, runs triggered in the UI are "pending forever" and are somehow not sent to the agents properly. It seems only one server pod is able to do that and if a user gets a session with the other one, the behavior described above occurs.

I am not meaning to infer it should work, just noting it down for visibility etc.

Killing a "wrong" pod and getting assigned a new session with the "correct" one will let subsequent restarts be triggered successfully.

@anbraten
Copy link
Member

anbraten commented May 14, 2023

@pat-s Thanks for the testing. I guess the main issue is that the agents are connecting to one of the servers (probably grpc should be behind a load-balancer as well) and the servers do not share those agents with each other.
The queue with pipelines to be executed should however be saved to the database so each server with an agent should be able to propagate this pipeline to an agent. Not sure why this wasn't already working for you (maybe you can verify it again by reloading the UI / logs).
For the UI as you currently connect to a single server which is not getting real-time events from the others you have to reload the page each time to get the latest data via the database from the other servers.

We had an issue in the main repo where we "planned" HA a bit: woodpecker-ci/woodpecker#742

@pat-s pat-s added the bug 🐛️ Something isn't working label Aug 29, 2023
@genofire
Copy link
Contributor

to get the same user on some pod, it is easy: use the correct annotation on ingress for your ingress-controller.

but if the other application (woodpecker) does not support it (caching, storage ...) we still could wait till GatewayAPI support client session stickiness and then replace the ingress with it (so that the helm-chart user has not anymore to lookup for the correct annotations).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug 🐛️ Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants