Skip to content
This repository has been archived by the owner on Oct 16, 2024. It is now read-only.

Monitor for stuck terminating pods #59

Open
ziegeer opened this issue May 21, 2020 · 0 comments
Open

Monitor for stuck terminating pods #59

ziegeer opened this issue May 21, 2020 · 0 comments

Comments

@ziegeer
Copy link
Contributor

ziegeer commented May 21, 2020

As per the 2020-05-18 Sumo Incident Report, we had pods stuck in a terminating state for ~2 days which isn't right. It's not clear to me how we'd monitor this as the service was running fine and had the desired number of pods but let's try to find a way!

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant