Skip to content

Commit

Permalink
fix zone aware alertmanager http idle timeout
Browse files Browse the repository at this point in the history
  • Loading branch information
daanschipper committed Nov 7, 2024
1 parent f803fbb commit e2dc400
Show file tree
Hide file tree
Showing 4 changed files with 26 additions and 1 deletion.
1 change: 1 addition & 0 deletions operations/helm/charts/mimir-distributed/CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -33,6 +33,7 @@ Entries should include a reference to the Pull Request that introduced the chang
* [ENHANCEMENT] Add the possibility to create a dedicated serviceAccount for the `alertmanager` component by setting `alertmanager.serviceAcount.create` to true in the values. #9781
* [BUGFIX] Fix PVC template in AlertManager to not show diff in ArgoCD. #9774
* [BUGFIX] Fix how `fullnameOverride` is reflected in generated manifests. #9564
* [BUGFIX] Alertmanager: Set -server.http-idle-timeout to avoid EOF errors in ruler, also for zone aware Alertmanager #9851

## 5.5.1

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -147,11 +147,11 @@ spec:
{{- if .Values.alertmanager.zoneAwareReplication.enabled }}
- "-alertmanager.sharding-ring.instance-availability-zone=zone-default"
{{- end }}
{{- end }}
# Prometheus HTTP client used to send alerts has a hard-coded idle
# timeout of 5 minutes, therefore the server timeout for Alertmanager
# needs to be higher to avoid connections being closed abruptly.
- "-server.http-idle-timeout=6m"
{{- end }}
{{- range $key, $value := .Values.alertmanager.extraArgs }}
- "-{{ $key }}={{ $value }}"
{{- end }}
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -87,6 +87,10 @@ spec:
- "-config.expand-env=true"
- "-config.file=/etc/mimir/mimir.yaml"
- "-alertmanager.sharding-ring.instance-availability-zone=zone-a"
# Prometheus HTTP client used to send alerts has a hard-coded idle
# timeout of 5 minutes, therefore the server timeout for Alertmanager
# needs to be higher to avoid connections being closed abruptly.
- "-server.http-idle-timeout=6m"
volumeMounts:
- name: config
mountPath: /etc/mimir
Expand Down Expand Up @@ -214,6 +218,10 @@ spec:
- "-config.expand-env=true"
- "-config.file=/etc/mimir/mimir.yaml"
- "-alertmanager.sharding-ring.instance-availability-zone=zone-b"
# Prometheus HTTP client used to send alerts has a hard-coded idle
# timeout of 5 minutes, therefore the server timeout for Alertmanager
# needs to be higher to avoid connections being closed abruptly.
- "-server.http-idle-timeout=6m"
volumeMounts:
- name: config
mountPath: /etc/mimir
Expand Down Expand Up @@ -341,6 +349,10 @@ spec:
- "-config.expand-env=true"
- "-config.file=/etc/mimir/mimir.yaml"
- "-alertmanager.sharding-ring.instance-availability-zone=zone-c"
# Prometheus HTTP client used to send alerts has a hard-coded idle
# timeout of 5 minutes, therefore the server timeout for Alertmanager
# needs to be higher to avoid connections being closed abruptly.
- "-server.http-idle-timeout=6m"
volumeMounts:
- name: config
mountPath: /etc/mimir
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -110,6 +110,10 @@ spec:
- "-config.expand-env=true"
- "-config.file=/etc/mimir/mimir.yaml"
- "-alertmanager.sharding-ring.instance-availability-zone=zone-a"
# Prometheus HTTP client used to send alerts has a hard-coded idle
# timeout of 5 minutes, therefore the server timeout for Alertmanager
# needs to be higher to avoid connections being closed abruptly.
- "-server.http-idle-timeout=6m"
volumeMounts:
- name: config
mountPath: /etc/mimir
Expand Down Expand Up @@ -262,6 +266,10 @@ spec:
- "-config.expand-env=true"
- "-config.file=/etc/mimir/mimir.yaml"
- "-alertmanager.sharding-ring.instance-availability-zone=zone-b"
# Prometheus HTTP client used to send alerts has a hard-coded idle
# timeout of 5 minutes, therefore the server timeout for Alertmanager
# needs to be higher to avoid connections being closed abruptly.
- "-server.http-idle-timeout=6m"
volumeMounts:
- name: config
mountPath: /etc/mimir
Expand Down Expand Up @@ -414,6 +422,10 @@ spec:
- "-config.expand-env=true"
- "-config.file=/etc/mimir/mimir.yaml"
- "-alertmanager.sharding-ring.instance-availability-zone=zone-c"
# Prometheus HTTP client used to send alerts has a hard-coded idle
# timeout of 5 minutes, therefore the server timeout for Alertmanager
# needs to be higher to avoid connections being closed abruptly.
- "-server.http-idle-timeout=6m"
volumeMounts:
- name: config
mountPath: /etc/mimir
Expand Down

0 comments on commit e2dc400

Please sign in to comment.