Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Helm release generic-prometheus-alerts to v1 #2265

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

renovate[bot]
Copy link
Contributor

@renovate renovate bot commented Dec 23, 2024

This PR contains the following updates:

Package Update Change
generic-prometheus-alerts major 0.4.1 -> 1.11.3

Release Notes

ministryofjustice/hmpps-helm-charts (generic-prometheus-alerts)

v1.11.3

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.11.2

Compare Source

Various fixes to the elasticache alert yaml

v1.11.1

Compare Source

Fixes to the default elasticache alert thresholds.

v1.11.0

Compare Source

Update to Elasticache alerts, separating out the following alerts:

  • elasticache-freeable-memory (alert on a minimum FreeableMemory threshold)
  • elasticache-memory-utilisation (alert on a maximum DatabaseMemoryUsagePercentage threshold)

To configure Elasticache alerts, add into the helm values file under generic-prometheus-alerts: a list of cluster ids to monitor. For example:

elastiCacheAlertsClusterIds:
  elasticache-cluster-id: "Meaningful cluster display name"
  elasticache-cluster-2-id: "Meaningful cluster display name 2"

The alert thresholds can be configured with the following parameters:

### Maxiumum threshold for Elasticache EngineCPUUtilisation - value between 0 and 100.
elastiCacheAlertsEngineCPUThreshold: 85
### Duration over which to measure Elasticache EngineCPUUtilisation (minutes).
elastiCacheAlertsEngineCPUThresholdMinutes: 5
### Maximum threshold for Elasticache CPUUtilisation - value between 0 and 100.
elastiCacheAlertsCPUThreshold: 90
### Duration over which to measure Elasticache CPUUtilisation (minutes).
elastiCacheAlertsCPUThresholdMinutes: 5
### Minimum threshold for Elasticache FreeMemory - value should be above 150MB.
elastiCacheAlertsFreeMemoryThreshold: 150
### Duration over which to measure Elasticache FreeMemory (minutes).
elastiCacheAlertsFreeMemoryThresholdMinutes: 5
### Maximum threshold for Elasticache DatabaseMemoryUsagePercentage - value between 0 and 100.
elastiCacheAlertsMemoryUsageThreshold: 75
### Duration over which to measure Elasticache DatabaseMemoryUsagePercentage (minutes).
elastiCacheAlertsMemoryUsageThresholdMinutes: 5

v1.10.0

Compare Source

Add in ingress 2xx response monitoring (default disabled) to ensure that enough traffic is going to a service.

Can be enabled by setting ingress2xxEnabled to true.
Controlled by:

  • ingress2xxWindowMinutes: number of minutes to sample over, default 60.
  • ingress2xxThreshold: number of requests to expect in the time window, default 100.

v1.9.0

Compare Source

Remove old unused applicationAlertsEnabled configuration. See #​157 for changes.

v1.8.0

Compare Source

Add in sqsNumberAlertQueueMappings new functionality to create separate prometheus alerts per mapping rather than one alert for all the queues. This then means that separate alerts are created in slack for each one and aren't included in a Show more group.

v1.7.0

Compare Source

https://github.com/ministryofjustice/hmpps-helm-charts/pull/152: Attempt to stop failing cronjobs from falsely being marked as resolved if they are actually running at the time that they are checked by Prometheus. We do this by counting the failures over the last 5 minutes (configurable by applicationCronJobStatusFailedWindowMinutes) rather than at a specific point in time.

v1.6.0

Compare Source

Increases the threshold for the 5xxErrorResponsesOnHealthEndpoint alert from 0 to 0.004 (configurable via the ingress5xxHealthEndpointThreshold property`). This is to attempt to stop false positives from firing for health alerts.

v1.5.0

Compare Source

Remove KubeCronJobRunning job as was causing lots of false positives

v1.4.1

Compare Source

Bug fix release so that when applicationAlertsEnabled is set to false no alert rules are created.

v1.4.0

Compare Source

Add in ability to turn off the application alerts by setting applicationAlertsEnabled to false

v1.3.12

Compare Source

5xxErrorResponses alert altered to now target all paths except /health (bug introduced in 1.3.9 that meant that it only targetted /).

v1.3.11

Compare Source

Do not use - broken since 1.3.9. Fixed in 1.3.12. Updated alert test rule path

v1.3.10

Compare Source

Do not use - broken since 1.3.9. Fixed in 1.3.12. Creates a set of standard prometheus alert rules for a given application

v1.3.9

Compare Source

Do not use - ingress matching and path matching broken in this version. Fixed in 1.3.12.

v1.3.8

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.3.7

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.3.6

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.3.5

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.3.4

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.3.3

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.3.2

Compare Source

Fixes bug in 1.3.1 when business hours enabled

v1.3.1

Compare Source

This release extends from 1.3.0 and configures the business hours restrictions for all alerts. It also takes into account daylight saving so will alert 8am to 6pm.

Broken release - does not work when business hours enabled

v1.3.0

Compare Source

Added functionality to only create alerts during business hours. Setting

businessHoursOnly: true

will restrict to 7am to 6pm UTC (8am to 7pm BST) on week days (Monday to Friday).

v1.2.4

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.2.3

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.2.2

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.2.1

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.2.0

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.1.0

Compare Source

Creates a set of standard prometheus alert rules for a given application

v1.0.0

Compare Source

Creates a set of standard prometheus alert rules for a given application


Configuration

📅 Schedule: Branch creation - "after 9am every weekday,before 5pm every weekday" in timezone Europe/London, Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

0 participants