path: root/kubernetes/namespaces/monitoring
Commit message | Author | Age | Lines
* Remove PostgreSQL Exporter from Kubernetes | Joe Banks | 2024-06-02 | -55/+0
|
* Remove Kubernetes PostgreSQL Alerts | Joe Banks | 2024-06-02 | -29/+0
|
* Fix AlertManager Discord instance formatting | Joe Banks | 2024-05-27 | -1/+1
| We made a change to include the instance in alerts sent to Discord, but not all of our configured alerts send this field. As a result, we would have incorrectly formatted alerts being sent through to Discord which were tricky to read. The format template has now been changed to only conditionally render the instance label if it is present on a triggered alert.
* Take 15 minutes before alerting on high latency | Johannes Christ | 2024-05-20 | -2/+2
|
* Annotations.instance => Labels.instance | Joe Banks | 2024-05-18 | -1/+1
|
* Add instance to AlertManager Discord embeds | Joe Banks | 2024-05-17 | -1/+1
|
* Move AlertManager to 4 replicas | Joe Banks | 2024-05-16 | -1/+1
|
* Move AlertManager to pydis.wtf | Joe Banks | 2024-05-14 | -4/+5
|
* Move prometheus to pydis.wtf | Joe Banks | 2024-05-14 | -3/+4
|
* Update Grafana configmap to grafana.pydis.wtf | Joe Banks | 2024-05-14 | -2/+2
|
* Update Grafana ingress to grafana.pydis.wtf | Joe Banks | 2024-05-14 | -3/+3
|
* Stop alerting for slow GitHub webhook filter endpoint calls (#235) | jchristgit | 2024-04-29 | -2/+2
| These are directly forwarded to GitHub with no time-consuming processing done on the site. We would therefore be alerting for GitHub's slowness, which is rather useless.
* Update all secrets to new PostgreSQL service | Joe Banks | 2024-04-27 | -0/+0
|
* Exclude home and tag views from latency alerts | Johannes Christ | 2024-04-24 | -2/+2
| These are known issues and we probably won't do anything about them, so stop alerting us about them.
* Update ContainerOOMEvent alert | Joe Banks | 2024-04-17 | -4/+4
|
* Move Redis to databases namespace | Joe Banks | 2024-04-15 | -0/+0
|
* Move Grafana to monitoring namespace | Joe Banks | 2024-04-15 | -0/+151
|
* Remove pydis-site from nginx p99 report | Chris Lovering | 2024-04-15 | -1/+1
|
* Update Olli targets with new namespaces | Joe Banks | 2024-04-15 | -1/+1
|
* Move olli to monitoring namespace | Joe Banks | 2024-04-15 | -0/+23
|
* Remove custom postgres-exporter queries | Chris Lovering | 2024-04-09 | -11/+0
| This feature has been deprecated by postgres-exporter, and the only query currently defined in our file didn't work anyway.
* Enable sticky session cookie for AlertManager | Joe Banks | 2024-04-09 | -0/+5
| Add a session cookie to AlertManager to always route requests to the same instance.
* Exclude HTTP 444 from 4XX alerts as it is manually sent | Joe Banks | 2024-04-04 | -1/+1
|
* fix EOF in django alerts file | Joe Banks | 2024-04-02 | -1/+1
|
* update routing configuration for alertmanager | Joe Banks | 2024-04-02 | -1/+9
|
* update alert priorities | Joe Banks | 2024-04-02 | -5/+15
|
* Increase KubernetesNodeCordoned wait duration | Joe Banks | 2024-04-01 | -2/+2
|
* Line endings in memory.yaml | Joe Banks | 2024-04-01 | -1/+1
|
* Update wording of OOM alert | Joe Banks | 2024-04-01 | -1/+1
| Co-authored-by: jchristgit <[email protected]>
* exclude api:github-artifacts from django latency | Joe Banks | 2024-04-01 | -1/+1
|
* Add a sanity alert for when a node becomes unschedulable | Joe Banks | 2024-04-01 | -1/+10
|
* Add alerts for container OOMs | Joe Banks | 2024-04-01 | -0/+9
|
* Add etcd alerts for increase in failed requests | Joe Banks | 2024-04-01 | -0/+11
|
* Add some new Django alerts through Prometheus | Joe Banks | 2024-04-01 | -0/+20
|
* Add PagerDuty to AlertManager channels | Joe Banks | 2024-04-01 | -14/+17
|
* Support port pretty names as well as integer ports for SD | Joe Banks | 2024-04-01 | -1/+41
|
* Disable kubewatch for a number of resources | Chris Lovering | 2023-10-17 | -9/+9
| The sheer number of these webhooks being sent was taking up a significant amount of our available guild-wide ratelimit for webhooks.
* Disable kubewatch for pods | Chris Lovering | 2023-10-17 | -1/+1
| The sheer number of these webhooks being sent was taking up a significant amount of our available guild-wide ratelimit for webhooks.
* Copy all files from kubernetes repo into this one | Chris Lovering | 2023-08-14 | -0/+1441
This commit is a like-for-like copy of the [kubernetes repo](https://github.com/python-discord/kubernetes). Check that repo for commit history prior to this commit.
Co-authored-by: Amrou Bellalouna <[email protected]>
Co-authored-by: Bradley Reynolds <[email protected]>
Co-authored-by: Chris <[email protected]>
Co-authored-by: Chris Lovering <[email protected]>
Co-authored-by: ChrisJL <[email protected]>
Co-authored-by: Den4200 <[email protected]>
Co-authored-by: GDWR <[email protected]>
Co-authored-by: Hassan Abouelela <[email protected]>
Co-authored-by: Hassan Abouelela <[email protected]>
Co-authored-by: jchristgit <[email protected]>
Co-authored-by: Joe Banks <[email protected]>
Co-authored-by: Joe Banks <[email protected]>
Co-authored-by: Joe Banks <[email protected]>
Co-authored-by: Johannes Christ <[email protected]>
Co-authored-by: Kieran Siek <[email protected]>
Co-authored-by: kosayoda <[email protected]>
Co-authored-by: ks129 <[email protected]>
Co-authored-by: Leon Sandøy <[email protected]>
Co-authored-by: Leon Sandøy <[email protected]>
Co-authored-by: MarkKoz <[email protected]>
Co-authored-by: Matteo Bertucci <[email protected]>
Co-authored-by: Sebastiaan Zeeff <[email protected]>
Co-authored-by: Sebastiaan Zeeff <[email protected]>
Co-authored-by: vcokltfre <[email protected]>