Monitor

All things monitoring related.

Cloud Native

My preferred stack: Prometheus, Grafana, Loki

Node Exporter: Prometheus exporter for server/OS statistics
Elk Stack for Log Monitoring: ELK tends to be a bit heavy, but keeping this around just in case
Changd: Notify if WebUI changes.
Performance related articles at https://www.brendangregg.com
Internet Monitoring (globally)
- AWS CloudWatch Internet Weather Map
- Contrack talkes - one thousand and one flows - Interesting article on monitoring the maximum number of entries in the Linux Contrack table, used for statefile firewall setup
- Pingdom’s State of the Internet
- Down Detector
- Oracle Internet Intelligence
- The Outage Mailing List - Network admins chatting about global issues
Internet Monitoring (locally)
- Open Speed Test: Browser based, no client login required.
- Trippy: More advance Traceroute

One Uptime: Open source observability platform - uptime monitoring, incident maganement, oncall alerts, logs, traces, etc (and maybe metrics, but not widely advertised)
OpenObserve: Open source, lightweight, single binary, drop in replacement for Elisticsearch, support OpenTelementry/OTEL
Signoz: Open source, lightweight, log, metrics, traces, all working with OpenTelementry
BindplaneOP: Manage sources that are OpenTelementry Specific

Purposely in the “monitor” phase of the DevOps cycle, as you do not want to prematurely optimize an architecture.

Also see plan for actual retrospective stores - as those are the basis for planning improvements

Steve Miller BY-NC 4.0 | Rendered by Hugo | Subscribe