How do you all monitor your server performance?

Michaelscarn69-@alien.top · 3 years ago

How do you all monitor your server performance?

Dizzybro@alien.top · 3 years ago

The fastest way? Probably netdata

SadanielsVD@alien.top · 3 years ago

This. If you have more servers, you can also connect them all to a single UI where you can see all the info at once, using Netdata Cloud.

Spaceman_Splff@alien.top · 3 years ago

Just set this up yesterday. I used a parent node and have all my VMs point to that. Took like an hour to figure it out.

scotrod@alien.top · 3 years ago

Hey, did you use the cloud functionality or not? I’m tryna go all local with parent-child kind of capability but so far unable to.

Spaceman_Splff@alien.top · 3 years ago

The parent is still visible to the cloud portal. My understanding is that the data all resides locally, but when you log in to their cloud portal, it connects to the parent to display the information. I’m still playing with it to confirm. My parent node shows all the child nodes on the local interface, but the cloud still shows them all.

Spaceman_Splff@alien.top · 3 years ago

I don’t know if I’ll keep running this. The child nodes are already complaining about increased write delays since I installed the agents on them.

Michaelscarn69-@alien.top · 3 years ago

I’ll look into this too. Thank you.

weller_rocks@alien.top · 3 years ago

agreed … BY FAR the fastest. Easiest learning curve as well

AstrologicalMob@alien.top · 3 years ago

I currently use the classic “Huh, seems slow” approach: check basic things like disk usage and process CPU/RAM usage, then “I’ll do a reboot to fix it for now”.
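For anyone who wants to script those basic checks, here is a minimal sketch using only the Python standard library. The function name `quick_health` and the 90% disk threshold are assumptions made for illustration, and `os.getloadavg()` is only available on Unix-like systems.

```python
import os
import shutil


def quick_health(path="/", disk_warn=0.90):
    """Return a few basic health numbers for the local machine.

    `path` and `disk_warn` are illustrative defaults, not recommendations.
    """
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    disk_fraction = usage.used / usage.total if usage.total else 0.0

    load1, _load5, _load15 = os.getloadavg()  # Unix only
    cores = os.cpu_count() or 1

    return {
        "disk_used_fraction": round(disk_fraction, 3),
        "disk_warning": disk_fraction >= disk_warn,
        # A 1-minute load well above 1.0 per core suggests CPU or I/O pressure.
        "load_per_core_1m": round(load1 / cores, 2),
    }


if __name__ == "__main__":
    print(quick_health())
```

Running it from cron and mailing the output when `disk_warning` is true would be one low-effort step up from rebooting.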

Nagashitw@alien.top · 3 years ago

This is me. Can’t hurt to just do a reboot

dibu28@alien.top · 3 years ago

Windows Server? )

Mother_Construction2@alien.top · 3 years ago

I know that it needs a fix when my dad complains that he can’t watch TV and the rolling door doesn’t open in the morning.

Theon@alien.top · 3 years ago

Netdata. I’ve been meaning to look into Grafana, but it always seemed way too complicated and heavy for my purposes. Maybe one day, though…

weller_rocks@alien.top · 3 years ago

I thought the same thing, but it’s not bad actually. There are some pre-built dashboards you can import for common metrics from Linux, Windows, firewalls, etc.

netdata is much better though (IMHO)

HCharlesB@alien.top · 3 years ago

Checkmk (Raw - free version.) Some setup aspects are a bit annoying (wants to monitor every last ZFS dataset and takes too long to ‘ignore’ them one by one.) It does alert me to things that could cause issues, like the boot partition almost full. I run it in a Docker container on my (primarily) file server.

TheDeepTech@alien.top · 3 years ago

I use this as well! Works well and has built in intelligence for thresholds.

how_now_brown_cow@alien.top · 3 years ago

TICK stack is the only answer

maximus459@alien.top · 3 years ago

Observium…

If it’s just one server, Netdata is a better option…

BouncyPancake@alien.top · 3 years ago

If it’s down, I assume performance is bad

lestrenched@alien.top · 3 years ago

I came across monit recently, seems nice

thibmaek@alien.top · 3 years ago

Quick checks: Proxmox dashboard, htop or glances, Portainer

Extensive monitoring: Prometheus (node-exporter), Rsyslog server, Loki, Grafana, Uptime Kuma, Alertmanager (via Gotify)

Savancik@alien.top · 3 years ago

Girlfriend first Alert Manager second. Girlfriend is usually faster.

Majestic-Contract-42@alien.top · 3 years ago

If one of my users ever complained about anything I would possibly look into it, otherwise it all works so I don’t waste life energy on that.

Do_TheEvolution@alien.top · 3 years ago

Prometheus + Grafana + Loki

It is a bit difficult at the start, but in the end you can monitor and get notifications on anything that’s happening on your system.

Large_Yams@alien.top · 3 years ago

I don’t track their performance, I just track if they’re up or down.

I use Uptime Kuma running on a free tier of fly.io so I can tell if my cluster has had a catastrophic failure. There’s no point in running the alerting system on the same system it monitors.
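The up/down idea can be sketched in a few lines of Python, run from a machine outside the cluster. This is only a rough illustration of a TCP reachability check, not how Uptime Kuma works internally; the function name `is_up` is made up for the example.

```python
import socket


def is_up(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within `timeout`."""
    try:
        # create_connection resolves the host and connects, raising OSError
        # (including timeouts and DNS failures) if the service is unreachable.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Calling `is_up("your-server", 443)` on a schedule and alerting when it returns `False` covers the "is it up at all" case. The host name here is a placeholder.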

LumePart@alien.top · 3 years ago

Zabbix for hardware, certificate monitoring

Prometheus for service monitoring (e.g. how many people are actually using my Jellyfin server, so I know if I need to scale, etc.)
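To give a feel for what Prometheus scrapes in a case like this, here is a small sketch that renders one gauge in the Prometheus text exposition format. The metric name `jellyfin_active_sessions` and the helper `render_gauge` are hypothetical. A real setup would more likely use an existing exporter or the official client library, and this sketch does not escape special characters in label values.

```python
def render_gauge(name, help_text, value, labels=None):
    """Render a single gauge sample in Prometheus text exposition format."""
    label_str = ""
    if labels:
        # Labels are rendered as key="value" pairs inside braces, sorted for stable output.
        pairs = ",".join(f'{k}="{v}"' for k, v in sorted(labels.items()))
        label_str = "{" + pairs + "}"
    return (
        f"# HELP {name} {help_text}\n"
        f"# TYPE {name} gauge\n"
        f"{name}{label_str} {value}\n"
    )


if __name__ == "__main__":
    # Hypothetical metric: number of people currently streaming.
    print(render_gauge("jellyfin_active_sessions", "Active sessions", 3), end="")
```

Serving that text over HTTP at a `/metrics` path is what lets Prometheus scrape it on an interval.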