F5F Stay Refreshed Power Users Networks Monitoring Network and UPS Systems

Monitoring Network and UPS Systems

Monitoring Network and UPS Systems

C
carp3
Senior Member
572
05-24-2025, 06:10 PM
#1
Hello all, I’ve done thorough research and believe my request is quite specific. Here’s a summary:
Current equipment: 4x Eaton 3000VA 9PX units, each with a 2/3kVA Extended Battery Module. All have independent power circuits from separate main boards. Each includes a Management Network Card, and we’ve installed SNMP 2 Server Racks. We run two Eaton systems per rack with load balancing, and every server with two PSUs is connected to a UPS tailored for its load. We currently use Dell Servers with VMware (Vcenter) on one host; HA isn’t available due to licensing.

We also have several bare-metal servers running Windows and Ubuntu. The software we use (IPM2) requires a VM, even though it could work on a non-critical host. I often face issues with clean shutdowns in Guest OS, causing hosts to hang instead of shutting down. We aim for about 80% effectiveness.

The main goal is to have a solution that not only connects to UPS via SNMP but also responds to alerts and initiates shutdowns automatically—especially during power loss or high temperatures. We need coverage for two key scenarios:
A. When power is out and UPS battery hits 60%, triggering shutdown (via actions or scripts).
B. When sensor temperature reaches a critical level, prompting shutdown.

Even if one UPS fails, we want to detect it and start the process. Unfortunately, existing tools like PRTG, Zabbix, Observium, etc., don’t fully meet our needs—they either monitor SNMP or lack robust automation.

I’ve explored NUT but feel uncertain about diving deep. Alternatives such as LanSweeper, NinjaOne, Nagios, CheckMk are available, but none offer the full integration we desire.

We require a platform that can act on UPS status changes and manage shutdowns during off-hours, especially since our AD environment relies on static IPs. Ideally, it should support remote control during deep sleep.

My priority is a system that balances reliability, automation, and ease of management without depending heavily on cloud services. I’m open to discussing other options or hearing from others who’ve tackled similar challenges.

Thanks for your time, and I look forward to your insights.
C
carp3
05-24-2025, 06:10 PM #1

Hello all, I’ve done thorough research and believe my request is quite specific. Here’s a summary:
Current equipment: 4x Eaton 3000VA 9PX units, each with a 2/3kVA Extended Battery Module. All have independent power circuits from separate main boards. Each includes a Management Network Card, and we’ve installed SNMP 2 Server Racks. We run two Eaton systems per rack with load balancing, and every server with two PSUs is connected to a UPS tailored for its load. We currently use Dell Servers with VMware (Vcenter) on one host; HA isn’t available due to licensing.

We also have several bare-metal servers running Windows and Ubuntu. The software we use (IPM2) requires a VM, even though it could work on a non-critical host. I often face issues with clean shutdowns in Guest OS, causing hosts to hang instead of shutting down. We aim for about 80% effectiveness.

The main goal is to have a solution that not only connects to UPS via SNMP but also responds to alerts and initiates shutdowns automatically—especially during power loss or high temperatures. We need coverage for two key scenarios:
A. When power is out and UPS battery hits 60%, triggering shutdown (via actions or scripts).
B. When sensor temperature reaches a critical level, prompting shutdown.

Even if one UPS fails, we want to detect it and start the process. Unfortunately, existing tools like PRTG, Zabbix, Observium, etc., don’t fully meet our needs—they either monitor SNMP or lack robust automation.

I’ve explored NUT but feel uncertain about diving deep. Alternatives such as LanSweeper, NinjaOne, Nagios, CheckMk are available, but none offer the full integration we desire.

We require a platform that can act on UPS status changes and manage shutdowns during off-hours, especially since our AD environment relies on static IPs. Ideally, it should support remote control during deep sleep.

My priority is a system that balances reliability, automation, and ease of management without depending heavily on cloud services. I’m open to discussing other options or hearing from others who’ve tackled similar challenges.

Thanks for your time, and I look forward to your insights.