Consolidation — how can I see it is working? Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...
How do I run a health check only for a specific set of nodes? Purpose In some cases, it may not make sense to have a health check run on all nodes in a...
How do I enable monitoring for the Intel Arria 10 GX FPGA? Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...
Monitoring integration with Slack Slack integration script for Bright Cluster Manager Slack integration with Bright Cluster Manager monitoring is available via a built-in action...
Why do I have no monitoring history for disk-related metrics? (or, approaching this issue from the other side) Too many disks are causing too many metrics — what now? This...
How can I monitor a CoolIT rack with Bright? Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...
Why is the kipmi0 process consuming so much CPU time? This is known to occur with some BMC controllers. It can occur for several reasons including the following: kipmi0 is...
How do I trigger a script when a node comes up? Is it possible to setup a trigger event (run a script) upon node becoming UP and available? Yes, this is...
How can I use Grafana to monitor multiple Bright clusters? Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...
How do I use Grafana to visualize monitoring data from a Bright cluster? Although Bright View has extensive capabilities when it comes to visualizing monitoring information, it may be desirable to be able...
How do I tune/adjust monitoring? Depending on the size, complexity, and hardware of your cluster the defaults for monitoring may be too aggressive. Below are...
How do I change the monitoring data backup location on my HA cluster? Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...