How do I clone the monitoring data database? In Bright 7.x the monitoring data is a database of the results of healthcheck and metric monitoring. CMDaemon deals with...
My LDAP server won’t start because the database is corrupt. How do I fix it? Possible causeOpenLDAP uses a storage format (BerkeleyDB) which is known to get corrupted if you don’t do a clean shutdown...
Can I mix Linux distributions on my cluster? Can the Bright Cluster Manager (BCM) be used to manage a mix of Linux distributions within the same cluster? Yes....
How do I set per-user resource limits with Cgroups? You can use cgroups to set various resource limits for users. For example, you might add an entry like this...
How do I migrate users to a new cluster? Sometimes the administrator may wish to migrate users from one cluster to another (e.g. when setting up a new cluster...
How can I set up a reverse proxy for the user portal from 7.1 onward? In this KB article we describe the steps needed to configure the Apache httpd web server as a reverse proxy...
How do I solve a yum update failure problem? Depends what the failure is. If it is due to a corrupt rpm database, then a cure that often works...
How do I migrate the cmd config to a new install? This article is being updated. Please be aware the content herein, not limited to version numbers and slight syntax changes, may...
How to Copy Bright Cluster configurations between two clusters Following either method, configurations can be copied between Bright clusters with the same major version (e.g. two Bright 9.2 clusters)....
Using NVIDIA GPUs in X-application on a headless node via VNC The following steps can be followed to enable direct rendering from an x-client (glxgears or similar) running on a headless...
Enabling Kdump (RHEL/CentOS) In the case where you need to diagnose kernel crash issues on a BCM managed cluster based on RHEL you...
Managing Kubernetes deployments with Lens Exploring Kubernetes clusters without having to learn kubectl commands is great both for developers just getting started as well as...
Scripting with cmsh Sometimes it’s useful to be able to run basic commands against cmsh in an automated way. We often take the...
How To Disable Port Detection In order to disable port detection-based node identification, you will need to clear the ‘ethernetswitch’ setting of the node, category,...
Enabling Kdump (Ubuntu) The instructions in this article can be followed to enable Kdump on Ubuntu 20.04, 22.04 and 24.04 compute nodes. As...
How to use Lmod spider cache? The BCM Lmod package is built with spider cache functionality. The recommended directory for storing this cache is /var/lib/lmod/mData/cacheDir, and...
How to disable a GPU on a node In certain scenarios disabling a node GPU can be necessary, for example when a GPU on a node becomes faulty...
How Can I Set up a Reverse Proxy for Base View and User Portal in BCM 10 and Later? In this KB article, we describe the steps needed to configure the Apache httpd web server as a reverse proxy...
Create persistent UDEV rules to rename the disks consistently based on HW address This article was tested on DGX OS 6.2 BCM10 Before listing the instructions that has to be followed to create...
How to avoid the ‘too many measurables’ event message The ‘too many measurables’ event messages are logged when a monitoring data producer has more than 500 measurables. When the...
Extended Validation of HA Clusters This article will describe additional validation tests that can be completed on a HA cluster to confirm proper operation. This...
How Do I Add a BCM ISO as an APT Repository on BCM Ubuntu Clusters? First, mount the BCM ISO on the head node. For example: # mount -o loop,ro bcm-11.0-ubuntu2404.iso /mnt For installing updates...