How do I clone the monitoring data database? Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...
My LDAP server won’t start because the database is corrupt. How do I fix it? Possible causeOpenLDAP uses a storage format (BerkeleyDB) which is known to get corrupted if you don’t do a clean shutdown...
Can I mix Linux distributions on my cluster? Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...
How do I set per-user resource limits with Cgroups? You can use cgroups to set various resource limits for users. For example, you might add an entry like this...
How do I migrate users to a new cluster? Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...
How do I solve a yum update failure problem? Background BCM support frequently receives this question. While support for the underlying OS is out of scope, these are some...
How to Copy Bright Cluster configurations between two clusters Following either method, configurations can be copied between Bright clusters with the same major version (e.g. two Bright 9.2 clusters)....
Using NVIDIA GPUs in X-application on a headless node via VNC The following steps can be followed to enable direct rendering from an x-client (glxgears or similar) running on a headless...
Enabling Kdump (RHEL/CentOS) Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...
Managing Kubernetes deployments with Lens Exploring Kubernetes clusters without having to learn kubectl commands is great both for developers just getting started as well as...
Scripting with cmsh Sometimes it’s useful to be able to run basic commands against cmsh in an automated way. We often take the...
How To Disable Port Detection In order to disable port detection-based node identification, you will need to clear the ‘ethernetswitch’ setting of the node, category,...
Enabling Kdump (Ubuntu) Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...
How to use Lmod spider cache? The BCM Lmod package is built with spider cache functionality. The recommended directory for storing this cache is /var/lib/lmod/mData/cacheDir, and...
How to disable a GPU on a node In certain scenarios disabling a node GPU can be necessary, for example when a GPU on a node becomes faulty...
How Can I Set up a Reverse Proxy for Base View and User Portal in BCM 10 and Later? Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...
Create persistent UDEV rules to rename the disks consistently based on HW address Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...
How to avoid the ‘too many measurables’ event message The ‘too many measurables’ event messages are logged when a monitoring data producer has more than 500 measurables. When the...
Extended Validation of HA Clusters Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...
How Do I Add a BCM ISO as an APT Repository on BCM Ubuntu Clusters? Notice of Knowledge Base Relocation Our Knowledge Base has been relocated to the NVIDIA Enterprise Support Portal. This update is...