Why has my software suite vanished from the bunch of nodes I installed it on? How do I fix it? Most likely you installed the software directly onto those nodes instead of on the “software image”. Then, after a reboot...
How do I use Spack on my Bright cluster? Bright Cluster Manager provides a good selection of ready-to-use libraries and tools that are commonly used in a high performance...
How can I fix “Failed to initialize NVML: Driver/library version mismatch?” Background The “Failed to initialize NVML: Driver/library version mismatch?” error generally means the CUDA Driver is still running an older...
How to have Bright monitor a BittWare FPGA card If you have a BittWare FPGA card that can be inserted into a PCI/PCIe slot of a Bright-managed compute node...
How to install OpenStack Kolla-ansible on top of a Bright Cluster How to install OpenStack Kolla-ansible on top of a Bright Cluster with CentOS 8 ** This document was tested on...
How can I use Grafana to monitor multiple Bright clusters? As of Bright 8.2-24 / 9.0-12 / 9.1-2, it is possible to add the cluster as a data source to...
How do I use Grafana to visualize monitoring data from a Bright cluster? Although Bright View has extensive capabilities when it comes to visualizing monitoring information, it may be desirable to be able...
How do I validate that my DGX cluster is working properly? One of the best ways to stress test your DGX cluster is to use NVIDIA’s HPC benchmarks which can be...
Generating BIOS templates for Bright 9.1 Starting with Bright Cluster Manager 9.1, BIOS management is done using the Redfish API. This article describes how to generate...
How can I get access to nightly builds of packages? The packages you will find in the Bright repositories have gone through a QA process. Updated packages are released roughly...
Running Jupyter kernel with Conda (Anaconda/Miniconda) environments Bright Cluster Manager’s data science add-on provides many ML related packages that can be used to run AI workloads on...
Using enroot and pyxis in Bright Cluster Manager These instructions are not relevant for installations of Bright Cluster Manager 9.1 and newer. An integration with enroot and pyxis...
Enabling Kdump (RHEL/CentOS) In the case where you need to diagnose kernel crash issues on a BCM managed cluster based on RHEL you...
Deploying NICE DCV on a Bright cluster This article discusses deploying a NICE DCV server on Bright managed compute nodes.We recommend reviewing the excellent third-party NICE DCV...
Firefox issue – Secure Connection Failed If you are using firefox and failing to reach services like BrightView or the user portal with an error of...
How to Deploy Spark with Kubernetes on Bright 9.0, 9.1, 9.2. The steps described in this page can be followed to run a distributed Spark application using Kubernetes on Bright 9.0,...
How to create a Docker image to run Jupyter kernels This article demonstrates a procedure to create a Docker image which can be used to run Jupyter kernels via Kubernetes....
Upgrading Slurm This article will go over the steps needed to upgrade the Bright provided SLURM packages to a newer major version...
Installing Kubernetes on Air-Gapped Systems Kubernetes is most easily installed on a cluster that is able to access the internet. For clusters without internet access...
Installing and operating slurmrestd slurmrestd is a stateless REST compatible API to the slurm control plane. This article will go over the steps to...
Upgrading Kubernetes version 1.18 to 1.21 on a Bright 9.1 cluster. 1. Prerequisites This article is written with Bright Cluster Manager 9.1 in mind, where Kubernetes is currently deployed with the...
Installing Kubernetes on Air-Gapped Systems Kubernetes is most easily installed on a cluster that is able to access the internet. For clusters without internet access...
How do I ensure that the container images I run on my BCM cluster through Kubernetes are secure? Introduction This article describes deploying OpenClarity onto a BCM managed Kubernetes cluster for the purposes of security auditing and monitoring....
How do I add the Dell OpenManage tools to a Bright 9.2 Ubuntu 22.04 headnode? Installation instructions are available in the Dell OpenManage repositories. Dell OpenManage Repository The following example will install OpenManage version 11...
Enabling Kdump (Ubuntu) The instructions in this article can be followed to enable Kdump on Ubuntu 20.04, 22.04 and 24.04 compute nodes. As...