• Deploying NICE DCV on a Bright cluster

    This article discusses deploying a NICE DCV server on Bright managed compute nodes.We recommend reviewing the excellent third-party NICE DCV upstream documentation, which is available here: NICE DCV (amazon.com) Before we start… This article doesn’t seek to replace the upstream documentation, rather details the integration with Bright Cluster Manager. This…

  • Enabling Kdump (RHEL/CentOS)

    You can use the below instructions to configure kdump on a Bright managed cluster. These instructions should work on RHEL/CentOS 7 and 8. 1. Install required packages Install the kexec-tools in the software image:# yum install kexec-tools –installroot=/cm/images/<image-name> 2. Modify the software image Configure the software image to allow crashkernel…

  • Troubleshooting provisioning issues on a system with SuperMicro BMC

    Please note this article shouldn’t replace the need to contact your vendor for guidance on hardware issues. Here are some steps that may assist in resolving provisioning issues with SuperMicro BMCs. Is the system running the latest BMC firmware? Check the vendor website. Have you attempted a BMC reset? “ipmitool…

  • Is it possible to clone the primary headnode from the secondary in an HA cluster?

    Important note: This process is generally used to recover a primary headnode from a failure state (filesystem corruption for example). This process doesn’t replace a good backup regime. If you intend to use this process to recover a primary headnode, we recommend contacting Bright Support first so we may assess…

  • How do I disable the Shorewall firewall on the headnode?

    *Please note:* Shorewall provides a NAT Masquerade rule that allows the compute nodes to access networks that are outside the cluster via the headnode. If you are okay with the compute nodes not having access to external networks or aren’t using the headnode as the default gateway for the compute…

  • Can I use Rufus to create a bootable USB drive?

    We recommend not using Rufus to create a bootable USB drive from the Bright ISO.Rufus changes the ISO Hybrid disk format which results in the Bright installer failing at the rootfs part of the Bright installer.

  • How can I fix “Failed to initialize NVML: Driver/library version mismatch?”

    The “Failed to initialize NVML: Driver/library version mismatch?” error generally means the CUDA Driver is still running an older release that is incompatible with the CUDA toolkit version currently in use. Rebooting the compute nodes will generally resolve this issue. If you do not wish to reboot the compute node,…

  • Enabling QOS in Slurm with Bright 8.2 and earlier releases.

    Here is an example of enabling QOS (Quality of Service) in SchedMD Slurm and applying the QOS to a partition using Bright. *Please note this is valid for Bright 8.2 and earlier releases. Finally update the options for the partition in Bright to use this new QOS configuration.