How do I call Slurm utilities from nodes that run no slurmdbd? On a node, some Slurm utilities such as sacct need to connect to the slurmdbd daemon. slurmdbd in turn talks...
Does Bright write to the Slurm Accounting Database? Bright Cluster Manager does not write to the Slurm accounting database.
How do I share resources in Slurm? By default, Slurm is configured such that it allocates an entire node to a job which requests a subset of...
X11-Forwarding does not work with Slurm, what now? Bright Cluster Manager generates the SSH keys for cluster users using the ECDSA algorithm instead of RSA. However, this might cause...
How do I run interactive Ansys Fluent using Slurm? This article is being updated. Please be aware the content herein, not limited to version numbers and slight syntax changes, may...
Slurmctld shows the error “we don’t have select plugin type 102” This article is being updated. Please be aware the content herein, not limited to version numbers and slight syntax changes, may...
How do I configure SLURM Fair Shares? SLURM Fair Sharing can be configured using the sacctmgr tool. The following example illustrates how 50% Fair Sharing between two...
Enabling QOS in Slurm with Bright 8.2 and earlier releases. Here is an example of enabling QOS (Quality of Service) in SchedMD Slurm and applying the QOS to a partition...
How can I run a simple test to stress test my GPUs? Make sure CUDA, git and cmake are installed on the head node of the cluster: Clone the Multi GPU Benchmark...
How should I set up Slurm on a DGX cluster? Background: A workload management system is helpful for scheduling jobs on a cluster of nodes. The steps below describe how...
Using enroot and pyxis in Bright Cluster Manager These instructions are not relevant for installations of Bright Cluster Manager 9.1 and newer. An integration with enroot and pyxis...
Prepopulating the Jupyterhub Launcher with Kernels Purpose: Bright’s Jupyter integration provides a rich set of features that offer a point-and-click interface to the underlying compute cluster....
Upgrading Slurm This article will go over the steps needed to upgrade the Bright-provided SLURM packages to a newer major version...
Installing and operating slurmrestd slurmrestd is a stateless REST-compatible API to the Slurm control plane. This article will go over the steps to...
How to run SLURM jobs in Singularity containers via Jupyter In this article we demonstrate a procedure to run SLURM jobs in Singularity containers via Jupyter on...
Why is sshare for Slurm 23.02 complaining about the priority/basic plugin? If you run the “sshare” tool for Slurm 23.02 on your cluster, you may see the following error output: $...
Adding a job queue or altering node settings for a subset of nodes in SLURM These instructions were completed using BCM 10 on Ubuntu 22.04 but should work for all supported platforms 9.0 and higher....
How to Use a Compute Node as a Redundant Slurm Controller? Let’s say that the BCM cluster already has Slurm deployed and that the cluster has one head node that is...