How can I use CUDA MPS with Bright? Introduction CUDA MPS is a feature that allows multiple CUDA processes to share a single GPU context. A CUDA program...
How can I fix “Failed to initialize NVML: Driver/library version mismatch?” Background The “Failed to initialize NVML: Driver/library version mismatch?” error generally means the CUDA Driver is still running an older...
How can I run a simple test to stress test my GPUs? Make sure CUDA, git and cmake are installed on the head node of the cluster: Clone the Multi GPU Benchmark...
General considerations for installing a Bright DGX cluster Loading the correct kernel modules If you are going to use the built-in gigabit Ethernet interface as your internal cluster...