Base Command Manager / Bright Cluster Manager Release Notes

Release notes for Bright 9.1-9

== General ==

- New Features

* Support for Rocky Linux and migrated to Rocky Linux clusters
* More information about migrating CentOS clusters to Rocky Linux is available in the Knowledge Base article
* https://kb.brightcomputing.com/knowledge-base/how-do-i-migrate-my-bright-centos-8-cluster-to-rocky-linux-8/

- Improvements

* Updated CUDA 11.4 packages to 11.4.2

- Fixed Issues

* An issue with setting up the Kubernetes Dashboard when using uppercase hostnames
* In some cases, an issue with compiling the cuda-driver
* An issue with the modules.sh causing an error on /bin/sh compatible shells like dash
* In some cases, cmdaemon may create a "no-version" WLM module file if /cm/shared is not (yet) mounted

== cmdaemon ==

- Improvements

* Improved client SSL certificate handling
* Use the edge director role externally visible IP in the edge ISO script
* Fix a rare crash in monitoring entity/measurable linkage manager
* Prometheus cache optimizations
* Removed the option for using a null cipher for ssl traffic
* Fixed a possible race condition in taskrunner that can cause the monitoring samplers to stop working

- Fixed Issues

* An issue with parsing the CPUs number for AWS node instance types
* Improved log information when cmdaemon cannot bind to a port
* An issue with the schedulers healthcheck for Grid Engine in the case when a queue is in a "Subordinated" state
* Updating the scheduler parameter in the slurmserver role does not restart the slurmctld service
* Fixed a possible deadlock in labeled entity manager when a large number of nodes are adding prometheus data at the same time
* An issue with selecting the external network and gateway for an edge director when the BMC is also on the external network
* An issue with clearing the "Unknown error" state after a failed ping to a PDU
* An issue with Kubernetes label management when using mixed-case node names
* Removed the Slurm prolog and epilog scripts from the exclude lists, i.e. they now can be rsynced from the software image also when using sync or update modes

== node-installer ==

- Fixed Issues

* An issue with rsyslog segfaulting in node-installer images created for archos installs
* In some cases, an issue with the standalone script on RHEL/Centos 8.4
* An issue with booting nodes with a bond interface with multiple VLANs
* An issue with getclientid script when using bonded Mellanox ConnectX4-Lx ethernet interfaces

== buildmaster ==

- Improvements

* Added hid-microsoft kernel module to default RHEL8/CentOS 8 image kernel modules

== cmsh ==

- New Features

* Ability to list the available kernel versions for a software image

- Improvements

* Improved cmsh job filter command help

== cm-kubernetes ==

- Fixed Issues

* Ensure all kubernetes images are prefixed with the full registry name

== jupyter ==

- New Features

* Introduced support for OpenPBS in Jupyter Kernel Creator