Base Command Manager / Bright Cluster Manager Release Notes
Release notes for Bright 9.2-6
== General ==
- New Features
* Add support for Weave networking in Kubernetes
* Add support for installing Kubernetes 1.24
- Improvements
* Update Ubuntu 20.04 to 20.04.5
* Update CoreDNS to 1.9.4 for Kubernetes 1.24
* Update Lua to 5.4.4 (CVE-2022-28805)
* Add mlnx-ofed57 packages for installing the Mellanox OFED 5.7 stack
- Known issues
* Upgrading the CM packages on SLES12 can result in a conflict between the cm-dhcp package and the base distribution dhcp packages. When prompted, it is safe to answer “yes” to replace the conflicting dhcp files with the files from cm-dhcp.
== CMDaemon ==
- Improvements
* Exclude all /snap/.* mount points from the "procmounts" sampler, which otherwise creates unnecessary metrics in CMDaemon
* Copy the file cluster.csr.new to all head nodes during install-license
* Increase the default for Kubernetes kubelet's --max-pods from 50 to 110 for new installations
* Allow the administrator to configure the proxy-mode in the Kubernetes kube-proxy role via the revision parameter
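
The /snap/.* exclusion noted above can be illustrated with a minimal sketch. This is not CMDaemon's implementation: the function name filter_mount_points and the sample data are hypothetical. It only shows how a pattern of that form drops snap loop mounts from /proc/mounts-style input while leaving other mount points, including a similarly named one such as /snapshots, untouched.

```python
import re

# Pattern taken from the release note: any mount point under /snap/.
SNAP_MOUNT = re.compile(r"^/snap/.*")


def filter_mount_points(proc_mounts_text):
    """Return mount points from /proc/mounts-style text, skipping /snap/ entries.

    Hypothetical helper for illustration only; not part of CMDaemon.
    """
    kept = []
    for line in proc_mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        mount_point = fields[1]  # the second field is the mount point
        if SNAP_MOUNT.match(mount_point):
            continue  # snap loop mounts would only add noise to the metrics
        kept.append(mount_point)
    return kept


# Made-up sample in the layout of /proc/mounts.
sample = """\
/dev/sda1 / ext4 rw,relatime 0 0
/dev/loop0 /snap/core20/1611 squashfs ro,nodev,relatime 0 0
/dev/loop1 /snap/lxd/22923 squashfs ro,nodev,relatime 0 0
/dev/sdb1 /snapshots ext4 rw,relatime 0 0
tmpfs /run tmpfs rw,nosuid,nodev 0 0
"""

print(filter_mount_points(sample))
```

Because the pattern requires the trailing slash after /snap, a mount point such as /snapshots is still sampled.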
- Fixed Issues
* An issue with the JSON whoami API call returning a username instead of a profile
* An issue with removing OSDs from a Ceph cluster if the corresponding OSD nodes are down
* An issue where the version config file timestamps (versionconfigfiles=yes) are always set to the Unix epoch (1970)
* An issue with setting cmjob and cm-scale constraints for LSF, which overwrites other job requirements that were already specified
* An issue where, in some cases, the passive head node may be listed as DOWN due to a race condition between watching for new HPC jobs and CMDaemon loading the information for all nodes
* An issue where a cloud director power off may hang for up to a minute if the node is already off
* An issue with merging CMDaemon monitoring execution multiplexers into one, which results in only the last multiplexer being taken into account
* An issue where CMDaemon may not (re)generate the Slurm logrotate files in some cases, such as when the files are modified or deleted outside of CMDaemon
== Bright View ==
- Fixed Issues
* An issue with showing the correct queue name of WLM jobs
* An issue with removing Slurm generic resources (GRES) in Bright View
* An issue with updating Slurm queue parameters in Bright View
== Machine Learning ==
- New Features
* Deprecated ML package cm-chainer-py39-cuda11.2-gcc9
* Introduced ML package cm-cutensor-cuda11.7
* Introduced ML package cm-ml-distdeps-cuda11.7
* Introduced ML package cm-nccl2-cuda11.7-gcc9
* Introduced ML package cm-cudnn8.5-cuda11.7
== cm-kubernetes-setup ==
- Improvements
* Ensure kernel modules required for Istio are loaded on the hosts when setting up Kubernetes
- Fixed Issues
* An issue with conflicting containerd / containerd.io packages on Ubuntu when installing nvidia-docker2
* An issue where uninstalling Kubernetes crashes if a user with role bindings exists outside of Bright LDAP
== cm-scale ==
- Fixed Issues
* An issue with starting cloned nodes when using the default resources configuration
== cm-setup ==
- Fixed Issues
* Make cm-*-setup configuration file permissions more restrictive
== cm-wlm-setup ==
- Fixed Issues
* An issue with setting up pyxis/enroot in non-default/additional software images
== cmha-setup ==
- Fixed Issues
* An issue with generating the disk layout XML file for the primary head node when cloning it to the secondary head node, if the fstab file on the primary head node contains a commented-out swap entry
== cmsh ==
- New Features
* Allow the --start and --end arguments in a rangequery command to be specified as date/time stamps
- Improvements
* Print a warning if devices with duplicate IPs are being added or cloned
- Fixed Issues
* An issue where the "--user" option of the cmsh "rshell" command does not take effect
== pbspro2022 ==
- Improvements
* Add support for PBS Pro 2022