Base Command Manager / Bright Cluster Manager Release Notes
Release notes for Bright 9.2-8
== General ==
- Improvements
* Integration with Run:ai
- Fixed Issues
* mlnx-ofed57: An issue where the mst service does not start when using mlnx-ofed57 due to the mst sysvinit service file being packaged as a systemd unit file
== CMDaemon ==
- New Features
* Add a cm-package-release-info tool which can determine the Bright 9.X-Y version of installed packages
- Improvements
* Introduce a new CPUUsage metric for compute and head nodes to show the percentage of the CPU usage
* Exclude the ram and loop devices from /sys/block/ from being sampled by the SysBlockStat monitoring data producer
* Allow for special OID for PDU load to be specified via the revision property
- Fixed Issues
* Disable the cgroup job metrics collection for users (user IDs) that cannot be found on the nodes
* An issue where the monitoringdrop command may drop the data only for the head node
* An issue with creating the LSF configuration when some node is converted from a compute to a submit-only host
* An issue where cloud node that have never been booted may have a status "unknown error"
* Automatically start slurmdbd when Slurm configuration is frozen in cmd.conf
* An issue where the lite daemon may not reconnect when the websocket is closed cleanly on the server side
== Head Node Installer ==
- Fixed Issues
* An issue with head node installations with Lmod where the DefaultModules.lua module file is not created by default, resulting in messages about empty LMOD_SYSTEM_DEFAULT_MODULES environment variable
== Machine Learning ==
- New Features
* Introduced ML package cm-onnx-pytorch-*-cuda11.7-*
* Introduced ML package cm-gpytorch-*-cuda11.7-*
* Introduced ML package cm-fastai2-*-cuda11.7-*
* Introduced ML package cm-pytorch-extra-*-cuda11.7-*
* Introduced ML package cm-xgboost-*-cuda11.7-*
* Updated cm-gcc9-* packages to v9.5.0
* Updated cm-pytorch-* packages to v1.13.0
* Updated cm-tensorflow2-* packages to v2.10.0
* Updated cm-gpytorch-* packages to v1.9.0
* Updated cm-fastai2-* packages to v2.7.0
* Updated cm-xgboost-* packages to v1.6.2
* Introduced ML package cm-pytorch-cuda11.7
* Introduced ML package cm-cub-cuda11.7
* Introduced ML package cm-tensorflow2-*-cuda11.7-*
* Introduced ML package cm-opencv4-*-cuda11.7-*
* Introduced ML package cm-ml-pythondeps-*-cuda11.7-*
* Deprecated ML packages for CUDA 11.2 and introduced new variants for CUDA 11.7
- Improvements
* Deprecated cm-openmpi4-cuda11.2-ofed47-gcc9 and cm-openmpi4-cuda11.2-ofed51-gcc9 packages
== cm-wlm-setup ==
- New Features
* Allow to use the "master" keyword in the cm-wlm-setup configuration as a placeholder for the real head node host name