Base Command Manager / Bright Cluster Manager Release Notes

Release notes for Bright 8.1-5

== General ==

- New Features

* Nodes are now always rebooted in the case when nvidia-docker package is deployed

- Improvements

* Report correct error on insufficient space during head node installation

== cmdaemon ==

- Fixed Issues

* Power environment not set in super power scripts
* An issue with determining SGE (OGS) job submission time
* In some cases, cmdaemon crash when finding all nodes in a rack
* An issue with changing user home directories across devices
* Rare cm-nfs-checker core dump
* In some cases, node ldap certificate can have wrong ownership
* Improved validation of long names in CSR
* Increased the cm-nfs-checker buffer size for very long paths
* An issue with long node names drain status for UGE/OGS
* An issue with cm-repair-cmdaemon-db
* An issue with removing old ramdisk files in /tftpboot on the passive head node

== node-installer ==

- Fixed Issues

* Skip DNS reverse lookup for rsync in node-installer

== Bright View ==

- New Features

* Added average GPU utilization metric
* Added global search capability
* Use indexInsideContainer property to display correctly devices in the rack view
* Added embedded shell

- Improvements

* New Basic/Advanced modes for accounting
* Cycle through multiple metrics in rack view

== cm-scale ==

- Fixed Issues

* In some cases, cm-scale could stop the extra node before the compute node was stopped

== cmsh ==

- Improvements

* Added "ls" alias for "list"

- Fixed Issues

* cmsh clone user does not set user name, but user ID
* An issue with use / remove of a static route
* An issue with cmsh bmcsettings command for category

== cod ==

- New Features

* COD-OS: using cluster create --store-head-node-ip now writes the IP earlier, which means the file will be available even if waiting for cmdaemon times out
* Pressing Ctrl+C twice quickly is now required to terminate the script during the execution
* COD-OS: The create cluster command now shows the head node hypervisor if exposing this information was enabled via nova's policy.json

- Improvements

* Ask interactively for the user password if it is not set in the environment variables

- Fixed Issues

* An issue that prevented "cluster list" from showing inactive clusters
* An issue with calculating the total requested node count

== monitoring ==

- New Features

* Added unit to PromQL queries
* Changed job queries to use range intervals by default

- Improvements

* Added extra flags to sample_ipmi for more fine grained control
* Removed the limit on the number of labelled entities
* Openstack interfaces are now added to default exclude list for proc dev net sampler
* Allow the trigger entity matcher to also match by type and resource

- Fixed Issues

* Metric parameter not shown in send email action
* Allow monitoring storage defaults to be changed with AdvancedConfig
* Sample now results can appear twice when timed sampling is done at exactly the same time
* Reduced overhead of the smart monitoring sampler