Our Linux support group used IEM to patch the glibc bug this weekend. They used a Baseline to target Redhat 5 & 6 systems, ~250 of them.
They configured the action to remain ‘active’ for 5 hours, from 1/31/2015@11PM until 2/1/2015@4AM.
The configured a Post Action restart, but they inadvertently left the Post-Action reboot deadline at 1 day.
Some percentage (as yet undetermined) of the machines are now refusing SSH sessions with the following message …
The system is going down on Fri Jan 30 23:06:09 2015
IBM Endpoint Manager Restart (Force count:2) from ActionID xxxxxx
Connection closed by nnn.nnn.nnn.nnn
This message was generated circa 2/2/2015 @ 8:37am when the Unix admins started to check the status of their systems. Given the fact that the Reboot Deadline extended beyond the Action Deadline, is this ‘expected’ behavior?
All the services on the machines are function, but nobody can SSH into a large number of the systems. Restarting the systems resolves the problem, but simply cycling the BESClient service doesn’t.