Cluster resources go comatose on upgraded OES2 SP1 cluster node

  • 7002593
  • 09-Feb-2009
  • 09-Jul-2013

Environment

Novell Cluster Services 1.8.4
Novell Open Enterprise Server 2 (OES 2)
Novell Open Enterprise Server 2 SP1 (OES 2 SP1)

Situation

Purpose:
A two-node cluster is being upgraded from OES2 to OES2 SP1. The resources have been successfully migrated to one node so that the other node can be upgraded. After the upgrade of the first node is complete, the resources need to be migrated back so that the second node can be upgraded.

Symptoms:
Pool resources cannot be brought online on the upgraded node; they go into a comatose state.

File "/var/log/evms-daemon.log" reports the following error message:

Daemon: engine_user_message: Message is: Daemon: There was an error when connecting to Novell-NCS.  The error code was 11: Resource temporarily unavailable  EVMS will only manage local devices on this system.

File "/var/log/boot.msg" reports the following error:

Starting EVMSEngine: The plug-in Novell-NCS in module /lib/evms/2.5/ncs-1.0.0.so failed to load. The plug-in's setup_evms_plugin() function failed with error code 13: Permission denied.
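
To check whether a node shows both symptoms, the two log files can be searched for the messages quoted above. The following is a minimal sketch, not a Novell-supplied tool. It assumes Python 3 is available, which may not be the case on a stock OES 2 node; in that case it can be run against copies of the logs on another machine by passing different paths. The names SIGNATURES, matching_lines and scan are illustrative only.

```python
from pathlib import Path

# Substrings taken from the two error messages quoted in this document.
SIGNATURES = {
    "/var/log/evms-daemon.log": "There was an error when connecting to Novell-NCS",
    "/var/log/boot.msg": "The plug-in Novell-NCS in module",
}


def matching_lines(text, needle):
    """Return the lines of `text` that contain `needle`."""
    return [line for line in text.splitlines() if needle in line]


def scan(signatures=SIGNATURES):
    """Map each readable log path to the lines that contain its signature."""
    hits = {}
    for path, needle in signatures.items():
        try:
            text = Path(path).read_text(errors="replace")
        except OSError:
            continue  # log is missing or unreadable on this machine
        hits[path] = matching_lines(text, needle)
    return hits


if __name__ == "__main__":
    for path, lines in scan().items():
        print(f"{path}: {len(lines)} matching line(s)")
        for line in lines:
            print("  " + line)
```

A non-zero count for both files suggests the node matches the symptoms described here. An empty result only means the exact wording was not found, not that EVMS is healthy.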

Resolution

The upgrade from OES2 to OES2 SP1 caused faulty behaviour in EVMS, which is required for Novell Cluster Services to run properly on OES Linux.

To troubleshoot this issue, read TID 7001928, "EVMS: After updating, odd behavior seen on systems with EVMS volumes, disks or multipath devices". If it applies to your problem, follow the steps outlined in it.