sge_shadowd man page

sge_shadowd — Sun Grid Engine shadow master daemon




sge_shadowd is a lightweight process that can run on so-called shadow master hosts in a Sun Grid Engine cluster. It detects failure of the current Sun Grid Engine master daemon, sge_qmaster(8), and starts a new sge_qmaster(8) on the host where sge_shadowd runs. If multiple shadow daemons are active in a cluster, they run a protocol that ensures only one of them starts a new master daemon.

Hosts used as shadow master hosts must have shared root read/write access to the directory $SGE_ROOT/$SGE_CELL/common and to the master daemon spool directory (by default $SGE_ROOT/$SGE_CELL/spool/qmaster). The names of the shadow master hosts must be listed in the file $SGE_ROOT/$SGE_CELL/common/shadow_masters.
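The following is a minimal sketch of how an administrator's script might locate the shadow master hostname file and check whether a host is listed in it. It assumes the file holds one hostname per line, which this page does not state. The function names are hypothetical and are not part of Sun Grid Engine.

```python
import posixpath


def shadow_masters_path(sge_root, sge_cell="default"):
    """Build the path of the shadow master hostname file."""
    return posixpath.join(sge_root, sge_cell, "common", "shadow_masters")


def parse_shadow_masters(text):
    """Return the hostnames in the file content.

    Assumption: one hostname per line; blank lines are ignored.
    """
    return [line.strip() for line in text.splitlines() if line.strip()]


def is_shadow_master(hostname, text):
    """Case-insensitive membership test on the names as written.

    No DNS or alias resolution is attempted in this sketch.
    """
    wanted = hostname.lower()
    return any(name.lower() == wanted for name in parse_shadow_masters(text))
```

A script could read the file at shadow_masters_path(...) and pass its content to is_shadow_master() before starting sge_shadowd on a host.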


Restrictions

sge_shadowd may only be started by root.

Environment Variables


SGE_ROOT
Specifies the location of the Sun Grid Engine standard configuration files.


SGE_CELL
If set, specifies the default Sun Grid Engine cell. To address a Sun Grid Engine cell, sge_shadowd uses (in order of precedence):

The name of the cell specified in the environment variable SGE_CELL, if it is set.

The name of the default cell, i.e. default.
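The precedence above can be sketched as a small lookup. This is an illustration only: resolve_cell is a hypothetical helper, and treating an empty SGE_CELL value as unset is an assumption this page does not confirm.

```python
import os


def resolve_cell(environ=None):
    """Return the cell name: SGE_CELL if set, otherwise "default"."""
    environ = os.environ if environ is None else environ
    cell = environ.get("SGE_CELL")
    # Assumption: an empty value is treated the same as an unset variable.
    return cell if cell else "default"
```

Passing a plain dictionary instead of os.environ makes the lookup easy to exercise, for example resolve_cell({"SGE_CELL": "prod"}).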


SGE_DEBUG_LEVEL
If set, specifies that debug information should be written to stderr. The value also defines the level of detail of the debug output.


SGE_QMASTER_PORT
If set, specifies the TCP port on which sge_qmaster(8) is expected to listen for communication requests. Most installations instead define that port with a services map entry for the service "sge_qmaster".
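As a rough sketch, a client script could resolve the master port the same way: use the environment variable when it is set and otherwise look up the "sge_qmaster" service. The variable name SGE_QMASTER_PORT and the order of the two lookups are assumptions here, and qmaster_port is a hypothetical helper.

```python
import os
import socket


def qmaster_port(environ=None):
    """Return the TCP port on which sge_qmaster is expected to listen."""
    environ = os.environ if environ is None else environ
    value = environ.get("SGE_QMASTER_PORT")  # assumed variable name
    if value:
        return int(value)
    # Fall back to the services database (for example /etc/services).
    # Raises OSError when no "sge_qmaster" entry exists.
    return socket.getservbyname("sge_qmaster", "tcp")
```

The port number passed in the environment is parsed with int(), so a non-numeric value raises ValueError instead of being silently ignored.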


SGE_DELAY_TIME
This variable controls how long sge_shadowd pauses after a takeover bid fails. The value is used only when multiple sge_shadowd instances are contending to become the master. The default is 600 seconds.


SGE_CHECK_INTERVAL
This variable controls the interval at which sge_shadowd checks the heartbeat file (60 seconds by default).


SGE_GET_ACTIVE_INTERVAL
This variable controls the interval after which an sge_shadowd instance tries to take over when the heartbeat file has not changed.
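The heartbeat check can be pictured as stall detection: read the heartbeat value periodically and act once it has stayed unchanged for long enough. The sketch below is not the daemon's implementation. HeartbeatWatcher is a hypothetical class, the heartbeat is assumed to be a value that changes while the master is alive, and how sge_shadowd actually combines the intervals is not specified on this page.

```python
class HeartbeatWatcher:
    """Track heartbeat readings and report when the value has stalled."""

    def __init__(self, stall_after):
        self.stall_after = stall_after  # seconds without change before acting
        self.last_value = None
        self.last_change = None

    def observe(self, value, now):
        """Record one reading taken at time `now` (in seconds).

        Returns True when the value has not changed for at least
        `stall_after` seconds, i.e. a takeover attempt would be due.
        """
        if value != self.last_value:
            # The master is alive: remember the new value and reset the clock.
            self.last_value = value
            self.last_change = now
            return False
        return now - self.last_change >= self.stall_after
```

A caller would invoke observe() once per check interval and, on a True result, start the takeover bid, pausing for the delay time if the bid fails.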


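Files

$SGE_ROOT/$SGE_CELL/common
	Default configuration directory
$SGE_ROOT/$SGE_CELL/common/shadow_masters
	Shadow master hostname file.
$SGE_ROOT/$SGE_CELL/spool/qmaster
	Default master daemon spool directory
$SGE_ROOT/$SGE_CELL/spool/qmaster/heartbeat
	The heartbeat file.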

See Also

sge_intro(1), sge_conf(5), sge_qmaster(8), Sun Grid Engine Installation and Administration Guide.

Referenced By

sge_bootstrap(5), sge_qmaster(8).

$Date: 2007/11/08 23:04:23 $ SGE 6.2u5 Sun Grid Engine Administrative Commands