[Beowulf] Power Cycling Question

Rémy Dernat remy.dernat at umontpellier.fr
Wed Aug 18 10:03:40 UTC 2021


Hi,

I only just saw this thread and do not have time right now to 
read all of it, so sorry if I am missing or repeating something 
already discussed.

I also wrote a plain bash script to handle power saving on our cluster 
many years ago, on SGE.

On 17/07/2021 at 02:44, Skylar Thompson wrote:
> We had tried engineering our custom "green cluster" automation with Grid
> Engine years ago where we would shutdown idle nodes until they were needed,
> but doing it independently of the resource manager was far too complicated
> for us to maintain, especially since it was all cost and no benefit for us
> with our power and cooling charges being absorbed through a flat overhead
> rate.

However, after a big code update on SVN (before this git repo), the code 
no longer worked as expected and I decided to leave it as it was. It 
was also too hard for me to maintain, even though it is not really very 
complicated.

The main idea is to set up an availability level for a cluster. Let's say 
you want 100% of the active nodes, plus X idle nodes that are ready 
to receive jobs, plus Y nodes powered off (nowadays I would choose 
hibernate instead). That is the availability level. You can then decide 
which nodes should be started first, how many nodes to boot at a time, 
which queues to exclude...
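
To make the availability-level idea a bit more concrete, here is a minimal 
Python sketch of the decision step. It is not the original bash code, only 
an illustration under assumptions: the names Node, plan_power_actions, 
spare_idle, boot_order, max_boot and excluded_queues are all made up for 
this example, and the node states are assumed to be already collected 
from the scheduler.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    state: str  # assumed states: "busy", "idle" or "off"
    queue: str


def plan_power_actions(nodes, spare_idle, boot_order, max_boot,
                       excluded_queues=()):
    """Return (to_boot, to_power_off) as two lists of node names.

    spare_idle is the "X" of the availability level: how many idle
    nodes should stay up, ready to receive jobs. Busy nodes are
    never touched.
    """
    # Nodes in excluded queues are ignored entirely.
    managed = [n for n in nodes if n.queue not in excluded_queues]
    idle = [n.name for n in managed if n.state == "idle"]
    off = {n.name for n in managed if n.state == "off"}

    if len(idle) < spare_idle:
        # Not enough spare nodes: boot powered-off nodes in the
        # preferred order, but never more than max_boot per pass.
        missing = spare_idle - len(idle)
        candidates = [name for name in boot_order if name in off]
        return candidates[:min(missing, max_boot)], []

    # Too many spare nodes: power off the surplus, starting with the
    # nodes that come last in the boot order.
    surplus = len(idle) - spare_idle
    rank = {name: i for i, name in enumerate(boot_order)}
    last_first = sorted(idle, key=lambda name: rank.get(name, len(rank)),
                        reverse=True)
    return [], last_first[:surplus]


nodes = [
    Node("n1", "busy", "all.q"),
    Node("n2", "idle", "all.q"),
    Node("n3", "off", "all.q"),
    Node("n4", "off", "all.q"),
    Node("n5", "idle", "gpu.q"),
]
order = ["n1", "n2", "n3", "n4", "n5"]
print(plan_power_actions(nodes, 2, order, max_boot=1,
                         excluded_queues=("gpu.q",)))
# (['n3'], [])
```

Running such a function periodically (from cron, for instance) and 
handing the two lists to whatever powers nodes on and off keeps the 
policy separate from the hardware commands, which is probably the part 
that is easiest to keep maintainable.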

I think all of this is already present in SLURM, and pretty well integrated.
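
As a rough illustration, the knobs above map more or less onto the 
power saving parameters of slurm.conf. The parameter names below are the 
ones SLURM documents; every value, script path, node name and partition 
name is a placeholder I made up, not a recommendation.

```
# slurm.conf fragment (illustrative values only)
SuspendTime=1800        # seconds a node must stay idle before power save
SuspendProgram=/usr/local/sbin/node_suspend.sh   # hypothetical site script
ResumeProgram=/usr/local/sbin/node_resume.sh     # hypothetical site script
SuspendRate=10          # max nodes put into power save per minute
ResumeRate=10           # max nodes resumed per minute
SuspendExcNodes=node[01-04]   # hypothetical nodes never powered down
SuspendExcParts=interactive   # hypothetical partition never powered down
SuspendTimeout=120      # seconds allowed for a node to go down
ResumeTimeout=600       # seconds allowed for a node to come back
```

Here SuspendExcNodes plays the role of the nodes that are always kept 
ready, ResumeRate limits how many nodes boot at a time, and 
SuspendExcParts corresponds to excluding a queue.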


About suspend/hibernate: Ubuntu does this by default since Focal, with 
the systemd units sleep.target, suspend.target, hibernate.target and 
hybrid-sleep.target (if you have a desktop environment, even GNOME has 
some hibernate functions; see org.gnome.settings-daemon.plugins.power...).

Best regards



-- 
Rémy Dernat
Chef de projet SI
IR CNRS - ISI / ISEM