[Beowulf] Slow RAID reads, no errors logged, why?

Alex Chekholko alex at calicolabs.com
Mon Mar 19 14:21:57 PDT 2018


Normally I would suggest to do a diagnostic read dd from each disk, but you
may not be able to do that with your RAID controller since it hides the
individual disks.

My next recommendation would be a full AC cycle; can you power the host off
for a few minutes? It's a bit cargo cult-y but sometimes it works. It may
also help (or not) for you to spin around 3 times while the machine is off.

On Mon, Mar 19, 2018 at 2:03 PM, David Mathog <mathog at caltech.edu> wrote:

> On 19-Mar-2018 13:58, David Mathog wrote:
>
>> The only oddness of late on "B" is that a few days ago it loaded too
>> many memory hungry processes so the OS killed some.  I have had that
>> happen before on other systems without them doing anything odd
>> afterwards.
>>
>
> Sorry, hit return to soo.
>
> The /var/log/messages entries associated with that showed OOM only killed
> some user processes, no system processes were removed.
>
> Regards,
>
>
> David Mathog
> mathog at caltech.edu
> Manager, Sequence Analysis Facility, Biology Division, Caltech
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org sponsored by Penguin Computing
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20180319/b3f832c0/attachment.html>


More information about the Beowulf mailing list