[pve-devel] need help to debug random host freeze on multiple hosts

Cesar Peschiera brain at click.com.py
Mon Dec 29 08:56:28 CET 2014


I know that this isn't a solution, but i will tell you only as a comment for
future decisions:

Long time ago, when i worked with Novell Netware, i had a problem of cache
in the AMD processor, so i had that disable it, and after, this server was
very slow, but was stable. Since that time i never recommended servers with
AMD processor.
(Maybe that you have the same problem?)

Moreover, maybe will be good disable some flags to AMD processor and test
it. How do it?, sincerely i don't know, but if you know it, please comment
it here, as also your tests (if you can)

----- Original Message ----- 
From: "Alexandre DERUMIER" <aderumier at odiso.com>
To: "Cesar Peschiera" <brain at click.com.py>
Cc: "datanom.net" <mir at datanom.net>; "pve-devel" <pve-devel at pve.proxmox.com>
Sent: Monday, December 29, 2014 3:31 AM
Subject: Re: [pve-devel] need help to debug random host freeze on multiple
hosts


>>Maybe i ask you a silly question, did you see the syslog and kern.log
>>file?

Yes sure , I have nothing in logs.
(That's why I thinked of kdump to try to have more info).

I'll really don't known if it's a software real kernel panic, or a hardware
bug.

I just see on vmware forum some amd microcode bug, and see that dell provide
a new bios update this month.
I'll try to update to see if it's help.



----- Original Message ----- 
From: "Alexandre DERUMIER" <aderumier at odiso.com>
To: "datanom.net" <mir at datanom.net>
Cc: "pve-devel" <pve-devel at pve.proxmox.com>
Sent: Monday, December 29, 2014 1:49 AM
Subject: Re: [pve-devel] need help to debug random host freeze on multiple
hosts


>>>Bad RAM stick?
>>>Bad PSU?
>>>Overheating of the CPU?
>
> No errors reporting in dell Idrac.
>
> (I have the problem on 6 differents nodes.....)
>
> I was also thinking of electrical problem, but voltages don't report any
> error.
>
> Maybe the only difference is that I have more load currently on all my
> nodes because of Xmas period
> (We host a lot of ecommerce websites)
> I'm around 60-70% load on this quad opteron platforms.
>
>
> I'll try to implement kdump today.
>
>
>
> ----- Mail original ----- 
> De: "datanom.net" <mir at datanom.net>
> À: "pve-devel" <pve-devel at pve.proxmox.com>
> Envoyé: Dimanche 28 Décembre 2014 19:02:04
> Objet: Re: [pve-devel] need help to debug random host freeze on multiple
> hosts
>
> On Sun, 28 Dec 2014 17:37:50 +0100 (CET)
> Alexandre DERUMIER <aderumier at odiso.com> wrote:
>
>>
>> I really don't known how to debug that, because the system freeze, and I
>> don't have any kernel panic output in display or serial.
>>
>>
>> Can somebody help me to add something to have debug output ?
>>
> Bad RAM stick?
> Bad PSU?
>
> -- 
> Hilsen/Regards
> Michael Rasmussen
>
> Get my public GnuPG keys:
> michael <at> rasmussen <dot> cc
> http://pgp.mit.edu:11371/pks/lookup?op=get&search=0xD3C9A00E
> mir <at> datanom <dot> net
> http://pgp.mit.edu:11371/pks/lookup?op=get&search=0xE501F51C
> mir <at> miras <dot> org
> http://pgp.mit.edu:11371/pks/lookup?op=get&search=0xE3E80917
> -------------------------------------------------------------- 
> /usr/games/fortune -es says:
> Bridge ahead. Pay troll.
>
> _______________________________________________
> pve-devel mailing list
> pve-devel at pve.proxmox.com
> http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel
> _______________________________________________
> pve-devel mailing list
> pve-devel at pve.proxmox.com
> http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel
>




More information about the pve-devel mailing list