[PVE-User] HDD errors in VMs

Emmanuel Kasper e.kasper at proxmox.com
Tue Jan 5 13:30:05 CET 2016



On 01/04/2016 07:53 PM, Michael Pöllinger wrote:
> Hi Emmanuel.
> 
> Wow this are good tips. we can check for. thank you!
> 
> What we´ve started with is my thread in december.
> [So Dez 27 05:17:44 2015] ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0
> action 0x6 frozen
> [So Dez 27 05:17:44 2015] ata1.00: failed command: WRITE DMA
> [So Dez 27 05:17:44 2015] ata1.00: cmd
> ca/00:80:b8:4e:ce/00:00:00:00:00/eb tag 0 dma 65536 out res
> 40/00:01:00:00:00/00:00:00:00:00/a0 Emask 0x4(timeout)
> [So Dez 27 05:17:44 2015] ata1.00: status: { DRDY }
> [So Dez 27 05:17:44 2015] ata1: soft resetting link
> [So Dez 27 05:17:45 2015] ata1.01: NODEV after polling detection
> [So Dez 27 05:17:45 2015] ata1.00: configured for MWDMA2
> [So Dez 27 05:17:45 2015] ata1.00: device reported invalid CHS sector 0
> [So Dez 27 05:17:45 2015] ata1: EH complete
> 
> OR
> 
> kernel: [309438.824333] ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0
> action 0x6 frozen
> kernel: [309438.825198] ata1.00: failed command: FLUSH CACHE
> kernel: [309438.825921] ata1.00: cmd e7/00:00:00:00:00/00:00:00:00:00/a0
> tag 0
> kernel: [309438.825921]          res 40/00:01:00:00:00/00:00:00:00:00/a0
> Emask 0x4 (timeout)
> kernel: [309438.827996] ata1.00: status: { DRDY }
> kernel: [309443.868140] ata1: link is slow to respond, please be patient
> (ready=0)
> kernel: [309448.852147] ata1: device not ready (errno=-16), forcing
> hardreset
> kernel: [309448.852175] ata1: soft resetting link
> kernel: [309449.009123] ata1.00: configured for MWDMA2
> kernel: [309449.009129] ata1.00: retrying FLUSH 0xe7 Emask 0x4
> kernel: [309449.009532] ata1.00: device reported invalid CHS sector 0
> kernel: [309449.009545] ata1: EH complete
> 


Hi Michael

I have not yet seen this kind of messages in my test lab ( but it looks
as well to me like the host is starving for I/O)

Can you post the vm.conf of the VM who shows this behaviour, and the
output of free -h *when the problem occurs*
( dmesg -T will display human readable timestamps)


Also remember that Raid 5 is evil ( lots of links in Google about that :)

Emmanuel









More information about the pve-user mailing list