Disk I/O Utilisation High and Errors in Logs - Site Unusable

Comments

4 comments

  • cPanelPeter cPanel Staff
    Hello, you most likely have a hard drive problem. The first log you provided clearly indicates DMA write errors on ata1. The hard drive is slowly failing and needs to be replaced. Once that is done, see if the other problems go away.
  • heusdens
    Thank you, Peter. My host says I'm on a shared SAN that is subject to abuse now and then, but that there is no abuse, so they will be moving me to another hypervisor to see if that resolves the issue. I'm not entirely convinced of this. Even restarting my virtual server does not solve the issue. What do you make of the other logs that I posted?
  • JaredR.
    [QUOTE]My host says I'm on a shared SAN and is subject to abuse now and then[/QUOTE]
    This:
    Jan 6 04:47:09 server1 kernel: ata1.00: failed command: WRITE DMA
    Jan 6 04:47:09 server1 kernel: ata1.00: cmd ca/00:58:c0:71:8a/00:00:00:00:00/e3 tag 0 dma 45056 out
    Jan 6 04:47:09 server1 kernel: res 40/00:01:06:4f:c2/00:00:00:00:00/a0 Emask 0x4 (timeout)
    Jan 6 04:47:09 server1 kernel: ata1.00: status: { DRDY }
    is a sign of a hardware problem, not "abuse". It is happening a lot, based on the logs you provided. You need to ask your host to run diagnostics on each actual hard drive, because when you start to see that kind of error message, hardware failure may be imminent. This is not something that cPanel would have any control over. It is happening at a deeper level, in the hardware, and the hardware needs to be carefully investigated before a hard drive is lost (and your data with it).
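To get a rough idea of how often these errors occur, the kernel log can be scanned for "failed command" lines and tallied per device. The following is only a minimal sketch: the function name `count_failed_commands` is hypothetical, the sample lines are the ones quoted in this thread, and the regular expression assumes the syslog format shown above.

```python
import re
from collections import Counter

# Matches kernel lines such as:
#   "Jan 6 04:47:09 server1 kernel: ata1.00: failed command: WRITE DMA"
# The pattern is an assumption based on the log excerpt in this thread.
FAILED_CMD = re.compile(r"kernel: (ata\d+(?:\.\d+)?): failed command: (.+?)\s*$")


def count_failed_commands(lines):
    """Count ATA 'failed command' entries per (device, command) pair."""
    counts = Counter()
    for line in lines:
        match = FAILED_CMD.search(line)
        if match:
            counts[(match.group(1), match.group(2))] += 1
    return counts


# Sample input: the four log lines posted earlier in the thread.
SAMPLE_LOG = [
    "Jan 6 04:47:09 server1 kernel: ata1.00: failed command: WRITE DMA",
    "Jan 6 04:47:09 server1 kernel: ata1.00: cmd ca/00:58:c0:71:8a/00:00:00:00:00/e3 tag 0 dma 45056 out",
    "Jan 6 04:47:09 server1 kernel: res 40/00:01:06:4f:c2/00:00:00:00:00/a0 Emask 0x4 (timeout)",
    "Jan 6 04:47:09 server1 kernel: ata1.00: status: { DRDY }",
]

for (device, command), n in count_failed_commands(SAMPLE_LOG).most_common():
    print(device, command, n)
```

On a real server the same function could be fed an open log file instead of `SAMPLE_LOG`; the log location (often `/var/log/messages` on CentOS) depends on the syslog configuration. A steadily growing count points toward hardware, but it does not replace the drive diagnostics the host needs to run.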
  • heusdens
    Thanks, I agree with you. I've been moved over to another machine and everything is running smoothly now. One other thing they picked up was that I was running a debug version of the kernel. I actually upgraded the kernel yesterday, which was after these issues started, so it was not the primary cause. Security Advisor in cPanel recommended I upgrade to 2.6.32-431.3.1.el6, which I did. Hosting support changed it to the "normal" kernel in the bootloader. How would I ensure that the debug version is not selected on reboot going forward? How does one check the difference between the debug and normal versions and ensure it's configured properly in the bootloader?
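One way to approach the debug-versus-normal check is sketched below. It rests on assumptions not confirmed in this thread: that the server uses GRUB legacy with a `grub.conf` file (commonly `/boot/grub/grub.conf` on CentOS 6), and that the debug build carries a `.debug` suffix in its release string, as the `kernel-debug` package does on EL6. The function names, the sample `grub.conf` contents, and the `x86_64` architecture are hypothetical and only for illustration.

```python
import re


def is_debug_kernel(release):
    """True if a kernel release string (as from `uname -r`) looks like a debug build.

    Assumption: debug builds end in '.debug'.
    """
    return release.strip().endswith(".debug")


def default_grub_entry(grub_conf_text):
    """Return (index, title) of the entry GRUB legacy would boot by default.

    Handles only a numeric 'default=N' line; entries are numbered from 0
    in the order their 'title' lines appear. Title is None if out of range.
    """
    default = 0
    titles = []
    for raw in grub_conf_text.splitlines():
        line = raw.strip()
        if line.startswith("#"):
            continue  # skip comments
        match = re.match(r"default\s*=?\s*(\d+)$", line)
        if match:
            default = int(match.group(1))
        elif line.startswith("title"):
            titles.append(line[len("title"):].strip())
    title = titles[default] if default < len(titles) else None
    return default, title


# Hypothetical grub.conf in which the debug kernel is the first (default) entry.
SAMPLE_GRUB_CONF = """\
default=0
timeout=5
title CentOS (2.6.32-431.3.1.el6.x86_64.debug)
    root (hd0,0)
    kernel /vmlinuz-2.6.32-431.3.1.el6.x86_64.debug ro root=/dev/sda1
title CentOS (2.6.32-431.3.1.el6.x86_64)
    root (hd0,0)
    kernel /vmlinuz-2.6.32-431.3.1.el6.x86_64 ro root=/dev/sda1
"""

index, title = default_grub_entry(SAMPLE_GRUB_CONF)
print("default entry:", index, title)
print("default entry is debug:", title is not None and ".debug" in title)
```

On a live system, `platform.release()` from the Python standard library returns the running kernel's release string, which can be passed to `is_debug_kernel`. If the default entry turns out to be the debug build, changing the `default=` number so that it points at the non-debug `title` entry is the usual fix under these assumptions.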
