problem with server hang up
my server sometimes bug and I get the a lot of message like this :
ChkServd Version: 15.3 please help
HANG: chkservd on xxxxxxxxxxxxx
The chkservd sub-process with pid 30261 ran for 7455 seconds. This sub-process was terminated when it exceeded the time allowed between checks, which is 300 seconds. To determine why, you can check /var/log/chkservd.log and /usr/local/cpanel/logs/tailwatchd_log.
You likely received this notification as a symptom of a larger problem. If your server is experiencing a high load, we recommend investigating the cause. If you continue to receive this notification, it is likely that your system is unable to handle demand or a misconfiguration is delaying restarts.
If you are sure that no misconfigurations exist, you should consider gradually increasing the following options in WHM's "Tweak Settings" feature: "The number of times ChkServd will allow a previous check to complete before terminating the check" and/or "The number of seconds between ChkServd service checks".
FAILED: imap on xxxxxxxxxxxxxxxxx
Server: vps4
Primary IP: xxxxxxxxxx
Service: imap
Notification Type: failed
Notification: imap failed @ Sun May 25 02:03:33 2014. A restart was attempted automagically.
Service Check Method: [socket connect]
Reason: Timeout while trying to get data from service: Died at /usr/local/cpanel/Cpanel/TailWatch/ChkServd.pm line 822, <$socket_scc> line 3.
Number of Restart Attempts: 3
Startup Log: Starting Dovecot Imap: [ OK ]
Starting Dovecot Imap: [ OK ]
Starting Dovecot Imap: [ OK ]
Starting Dovecot Imap: [ OK ]
Syslog Messages: May 25 02:21:46 vps dovecot: imap(__cpanel__service__auth__imap__sa8ui46xwimb2i_vuuni5uuwusnsc4wgzzmub_yhzv3fkbdvt83tdkle8brmlnzg): Error: Internal error occurred. Refer to server log for more information.
May 25 02:21:44 vps41334 dovecot: imap(__cpanel__service__auth__imap__sa8ui46xwimb2i_vuuni5uuwusnsc4wgzzmub_yhzv3fkbdvt83tdkle8brmlnzg): Error: user __cpanel__service__auth__imap__sa8ui46xwimb2i_vuuni5uuwusnsc4wgzzmub_yhzv3fkbdvt83tdkle8brmlnzg: Error reading configuration: Timeout reading config from /var/run/dovecot/config
May 25 02:21:40 vps41334 dovecot: imap-login: Login: user=<__cpanel__service__auth__imap__sa8ui46xwimb2i_vuuni5uuwusnsc4wgzzmub_yhzv3fkb...>, method=PLAIN, rip=127.0.0.1, lip=127.0.0.1, mpid=22354, secured, session=
May 25 01:32:40 vps dovecot: master: Error: service(anvil): command startup failed, throttling for 2 secs
May 25 01:32:40 vps dovecot: anvil: Fatal: Error reading configuration: Timeout reading config from /var/run/dovecot/config
Memory Information: " Used: 5470MB
" Available: 479MB
" Installed: 5963MB
Load Information: xxxxx
Uptime: 5 days, 12 hours, 29 seconds
IOStat Information: avg-cpu: %user %nice %system %iowait %steal %idle
6.44 0.38 2.75 0.51 0.00 89.93
Device: tps Blk_read/s Blk_wrtn/s Blk_read Blk_wrtn
sda 15.79 474.16 142.17 226168114 67813184
ChkServd Version: 15.3 please help
-
Hello :) You should review the time stamps in /var/log/maillog and match up the failure messages to the times that Chkservd reported the issue. You may need to browse to "WHM Home " Service Configuration " Mailserver Configuration" and increase the "Number of Spare Authentication Processes". Thank you. 0 -
[quote="cPanelMichael, post: 1654082">Hello :) You should review the time stamps in /var/log/maillog and match up the failure messages to the times that Chkservd reported the issue. You may need to browse to "WHM Home " Service Configuration " Mailserver Configuration" and increase the "Number of Spare Authentication Processes". Thank you.
hello, thanks for response i have modify the number of Spare Authentication Processes. but this morning my server is stopped and I received many emails again :Subject: HANG: chkservd on xxxxxxxxxxxxxxxx (xxxxxxxxxxxxx) Date: Tue, 3 Jun 2014 08:19:29 +0200 The chkservd sub-process with pid 27256 ran for 8610 seconds. This sub-process was terminated when it exceeded the time allowed between checks, which is 300 seconds. To determine why, you can check /var/log/chkservd.log and /usr/local/cpanel/logs/tailwatchd_log. You likely received this notification as a symptom of a larger problem. If your server is experiencing a high load, we recommend investigating the cause. If you continue to receive this notification, it is likely that your system is unable to handle demand or a misconfiguration is delaying restarts. If you are sure that no misconfigurations exist, you should consider gradually increasing the following options in WHM's "Tweak Settings" feature: "The number of times ChkServd will allow a previous check to complete before terminating the check" and/or "The number of seconds between ChkServd service checks". Server: xxxxxxxxxxxx Primary IP: xxxxxxxx Service: chkservd Notification Type: hang Memory Information: " Used: 5895MB " Available: 66MB " Installed: 5963MB Load Information: 168.91 159.51 166.16 Uptime: 2 days, 17 hours, 26 seconds IOStat Information: avg-cpu: %user %nice %system %iowait %steal %idle 7.99 0.29 2.83 0.43 0.00 88.46 Device: tps Blk_read/s Blk_wrtn/s Blk_read Blk_wrtn sda 11.57 346.33 110.30 81609186 25990408 ChkServd Version: 15.3 Subject: FAILED: mysql on xxxxxxxxxxxxx (xxxxxxxxxxxxx) Date: Tue, 3 Jun 2014 05:16:30 +0200 Server: xxxxxxxxxxx Primary IP: xxxxxxxxxxxxx Service: mysql Notification Type: failed Notification: mysql failed @ Tue Jun 3 04:29:12 2014. A restart was attempted automagically. Service Check Method: [check command] Number of Restart Attempts: 2 Service Check Raw Output: mysql is not running Restart Message: Stuck on restart of mysql at /usr/local/cpanel/Cpanel/TailWatch/ChkServd.pm line 1052. Startup Log: Starting MySQL...................................................... SUCCESS! Memory Information: " Used: 5880MB " Available: 78MB " Installed: 5963MB Load Information: 160.51 161.00 183.40 Uptime: 2 days, 14 hours, 17 seconds IOStat Information: avg-cpu: %user %nice %system %iowait %steal %idle 6.84 0.29 2.32 0.44 0.00 90.12 Device: tps Blk_read/s Blk_wrtn/s Blk_read Blk_wrtn sda 8.34 215.12 108.11 48250610 24249272 ChkServd Version: 15.3
please help0 -
I had another server outage this morning and I received another e-mail, look: Subject: FAILED: lfd on xxxxxxxxxxxxxxxxxxxxx Server: xxxxxxxxxxxxxxxxxxxxx Primary IP: xxxxxxxxxxxxxxxxxxxxx Service: lfd Notification Type: failed Notification: lfd failed @ Thu Jun 5 10:05:18 2014. A restart was attempted automagically. Service Check Method: [check command] Number of Restart Attempts: 1 Memory Information: " Used: 431MB " Available: 5532MB " Installed: 5963MB Load Information: 1.34 0.29 0.10 Uptime: 0 days, 0 hours, 0 seconds IOStat Information: avg-cpu: %user %nice %system %iowait %steal %idle 46.34 0.38 12.11 13.67 0.00 27.51 Device: tps Blk_read/s Blk_wrtn/s Blk_read Blk_wrtn sda 502.74 17716.07 3237.21 518018 94656 ChkServd Version: 15.3 Subject: FAILED: mysql on xxxxxxxxxxxxx Server: xxxxxxxxxxxxx Primary IP: xxxxxxxxxxxxx Service: mysql Notification Type: failed Notification: mysql failed @ Thu Jun 5 08:39:11 2014. A restart was attempted automagically. Service Check Method: [check command] Number of Restart Attempts: 1 Service Check Raw Output: mysql is not running Memory Information: " Used: 4415MB " Available: 1549MB " Installed: 5963MB Load Information: 276.52 237.63 155.60 Uptime: 2 days, 0 hours, 0 seconds IOStat Information: avg-cpu: %user %nice %system %iowait %steal %idle 1.23 6.98 2.44 0.54 0.00 88.81 Device: tps Blk_read/s Blk_wrtn/s Blk_read Blk_wrtn sda 7.94 167.98 142.73 29031146 24666528 ChkServd Version: 15.3 0 -
[QUOTE]Load Information: 276.52 237.63 155.60
The load here is likely the culprit. You will need to investigate what is causing the high load average. The following thread is a good place to start: Troubleshooting High Loads On Linux Systems Thank you.0 -
thanks for response. I get an email error every 5 minutes now ! 0 -
[quote="chakerben, post: 1658381">thanks for response. I get an email error every 5 minutes now !
Yes, addressing the load issue is likely the best way to resolve this issue. Thank you.0
Please sign in to leave a comment.
Comments
6 comments