[SOLVED] Gaps / "nan" returned
Moderators: Developers, Moderators
Sadly, my gaps came back again. Found a table corruption. After running the fix, poller is processing better. Well, so far. It's been about 1 hour.
I am now running hacks to both kill old poller processes and repair tables, every 10 minutes. I'd prefer not to run these, but better these than the machine runs out of memory and OOMs important stuff. And cacti is too useful to turn off.
# kill old cacti processes
*/10 * * * cacti /usr/local/bin/find_kill_old_cacti_procs.sh
*/10 * * * root /usr/local/bin/repair_cacti_tables.sh
</usr/local/bin/repair_cacti_tables.sh>
#!/bin/bash
# repair cacti tables
/usr/bin/myisamchk --silent --force --fast --update-state --key_buffer_size=64M --sort_buffer_size=64M --read_buffer_size=1M --write_buffer_size=1M /var/lib/mysql/cacti/*.MYI
</>
I am now running hacks to both kill old poller processes and repair tables, every 10 minutes. I'd prefer not to run these, but better these than the machine runs out of memory and OOMs important stuff. And cacti is too useful to turn off.
# kill old cacti processes
*/10 * * * cacti /usr/local/bin/find_kill_old_cacti_procs.sh
*/10 * * * root /usr/local/bin/repair_cacti_tables.sh
</usr/local/bin/repair_cacti_tables.sh>
#!/bin/bash
# repair cacti tables
/usr/bin/myisamchk --silent --force --fast --update-state --key_buffer_size=64M --sort_buffer_size=64M --read_buffer_size=1M --write_buffer_size=1M /var/lib/mysql/cacti/*.MYI
</>
Same problem. Cacti hangs randomly on certain hosts, usually the same hosts. Gives SNMP timeout errors, however, snmpd is working fine on those boxes. I can manually poll that information, and get it instantly. Also tried raising the SNMP timeout for those machines to 5+ seconds. No change.
Switching back to cmd.php works, but it's not ideal.
This started when I upgraded to 0.8.7b. I am running the latest spine.
Switching back to cmd.php works, but it's not ideal.
This started when I upgraded to 0.8.7b. I am running the latest spine.
I too ran into the issue of Cacti hanging on various hosts then eventually timing out but traced it down to a problem with using SNMP v2 in the Cacti interface. Once I changed the SNMP version to V1. The problem went away and the host no longer timed out.
ENVIRONMENT
---------------------------
OS: Solaris 10 update 5
Apache 2.2.8
PHP 5.2.5
libxml 2.6.31
MySQL 5.0.51a
rrdtool 1.2.x
Cacti 0.8.7b (with settings 0.5 and thold 0.3.9
Spine 0.8.7a
Perl 5.10.0
---------------------------
OS: Solaris 10 update 5
Apache 2.2.8
PHP 5.2.5
libxml 2.6.31
MySQL 5.0.51a
rrdtool 1.2.x
Cacti 0.8.7b (with settings 0.5 and thold 0.3.9
Spine 0.8.7a
Perl 5.10.0
That's a workaround, not a resolution. Doing manual queries works fine with snmpv2. The bug is with Cacti. I actually stopped using Cacti due to this bug, and several, several others.ccogdill wrote:I too ran into the issue of Cacti hanging on various hosts then eventually timing out but traced it down to a problem with using SNMP v2 in the Cacti interface. Once I changed the SNMP version to V1. The problem went away and the host no longer timed out.
- gandalf
- Developer
- Posts: 22383
- Joined: Thu Dec 02, 2004 2:46 am
- Location: Muenster, Germany
- Contact:
Are you using php-snmp or net-snmp libraries, then? Please report versions of those packagesccogdill wrote:I too ran into the issue of Cacti hanging on various hosts then eventually timing out but traced it down to a problem with using SNMP v2 in the Cacti interface. Once I changed the SNMP version to V1. The problem went away and the host no longer timed out.
Reinhard
Re: [SOLVED] Gaps / "nan" returned
After upgrading to cacti 0.8.7g this probleem seems to have returned. Except that graphs are not updating AT ALL. Sometimes it updates..
Any suggestions? Ive seen the "poller bug". Poller time is 1 minute cron and 1 minute interval.
Any suggestions? Ive seen the "poller bug". Poller time is 1 minute cron and 1 minute interval.
Who is online
Users browsing this forum: No registered users and 1 guest