strange errors after cactid upgrade (0.8.6g->0.8.6i)

Post support questions that directly relate to Linux/Unix operating systems.

Moderators: Developers, Moderators

Post Reply
alekiv
Posts: 10
Joined: Tue Nov 23, 2004 8:30 am

strange errors after cactid upgrade (0.8.6g->0.8.6i)

Post by alekiv »

After Cactid poller upgrade (0.8.6g->0.8.6i) I noticed strange errors for each host:

Code: Select all

...
10/10/2007 05:35:33 PM - CACTID: Poller[0] ERROR: BQQQ '10.0.14.14:161' 
10/10/2007 05:35:33 PM - CACTID: Poller[0] ERROR: BQQQ '10.0.14.15:161' 
10/10/2007 05:35:33 PM - CACTID: Poller[0] ERROR: BQQQ '10.0.14.37:161' 
10/10/2007 05:35:33 PM - CACTID: Poller[0] ERROR: BQQQ '10.0.14.7:161'
...
All graphs and data generating, but strange errors in log file...

Well I be very thankful if some one give some help!
Thanks!
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Please see second link of my signature for more details on debugging
Reinhard
alekiv
Posts: 10
Joined: Tue Nov 23, 2004 8:30 am

Post by alekiv »

gandalf wrote:Please see second link of my signature for more details on debugging
Reinhard
I don't have NaN's in my graphs...
With debug level 5 i got:

Code: Select all

10/15/2007 01:20:34 PM - CACTID: Poller[0] DEBUG: MySQL Query ID '77': 'SELECT id, hostname, snmp_community, snmp_username, snmp_password, snmp_version, snmp_port, snmp_timeout, status, status_event_count, status_fail_date, status_rec_date, status_last_error, min_time, max_time, cur_time, avg_time, total_polls, failed_polls, availability  FROM host WHERE id=27'
10/15/2007 01:20:34 PM - CACTID: Poller[0] DEBUG: MySQL Query ID '77': OK
10/15/2007 01:20:34 PM - CACTID: Poller[0] ERROR: BQQQ '10.0.14.14:161'
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Please try to rebuild_poller_cache.php
Reinhard
alekiv
Posts: 10
Joined: Tue Nov 23, 2004 8:30 am

Post by alekiv »

gandalf wrote:Please try to rebuild_poller_cache.php
Reinhard
Same errors after rebuilding poller cache... :(
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Please use "System Utilities -> View Poller Chache", filter for that host and post the appropriate lines
Reinhard
alekiv
Posts: 10
Joined: Tue Nov 23, 2004 8:30 am

Post by alekiv »

gandalf wrote:Please use "System Utilities -> View Poller Chache", filter for that host and post the appropriate lines
Reinhard
Data Source Name** Details
10.0.14.14 (web1-1) - CPU Usage - Nice SNMP Version: 2, Community: public, OID: .1.3.6.1.4.1.2021.11.51.0 RRD: /usr/share/cacti/rra/10_0_14_14_web11_cpu_nice_1637.rrd
10.0.14.14 (web1-1) - CPU Usage - System SNMP Version: 2, Community: public, OID: .1.3.6.1.4.1.2021.11.52.0 RRD: /usr/share/cacti/rra/10_0_14_14_web11_cpu_system_1638.rrd
10.0.14.14 (web1-1) - CPU Usage - User SNMP Version: 2, Community: public, OID: .1.3.6.1.4.1.2021.11.50.0 RRD: /usr/share/cacti/rra/10_0_14_14_web11_cpu_user_1639.rrd
10.0.14.14 (web1-1) - lighttpd Statistics Script Server: /usr/share/cacti/scripts/ss_lighttpd_stats.php ss_lighttpd_stats 10.0.14.14 RRD: /usr/share/cacti/rra/10_0_14_14_web11_uptime_1651.rrd
10.0.14.14 (web1-1) - Load Average - 1 Minute SNMP Version: 2, Community: public, OID: .1.3.6.1.4.1.2021.10.1.3.1 RRD: /usr/share/cacti/rra/10_0_14_14_web11_load_1min_1640.rrd
10.0.14.14 (web1-1) - Load Average - 15 Minute SNMP Version: 2, Community: public, OID: .1.3.6.1.4.1.2021.10.1.3.3 RRD: /usr/share/cacti/rra/10_0_14_14_web11_load_15min_1641.rrd
10.0.14.14 (web1-1) - Load Average - 5 Minute SNMP Version: 2, Community: public, OID: .1.3.6.1.4.1.2021.10.1.3.2 RRD: /usr/share/cacti/rra/10_0_14_14_web11_load_5min_1642.rrd
10.0.14.14 (web1-1) - Memory - Cache SNMP Version: 2, Community: public, OID: .1.3.6.1.4.1.2021.4.15.0 RRD: /usr/share/cacti/rra/10_0_14_14_web11_mem_cache_1643.rrd
10.0.14.14 (web1-1) - Memory - Free SNMP Version: 2, Community: public, OID: .1.3.6.1.4.1.2021.4.6.0 RRD: /usr/share/cacti/rra/10_0_14_14_web11_mem_free_1644.rrd
10.0.14.14 (web1-1) - Memory - Total SNMP Version: 2, Community: public, OID: .1.3.6.1.4.1.2021.4.5.0 RRD: /usr/share/cacti/rra/10_0_14_14_web11_mem_total_1645.rrd
10.0.14.14 (web1-1) - Partition - /dev/cciss/c0d0 SNMP Version: 2, Community: public, OID: .1.3.6.1.4.1.2021.9.1.8.1 RRD: /usr/share/cacti/rra/10_0_14_14_web11_hdd_free_1649.rrd
10.0.14.14 (web1-1) - Partition - /dev/cciss/c0d0 SNMP Version: 2, Community: public, OID: .1.3.6.1.4.1.2021.9.1.7.1 RRD: /usr/share/cacti/rra/10_0_14_14_web11_hdd_free_1649.rrd

10.0.14.14 (web1-1) - Swap - Total SNMP Version: 2, Community: public, OID: .1.3.6.1.2.1.25.2.3.1.5.3 RRD: /usr/share/cacti/rra/10_0_14_14_web11_swap_total_1646.rrd
10.0.14.14 (web1-1) - Swap - Used SNMP Version: 2, Community: public, OID: .1.3.6.1.2.1.25.2.3.1.6.3 RRD: /usr/share/cacti/rra/10_0_14_14_web11_swap_used_1647.rrd
10.0.14.14 (web1-1) - Traffic - 10.0.14.14 - eth0 SNMP Version: 2, Community: public, OID: .1.3.6.1.2.1.2.2.1.10.2 RRD: /usr/share/cacti/rra/10_0_14_14_web11_traffic_in_1650.rrd
10.0.14.14 (web1-1) - Traffic - 10.0.14.14 - eth0 SNMP Version: 2, Community: public, OID: .1.3.6.1.2.1.2.2.1.16.2 RRD: /usr/share/cacti/rra/10_0_14_14_web11_traffic_in_1650.rrd


Strange... I see 2 almost same lines in poller cache
but in data sources I see 1 datasource that use 10_0_14_14_web11_hdd_free_1649.rrd file:
10.0.14.14 (web2) - Partition - /dev/sda1 Get SNMP Data (Indexed) Yes ucd/net - Hard Drive Space
Last edited by alekiv on Thu Oct 18, 2007 2:30 pm, edited 3 times in total.
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

The lines are not equal. The OID differs. Both values will be stored in the same rrd file.
Reinhard
alekiv
Posts: 10
Joined: Tue Nov 23, 2004 8:30 am

Post by alekiv »

gandalf wrote:The lines are not equal. The OID differs. Both values will be stored in the same rrd file.
Reinhard
But in datasources I see only one datasource, that use this rrd file.
10.0.14.14 (web2) - Partition - /dev/sda1 Get SNMP Data (Indexed) Yes ucd/net - Hard Drive Space
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Yep, but accoridng to the data template, the data source should hold two data source items (cacti lingo)
Reinhard
alekiv
Posts: 10
Joined: Tue Nov 23, 2004 8:30 am

Post by alekiv »

gandalf wrote:Yep, but accoridng to the data template, the data source should hold two data source items (cacti lingo)
Reinhard
As I understood this is not problem reason...
Then why i see BQQQ errors in logs?
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Please run

Code: Select all

./cactid --verbosity=5 27 27
to get the whole DEBUG for that very host.
Reinhard
alekiv
Posts: 10
Joined: Tue Nov 23, 2004 8:30 am

Post by alekiv »

gandalf wrote:Please run

Code: Select all

./cactid --verbosity=5 27 27
to get the whole DEBUG for that very host.
Reinhard

"./cactid --verbosity=5 27 27" output
Attachments
cacti.txt
"./cactid --verbosity=5 27 27" output
(10.89 KiB) Downloaded 340 times
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Ok, the relevant part is here
CACTID: MYSQL: Connecting to MySQL database 'cacti' on '10.1.0.43'...
CACTID: MYSQL: Connected to MySQL database 'cacti' on '10.1.0.43'...
CACTID: DEBUG: MySQL Query ID '22': 'SELECT id, hostname, snmp_community, snmp_username, snmp_password, snmp_version, snmp_port, snmp_timeout, status, status_event_count, status_fail_date, status_rec_date, status_last_error, min_time, max_time, cur_time, avg_time, total_polls, failed_polls, availability FROM host WHERE id=27'
CACTID: DEBUG: MySQL Query ID '22': OK
CACTID: ERROR: BQQQ '10.0.14.14:161'
CACTID: Host[27] SNMP Result: Host responded to SNMP
I suppose, it's during host availability checking, even if host is detected as up. Currently, SNMP only host availabilty check seems to be in effect, right?
Do those hosts throwing this error have sth in common?
Did you already consider upgrading to latest cactid-0.8.6j?
Reinhard
Post Reply

Who is online

Users browsing this forum: No registered users and 1 guest