I have fresh ManageIQ deployment (latest stable version) connected to Openstack provider and listening to events with AMQP. Every few hours I see the following error in evm.log:
E, [2017-10-17T04:47:43.342658 #12715] ERROR – #<Bunny::Session:0x1f53010 email@example.com:5672, vhost=/, hosts=[10.19.233.1]>: Got an exception when receiving data: Connection reset by peer (Errno::ECONNRESET)
W, [2017-10-17T04:47:43.342841 #12715] WARN – #<Bunny::Session:0x1f53010 firstname.lastname@example.org:5672, vhost=/, hosts=[10.19.233.1]>: Recovering from a network failure…
W, [2017-10-17T04:47:53.343421 #12715] WARN – #<Bunny::Session:0x1f53010 email@example.com:5672, vhost=/, hosts=[10.19.233.1]>: Retrying connection on next host in line: 10.19.233.1:5672
RabbitMQ logs shows the following error, however it repeats no matter if ManageIQ is connected:
=ERROR REPORT==== 17-Oct-2017::09:01:37 ===
closing AMQP connection <0.10231.131> (172.17.101.5:43508 -> 172.17.101.4:5672):
missed heartbeats from client, timeout: 60s
Running “Refresh Relationships and Power States” on the provider doesn’t reproduce the issue.
First of all, how critical is the error? Does it mean ManageIQ is not listening to Openstack events, or there are few missed events on every connection drop or its just annoying hiccup?
Second, how would you recommend investigating this issue further? Enabling debug level for fog doesn’t seem to report anything about events.