Skip to content

HA exited with an error, HA aborted, retry deadline exceeded #900

@mburring

Description

@mburring

Describe the bug

We have two separate icinga instances running identical configurations and icingadb will randomly crash with a 'retry deadline exceeded' error.

Both of these installations are single master.

To Reproduce

Appears random

Expected behavior

That it doesn't happen

Your Environment

Include as many relevant details about the environment you experienced the problem in

  • Icinga DB version: 1.2.1-1+ubuntu20.04
  • Icinga 2 version: 2.14.5-1+ubuntu20.04
  • Operating System and version: Ubuntu 20.04

Additional context

● icingadb.service - Icinga DB
     Loaded: loaded (/lib/systemd/system/icingadb.service; enabled; vendor preset: enabled)
     Active: failed (Result: exit-code) since Tue 2025-03-11 01:50:01 AEDT; 10h ago
    Process: 1112676 ExecStart=/usr/sbin/icingadb --config /etc/icingadb/config.yml (code=exited, status=1/FAILURE)
   Main PID: 1112676 (code=exited, status=1/FAILURE)

Mar 11 01:49:01 master1 icingadb[1112676]: heartbeat: Waiting for Icinga heartbeat
Mar 11 01:49:20 master1 icingadb[1112676]: history-sync: Synced 5 notification history items
Mar 11 01:49:20 master1 icingadb[1112676]: history-sync: Synced 36 state history items
Mar 11 01:49:40 master1 icingadb[1112676]: history-sync: Synced 33 state history items
Mar 11 01:49:40 master1 icingadb[1112676]: history-sync: Synced 4 notification history items
Mar 11 01:50:00 master1 icingadb[1112676]: history-sync: Synced 4 notification history items
Mar 11 01:50:00 master1 icingadb[1112676]: history-sync: Synced 32 state history items
Mar 11 01:50:01 master1 icingadb[1112676]: retry deadline exceeded
                                                                           github.com/icinga/icingadb/pkg/icingadb.(*HA).controller
                                                                                   github.com/icinga/icingadb/pkg/icingadb/ha.go:166
                                                                           runtime.goexit
                                                                                   runtime/asm_amd64.s:1700
                                                                           HA aborted
                                                                           github.com/icinga/icingadb/pkg/icingadb.(*HA).abort.func1
                                                                                   github.com/icinga/icingadb/pkg/icingadb/ha.go:134
                                                                           sync.(*Once).doSlow
                                                                                   sync/once.go:76
                                                                           sync.(*Once).Do
                                                                                   sync/once.go:67
                                                                           github.com/icinga/icingadb/pkg/icingadb.(*HA).abort
                                                                                   github.com/icinga/icingadb/pkg/icingadb/ha.go:132
                                                                           github.com/icinga/icingadb/pkg/icingadb.(*HA).controller
                                                                                   github.com/icinga/icingadb/pkg/icingadb/ha.go:166
                                                                           runtime.goexit
                                                                                   runtime/asm_amd64.s:1700
                                                                           HA exited with an error
                                                                           main.run
                                                                                   github.com/icinga/icingadb/cmd/icingadb/main.go:336
                                                                           main.main
                                                                                   github.com/icinga/icingadb/cmd/icingadb/main.go:37
                                                                           runtime.main
                                                                                   runtime/proc.go:272
                                                                           runtime.goexit
                                                                                   runtime/asm_amd64.s:1700
Mar 11 01:50:01 master1 systemd[1]: icingadb.service: Main process exited, code=exited, status=1/FAILURE
Mar 11 01:50:01 master1 systemd[1]: icingadb.service: Failed with result 'exit-code'.

Metadata

Metadata

Assignees

No one assigned

    Labels

    area/habugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions