User Story (Required on creation):
As an Operations Engineer I want to early be aware of problems with jobs. Currently an incident is created if all job retries fail. To address certain issues quite early it'd be helpful to see in the diagram where a job is currently failing even if retries are left.
Functional Requirements (Required before implementation):
Limitations of Scope (Optional):
Hints (Optional):