Type: Feature Request
Resolution: Unresolved
Priority: L3 - Default
Fix Version/s: None
Affects Version/s: None
Component/s: backend
Labels:
- PM
- SUPPORT
- blog
- configuration
- elasticsearch
- potential_for_3.9

PM Priority:
2
Effort:
Not defined
Target Quality:
3 - Expected
Epic Link:
PRODHUB-35

Please describe the missing/desired functionality:

Optimize is currently somewhat inflexible in how it approaches ES configuration and could be better "optimized" to suit the different needs of our customers, allowing it to handle data ingestion and reads at large scale.

We should investigate ways to improve performance and make use of ES capabilities in ways that suit all of our customers, rather than trying to find a one-size-fits-all approach.

Core solution

Please describe the missing/desired functionality:

We currently have a fixed number of shards for each process/decision instance index. The same shard count is applied to every index regardless of its individual requirements, which leads to redundancy and inefficiency that we could reduce. We should investigate how Optimize can handle index configuration in a smarter way based on the ingested data.

One idea would be to have only a single "warm" index for each process instance, to which the currently written data is always routed. At the same time, "completed" instances could be periodically archived to a read-only index. For report evaluation, we would query both the archive index and the warm index, but for data writes, we would only ever need to update the running instances.
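
The warm/archive split described above could be sketched roughly as follows. This is only an illustration of the idea, not Optimize's implementation: the index name patterns, the use of a definition key, and the helper names (`warm_index`, `archive_index`, `write_target`, `read_targets`, `split_for_archiving`) are all assumptions made for the example.

```python
from typing import Dict, List, Tuple

# Hypothetical index name patterns, for illustration only.
WARM_PATTERN = "process-instance-{key}"
ARCHIVE_PATTERN = "process-instance-archive-{key}"


def warm_index(definition_key: str) -> str:
    """Name of the writable index that holds running instances."""
    return WARM_PATTERN.format(key=definition_key)


def archive_index(definition_key: str) -> str:
    """Name of the read-only index that holds completed instances."""
    return ARCHIVE_PATTERN.format(key=definition_key)


def write_target(definition_key: str) -> str:
    """Writes only ever go to the warm index."""
    return warm_index(definition_key)


def read_targets(definition_key: str) -> List[str]:
    """Report evaluation queries both the warm and the archive index."""
    return [warm_index(definition_key), archive_index(definition_key)]


def split_for_archiving(instances: List[Dict]) -> Tuple[List[Dict], List[Dict]]:
    """Split instances into (still running, ready to archive).

    Here an instance counts as completed as soon as it has an end date;
    a real check would likely need to be stricter.
    """
    running = [i for i in instances if i.get("endDate") is None]
    completed = [i for i in instances if i.get("endDate") is not None]
    return running, completed
```

With this layout, a periodic archiving job would move the second list returned by `split_for_archiving` into the archive index and delete those documents from the warm index, so the warm index stays small.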

What problem would the feature solve?

The environment.yaml's one-size-fits-all approach is inappropriate for real-world indices of varying size, and requires a complex manual effort that adds friction to the Optimize version upgrade process.
Elastic recommends splitting indices into shards with ~50GB each, but indices vary anywhere from a few MB (1 shard) to 3.4TB (70 shards).
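
To make the sizing arithmetic concrete, here is a minimal sketch that derives a shard count from an index size and the ~50GB per shard guideline mentioned above. The function name and the choice to always return at least one shard are assumptions for the example.

```python
import math

# Guideline from the ticket: roughly 50GB per shard.
RECOMMENDED_SHARD_SIZE_GB = 50


def recommended_shard_count(index_size_gb: float,
                            shard_size_gb: float = RECOMMENDED_SHARD_SIZE_GB) -> int:
    """Return the number of shards needed to keep each shard at or below the target size."""
    if index_size_gb < 0 or shard_size_gb <= 0:
        raise ValueError("sizes must be non-negative and the shard size must be positive")
    # Every index needs at least one shard, however small it is.
    return max(1, math.ceil(index_size_gb / shard_size_gb))
```

For an index of a few MB this yields 1 shard, and for 3.4TB (taken as 3.4 * 1024 GB) it yields 70 shards, which matches the range quoted above. A single fixed shard count cannot fit both ends of that range.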

Hint

- Configuration per process definition key is only a partial solution, as manual evaluation is still necessary
- Elastic offers index lifecycle management (ILM), which we might be able to leverage for automatic index scaling: https://www.elastic.co/guide/en/elasticsearch/reference/7.10/index-lifecycle-management.html
- The archive indices could be rolled over based on size, in line with the ES recommendation
- The "warm" index could still have a configurable shard count as it does now, to allow for the different needs of different customers
- We might be able to identify completed instances based on the timestamps/positions of the mediator
  - i.e. an instance is completed if it has an end date and the latest imported timestamp/position is more recent than the timestamp/position at which the end date was written (note that we don't know the position on the Zeebe instance data)
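
Two of the hints above can be sketched in code. The first function builds a request body for an ILM policy that rolls an index over at 50GB, using the `rollover` action with `max_size` as documented for Elasticsearch 7.10. The second expresses the completion check from the last hint. The record fields (`end_date`, `end_date_written_at`) and function names are hypothetical, and how the mediator's latest timestamp would be obtained is left open.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


def archive_rollover_policy(max_size: str = "50gb") -> dict:
    """Body for an ILM policy (PUT _ilm/policy/<name>) that rolls over by index size."""
    return {
        "policy": {
            "phases": {
                "hot": {
                    "actions": {
                        "rollover": {"max_size": max_size}
                    }
                }
            }
        }
    }


@dataclass
class InstanceRecord:
    # Hypothetical view of an instance; field names are illustrative.
    end_date: Optional[datetime]             # set once the instance has ended
    end_date_written_at: Optional[datetime]  # import timestamp at which the end date was written


def is_safe_to_archive(instance: InstanceRecord,
                       latest_import_timestamp: datetime) -> bool:
    """True if the instance has ended and the import has moved past that point."""
    if instance.end_date is None or instance.end_date_written_at is None:
        return False
    # Strictly newer: the mediator has imported data from after the end date was written.
    return latest_import_timestamp > instance.end_date_written_at
```

The strict comparison is a deliberate choice in this sketch: if the mediator is still at the same timestamp, further updates for the instance might arrive in the same import batch.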


ilm-free.png
5 kB
03/Dec/21 9:28 AM

1.	Store timestamp information on Zeebe import mediators	Done	Unassigned
2.	Create archive indices for process instance indices	Done	Unassigned
3.	Archive instances that will no longer be updated	Open	Unassigned
4.	Rollover process instance archive indices	Open	Unassigned
5.	Update the shard count configuration option	Done	Unassigned
6.	Document the changes behind Optimize's instance index management	Open	Unassigned
7.	Make external variable ingestion work for archived process instances	Open	Unassigned
8.	Make history cleanup work for archived process instances	Open	Unassigned
9.	Remove TimestampBasedIngestedDataImportIndexHandler hierarchy	Done	Unassigned
10.	Allow interval for archiving of instances to be configurable	Done	Unassigned

Ingo Richtsmeier added a comment - 20/Jan/22 11:56 AM

Hi everybody,

At the moment, Allianz doesn't use Optimize. There is an open ticket to show them its capabilities, as Optimize is one piece of the puzzle for reducing the SQL database size.

You can follow the progress on the demo here: https://jira.camunda.com/browse/SUPPORT-12353

Omran Abazeed added a comment - 27/Dec/23 2:30 PM

This ticket was migrated to github: https://github.com/camunda/camunda-optimize/issues/10654. Please use this link for any future references and continue any discussion there.

Camunda Optimize

Optimize can scale up and down infrastructure assets to meet demands

[OPT-5790] Optimize can scale up and down infrastructure assets to meet demands