Q: How can start be bigger then total count?
22:35:31.694 [ElasticsearchImportJobExecutor-pool-thread-2] DEBUG o.c.o.s.es.writer.ImportIndexWriter - Writing all entities based import index type [variable-process-instance-tracking] to elasticsearch. Starting from [1942635] and having a max entity count [4920]
A: maxEntityCount is written on init from ES, or while calculating progress (Should not happen probably), or while resetting index. import progress is independent of this mechanic, as it relies on runtime query against PI ids
Q: why is fetch size 1000 if page size in scroll is 2M? And why scroll returns 2M ids in the first place?
22:52:15.514 [main] DEBUG o.c.o.s.e.i.i.h.i.VariableInstanceImportIndexHandler - Scroll search query got [2000000] results
22:52:15.606 [EngineImportJobExecutor-pool-thread-1] DEBUG o.c.o.s.e.i.f.i.ActivityInstanceFetcher - Fetched [10000] historic activity instances within [189] ms
22:52:16.257 [EngineImportJobExecutor-pool-thread-1] DEBUG o.c.o.s.e.i.f.i.FinishedProcessInstanceFetcher - Fetched [1000] historic process instances within [94] ms
22:52:16.309 [EngineImportJobExecutor-pool-thread-1] DEBUG o.c.o.s.e.i.f.i.ProcessDefinitionFetcher - Fetched [1] process definitions within [39] ms
22:52:16.350 [EngineImportJobExecutor-pool-thread-1] DEBUG o.c.o.s.e.i.f.i.ProcessDefinitionFetcher - Fetched [1] process definitions within [40] ms
22:52:16.409 [EngineImportJobExecutor-pool-thread-1] DEBUG o.c.o.s.e.i.f.i.ProcessDefinitionXmlFetcher - Fetched [1] process definition xmls within [59] ms
22:52:16.410 [EngineImportJobExecutor-pool-thread-1] DEBUG o.c.o.s.e.i.f.i.VariableInstanceFetcher - fetching variables for [1000] PIs
A: debug statement used totalHitCount of the scroll instead of actual hits length. Which leads to ambiguous statements in log file.
see https://hq2.camunda.com/jenkins/optimize/view/All/job/Performance%20Large%20Dataset/