services/horizon: Remove --parallel-job-size config parameter used for reingestion. #5484

urvisavla · 2024-10-03T07:51:11Z

PR Checklist

PR Structure

This PR has reasonably narrow scope (if not, break it down into smaller PRs).
This PR avoids mixing refactoring changes with feature changes (split into two PRs
otherwise).
This PR's title starts with name of package that is most changed in the PR, ex.
services/friendbot, or all or doc if the changes are broad or impact many
packages.

Thoroughness

This PR adds tests for the most critical parts of the new functionality or fixes.
I've updated any docs (developer docs, .md
files, etc... affected by this change). Take a look in the docs folder for a given service,
like this one.

Release planning

I've reviewed the changes in this PR and if I consider them worthwhile for being mentioned on release notes then I have updated the relevant CHANGELOG.md within the component folder structure. For example, if I changed horizon, then I updated (services/horizon/CHANGELOG.md. I add a new line item describing the change and reference to this PR. If I don't update a CHANGELOG, I acknowledge this PR's change may not be mentioned in future release notes.
I've decided if this PR requires a new major/minor version according to
semver, or if it's mainly a patch change. The PR is targeted at the next
release branch if it's not a patch change.

What

Removed the --parallel-job-size config parameter.
buffer_size parameter is capped to the job/range size.

Why

Fixes #5468

Known limitations

This could potentially disrupt any automation scripts using --parallel-job-size parameter, although it's unlikely since reingestion is generally run as a batch job on as needed basis.

services/horizon/cmd/db.go

tamirms · 2024-10-04T11:12:48Z

ingest/ledgerbackend/buffered_storage_backend_test.go

+	assert.Eventually(t, func() bool { return len(ledgerBuffer.ledgerQueue) == 15 }, time.Second*1, time.Millisecond*50)
+	assert.NoError(t, err)
+
+	for i := uint32(0); i < endLedger; i++ {


why does i start at 0 instead of startLedger?

tamirms · 2024-10-04T11:17:49Z

services/horizon/internal/ingest/parallel.go

+func calculateParallelLedgerBatchSize(rangeSize uint32, workerCount uint) uint32 {
+	// let's try to make use of all the workers
+	batchSize := rangeSize / uint32(workerCount)
+
 	// Use a minimum batch size to make it worth it in terms of overhead


should there be a maximum batch size as well? I remember you mentioned that it could be helpful to have a maximum batch size because the ledger ingestion time varies based on whether the ledger is from recent history vs very old history. So, in the scenario where you want to reingest full history with, for example, 4 workers, the workers which handle the first half of history will finish first and be idle for a long time.

it seems like adding max batch size here would be trying to address a larger concern of maximizing worker pool throughput(minimizing idle workers) which seems like more scope than this function is meant for, it's a good point on performance, wondering if it warrants a separate feature ticket to investigate how constant worker throughput could be accomplished?

one wild thought, workers could be interruptable, so that an idle worker could interrupt one that is running, and take some of it's upper range, the worker would stop it's captive core when it receives lcm for it's adjusted 'to' range in any case.

urvisavla force-pushed the 5468/remove-paralleljobsize branch from 70cd0ae to 6c9486e Compare October 3, 2024 07:51

Remove --parallel-job-size config parameter used for reingestion.

be2f5b4

urvisavla force-pushed the 5468/remove-paralleljobsize branch from 6c9486e to be2f5b4 Compare October 3, 2024 21:25

Update changelog. Add unit test

bcab2c8

urvisavla marked this pull request as ready for review October 3, 2024 23:47

urvisavla commented Oct 3, 2024

View reviewed changes

services/horizon/cmd/db.go Show resolved Hide resolved

tamirms reviewed Oct 4, 2024

View reviewed changes

update unit test

8de9e34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

services/horizon: Remove --parallel-job-size config parameter used for reingestion. #5484

services/horizon: Remove --parallel-job-size config parameter used for reingestion. #5484

urvisavla commented Oct 3, 2024 •

edited

Loading

tamirms Oct 4, 2024

tamirms Oct 4, 2024 •

edited

Loading

sreuland Oct 4, 2024

services/horizon: Remove --parallel-job-size config parameter used for reingestion. #5484

Are you sure you want to change the base?

services/horizon: Remove --parallel-job-size config parameter used for reingestion. #5484

Conversation

urvisavla commented Oct 3, 2024 • edited Loading

PR Structure

Thoroughness

Release planning

What

Why

Known limitations

tamirms Oct 4, 2024

Choose a reason for hiding this comment

tamirms Oct 4, 2024 • edited Loading

Choose a reason for hiding this comment

sreuland Oct 4, 2024

Choose a reason for hiding this comment

urvisavla commented Oct 3, 2024 •

edited

Loading

tamirms Oct 4, 2024 •

edited

Loading