Limit the compute job thread pool size #6624
base: dev
Conversation
The router has always observed the `APOLLO_ROUTER_NUM_CORES` environment variable to restrict the size of the main tokio async job scheduler. We are now enhancing the compute job thread pool to respect this environment variable as well, restricting the number of threads in the pool.

If the environment variable is not set, the size of the pool is computed as a fraction of the total number of cores that the router has determined are available. If it is set, the environment variable is taken as the number of available cores. From this number of available cores, the router sizes the compute job thread pool according to the following table:

| available | pool size |
|-----------|-----------|
| 1         | 1         |
| 2         | 1         |
| 3         | 2         |
| 4         | 3         |
| 5         | 4         |
| ...       | ...       |
| 8         | 7         |
| 9         | 7         |
| ...       | ...       |
| 16        | 14        |
| 17        | 14        |
| ...       | ...       |
| 32        | 28        |

This table is provided for informational purposes only and should not be relied upon as an explicit interface, since it may change in the future.
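For illustration, here is a minimal Rust sketch of sizing logic consistent with the table above. The 7/8 ratio is inferred from the table values, and the function names (`available_cores`, `compute_pool_size`) are hypothetical; this is not the router's actual implementation.

```rust
use std::env;
use std::thread;

/// Determine the number of "available" cores: the APOLLO_ROUTER_NUM_CORES
/// override if it is set and parses, otherwise the parallelism the host
/// reports. (Hypothetical helper, for illustration only.)
fn available_cores() -> usize {
    env::var("APOLLO_ROUTER_NUM_CORES")
        .ok()
        .and_then(|v| v.parse().ok())
        .unwrap_or_else(|| {
            thread::available_parallelism().map(|n| n.get()).unwrap_or(1)
        })
}

/// Size the compute job thread pool from the available core count.
/// The table above is consistent with floor(available * 7/8), with a
/// floor of one thread; the 7/8 ratio is an assumption inferred from
/// the table, not a documented constant.
fn compute_pool_size(available: usize) -> usize {
    std::cmp::max(1, available * 7 / 8)
}

fn main() {
    let available = available_cores();
    println!(
        "available: {available} pool size: {}",
        compute_pool_size(available)
    );
}
```

Running this with `APOLLO_ROUTER_NUM_CORES=16` would print `available: 16 pool size: 14`, matching the table.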
I am going to try backporting it to 1.x so I can run some more perf tests.
@Mergifyio backport 1.x

🟠 Waiting for conditions to match
Any chance you would be willing to incorporate the spirit of #6663 into this PR and log the effective pool size and queue size for diagnostics/info? As a customer, I'd really appreciate having this available and being able to see the reality of these fairly important structures, based on the number of cores dialed in for the router, e.g. in Kubernetes deployments.