↔️ GRPO: Set max_model_len when initializing vLLM instance #2728
Conversation
We could do that, but the memory limitation shouldn't come from generation. Or are you perhaps using the same device for training and generation?
Indeed, my use case is specifically running on a single consumer GPU. It might be wishful thinking, but with this patch I am able to run a training loop for a 1.5B model.
Have you tried reducing the vLLM GPU memory utilization?
Yes, in fact that's what led me to this change. Lowering it reduces the space left for the KV cache, and vLLM prints:
So my understanding is that with this change, the smaller KV cache is utilized more efficiently. Are there any downsides to setting this? We could make it opt-in through an arg if you think it could have negative implications.
Now it makes sense. Can you add an arg in the config instead?
Works for me, done.
Perfect
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
I think it would be cleaner to add a `vllm_init_kwargs` arg, something like:

```python
vllm_init_kwargs: Optional[dict] = field(
    default_factory=lambda: {
        "device": "auto",
        "gpu_memory_utilization": 0.9,
    },
    metadata={},
)
```
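(For illustration only: such a dict would presumably be unpacked straight into the vLLM constructor, e.g. `LLM(model=model_id, **training_args.vllm_init_kwargs)`; the `training_args` name here is hypothetical.)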
That sounds good to me, but I'm not sure if it can be done in a backwards-compatible manner (or if it's acceptable to make backwards-incompatible flag changes). @qgallouedec thoughts?
This would be the best solution for single-GPU (poor) training, as people might want to tune other parameters as well, as seen in this thread: https://x.com/robertshaw21/status/1885781591961571455
Now updated with `vllm_init_kwargs`.
@qgallouedec I'd recommend merging this before other PRs that change the vLLM init call, as it most likely covers all their needs (also, merging is hard).
In fact, I'm in favor of explicitly stating the parameters for two main reasons:
As for backwards compatibility, GRPO is a new trainer and the lib is still in alpha, so there's no real need to ensure that. This can be discussed again in the future if this lack of flexibility turns out to be a real problem. But adding parameters one by one should be good for now.
Now reverted to just adding a single new arg.
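For illustration, the single-arg approach could look roughly like the sketch below in the config dataclass; the field name `vllm_max_model_len` and the help text are assumptions for this sketch, not necessarily what was merged:

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class GRPOConfig:  # trimmed to the one field relevant to this sketch
    vllm_max_model_len: Optional[int] = field(
        default=None,
        metadata={
            "help": "If set, forwarded as `max_model_len` when initializing the vLLM "
            "instance; otherwise vLLM falls back to the model's full context length."
        },
    )
```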
Thanks, merging when CI is green :)
What does this PR do?
By default, the vLLM model will be set up to support the full context length of the base model.
However, during training we know we will observe at most `max_prompt_length + max_completion_length` tokens, so we can pass that as the maximum model length to reduce the memory footprint.
This is especially relevant when running on limited hardware, where the memory savings can be significant.
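A minimal sketch of the idea, assuming a vLLM `LLM` instance is created directly (the model name and numbers below are placeholders, not values from this PR):

```python
from vllm import LLM

# Hypothetical training settings; in GRPO these come from the trainer config.
max_prompt_length = 256
max_completion_length = 256

llm = LLM(
    model="Qwen/Qwen2.5-1.5B-Instruct",  # placeholder model id
    gpu_memory_utilization=0.7,          # leave headroom for the training process
    # Cap the context to what training can actually produce, so the KV cache
    # is sized for max_prompt_length + max_completion_length instead of the
    # model's full context window.
    max_model_len=max_prompt_length + max_completion_length,
)
```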
Before submitting
- [ ] Did you read the contributor guideline, Pull Request section?
- [ ] Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
- [ ] Did you make sure to update the documentation with your changes? Here are the documentation guidelines.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.