Agent will get stuck after a few iterations if used in a loop #582

Open
Bilokin opened this issue Feb 10, 2025 · 5 comments
Labels
bug Something isn't working

Comments

@Bilokin
Contributor

Bilokin commented Feb 10, 2025

Hello,

I am trying to run an agent in a for loop, and after just a few iterations it gets stuck for no reason if I use TransformersModel with CodeAgent running on a GPU.
I observe this behavior on Linux cloud and on my private Windows PC for any version of smolagents.

Minimal code to reproduce the error

from smolagents import TransformersModel, CodeAgent, __version__
print(__version__)
model = TransformersModel("meta-llama/Llama-3.2-1B-Instruct", torch_dtype='auto', device_map='auto', max_new_tokens=32000)
agent = CodeAgent(tools=[], model=model, additional_authorized_imports=['numpy'])
prompt = "What is the value of $({i}+{i})^-{i}$? Write a valid python code block after ```"
for i in range(50, 101):
    agent.run(prompt.format(i=i))

The Llama 1B model fits nicely on my 8GB GPU and there is a lot of free VRAM left. After a few failed steps of some iteration, the agent gets stuck at the response generation stage. On my Windows PC I see that the GPU memory controller load increases dramatically over time while no output is generated, so perhaps this is a VRAM-related issue.

I can run the same LLM without smolagents, using transformers directly in a simple loop, and it successfully finishes all iterations.
I can also run a LiteLLMModel with smolagents, connected to an Ollama server on my PC, and it runs fine.
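
For reference, the direct-transformers check looked roughly like this (a minimal sketch with assumed generation settings, not the exact script):

from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-1B-Instruct")
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.2-1B-Instruct", torch_dtype="auto", device_map="auto"
)

for i in range(50, 101):
    messages = [{"role": "user", "content": f"What is the value of ({i}+{i})^-{i}?"}]
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    outputs = model.generate(inputs, max_new_tokens=256)
    # Decode only the newly generated tokens
    print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))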

Could you please take a look at it?

Package versions:
transformers 4.47.0
smolagents 1.8.0 or main

@Bilokin Bilokin added the bug Something isn't working label Feb 10, 2025
@sysradium
Contributor

I have a feeling it is because it does not get the final answer. So the problem is somewhere here:

while final_answer is None and self.step_number <= self.max_steps:

Check the output and see if it returns final_answer(...) as stated in the prompt:

final_answer("YOUR FINAL ANSWER HERE")

The model you have is very small, so it might not be doing that.
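
For reference (purely illustrative, not output from this model), a step that terminates the loop would contain a code block along these lines:

result = (2 + 2) ** -2
final_answer(result)  # calling final_answer is what lets the while loop above exit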

@g-eoj
Contributor

g-eoj commented Feb 10, 2025

Will it use up all tokens eventually and return due to max_new_tokens=32000, or is it a full-on hang? I have encountered a similar issue where the model never outputs the expected stop sequence and generates until it exhausts max_tokens.

A quick way to test this is to reduce max_new_tokens and check whether the stuck step returns (it may still fail, but that is not the point; you want to see if it returns anything at all).
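
For instance, rebuilding the model from the repro above with a much smaller budget (a sketch; 512 is an arbitrary value, just small enough that an exhausted step returns quickly):

model = TransformersModel(
    "meta-llama/Llama-3.2-1B-Instruct",
    torch_dtype="auto",
    device_map="auto",
    max_new_tokens=512,  # small budget: a "stuck" step now runs out of tokens and returns
)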

@Bilokin Bilokin changed the title [BUG] Agent will get stuck after a few iterations if used in a loop Agent will get stuck after a few iterations if used in a loop Feb 10, 2025
@Bilokin
Contributor Author

Bilokin commented Feb 10, 2025

Thanks, now I think the issue is connected to max_new_tokens rather than the loop.
I used this high token limit in that minimal working example because in my full code I try thinking models, like DeepSeek, that burn through the tokens really fast.
Smaller max_new_tokens values prevent the agent from getting stuck on one step because the output is truncated as advertised.

I removed the [BUG] tag, because this is perhaps just my misuse of the agents. I will investigate this further.

@g-eoj
Contributor

g-eoj commented Feb 10, 2025

Since this sounds a lot like what I encountered, here are more details of what happened in my case, which could explain what happened here as well.

  • The model correctly produces python code that matches the expected regex pattern:
    pattern = r"```(?:py|python)?\n(.*?)\n```"
  • After the closing triple backtick of the python code, the model is supposed to output <end_code>, which signals generation to stop. This doesn't happen, and the model continues generating.
  • After the model exhausts all tokens (which can take a while), the regex
    pattern = r"```(?:py|python)?\n(.*?)\n```"
    still matches the final output, so the agent executes the step without error.

In summary, the model generates valid python code but doesn't produce a stop sequence. I don't think this is a bug either, since the model isn't following the instruction it receives to produce a stop sequence, but it is wasteful of compute resources.
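
A small demo of that last point (the output text is invented, and re.DOTALL is assumed so the pattern can span multiple lines):

import re

pattern = r"```(?:py|python)?\n(.*?)\n```"

# Hypothetical model output: a correct code block followed by extra rambling
# because <end_code> was never emitted.
output = (
    "Thought: I will compute the value.\n"
    "```python\n"
    "result = (2 + 2) ** -2\n"
    "final_answer(result)\n"
    "```\n"
    "Now let me also consider ... (generation keeps going until tokens run out)"
)

match = re.search(pattern, output, re.DOTALL)
print(match.group(1))  # the code block is still extracted, so the step runs without error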

The only sure-fire way I've found to prevent the issue is to use a logits processor to force the model to produce structured output that ends with <end_code>.
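
A lighter-weight variant of the same idea (a stopping-criteria check rather than a logits processor; this is only a sketch of the concept, and the class name is made up for illustration):

from transformers import StoppingCriteria, StoppingCriteriaList

class StopOnEndCode(StoppingCriteria):
    """Stop generation once the decoded continuation contains "<end_code>"."""
    def __init__(self, tokenizer, prompt_length, stop_string="<end_code>"):
        self.tokenizer = tokenizer
        self.prompt_length = prompt_length  # number of prompt tokens to skip when decoding
        self.stop_string = stop_string

    def __call__(self, input_ids, scores, **kwargs):
        generated = self.tokenizer.decode(input_ids[0, self.prompt_length:])
        return self.stop_string in generated

# Hypothetical usage with a plain transformers generate() call:
# outputs = model.generate(
#     inputs,
#     max_new_tokens=1024,
#     stopping_criteria=StoppingCriteriaList([StopOnEndCode(tokenizer, inputs.shape[-1])]),
# )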

@sysradium
Contributor

@g-eoj yeah, that's what I have assumed is happening. :/
